-
Recent Posts
Recent Comments
Archives
- August 2019
- May 2018
- December 2016
- March 2015
- August 2014
- November 2013
- October 2013
- September 2013
- July 2013
- April 2013
- March 2013
- February 2013
- December 2012
- November 2012
- July 2012
- April 2012
- February 2012
- November 2011
- October 2011
- September 2011
- August 2011
- July 2011
- June 2011
- May 2011
- April 2011
- March 2011
- February 2011
Categories
Meta
Tag Archives: bsfl
The Generic SSE2 Loop
In response to a couple of comments on my post about find-first-bit-set in SSE2 registers, amounting to “what use is a routine that only does 16-byte bitvecs”, I thought I’d post the canonic, generic loop through memory using SSE2 ops. … Continue reading
SSE2 bit trick: ffs/fls for XMM registers
For the full “C” code that uses this idea for an arbitrary-length byte vector, see this later blog post In a discussion about all the wonderful uses of the combination movemask(pcmpxx(a,b)), it occurred to me that this gives you a … Continue reading