Archive for July, 2008

parallel line-oriented file processing

Thursday, July 31st, 2008

At work, I've been doing a lot of line-oriented file processing, for example, of the tabbed-separated value files produced by the Freebase project (downloads). This is similar in spirit to Tim Bray's 'wide finder' project, and I've leveraged his popularity to find a useful utility created by Preston l. Bannister ...

O(log(N)) array insertion in Ruby

Thursday, July 24th, 2008

>> require 'bdb' >> x = BDB::Btree.open('/tmp/foo.db',nil, 'w+', {'set_bt_compare' => lambda {|a,b| (a.to_i) < => (b.to_i)}}) => # >> (0..9).to_a.sort_by{rand}.each{|i| x[i] = i};true => true >> x.keys.map{|i| i.to_i} => [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]

A few personal notes …

Monday, July 21st, 2008

It's been too long since my last update, but life has been busy, especially with the Microsoft purchase. But a few notes: I enjoyed singing Sacred Harp at the Michiana and Kalamazoo annual singings this weekend, and James Nelson-Gingerich gave me the *first* copy of the print version of 26th edition ...

Thoughts on the Microsoft acquisition

Tuesday, July 1st, 2008

(The usual disclaimers: my opinion only, not my current or future employers) When Powerset began a couple of years ago, a lot of commentators called us -- and still do call us -- a would be Google killer. This, despite repeated comments by senior staff that this wasn't what we were ...

Who acquired Powerset?

Tuesday, July 1st, 2008

microsoft acquires powerset