Machine Learning Engineer at Diffbot in Palo Alto, CA
There's a lot of good data on the web, but webpages were designed for human beings to read. We've trained a computer vision robot augmented with NLP to automatically analyze and convert webpages into a semantic format. Our structured data currently powers some of the largest consumer destinations on the web: Digg, Instapaper, StumbleUpon, and AOL and we are on the cusp of converting the entire web into one queryable database.
We're a team of veteran statistical machine learning practitioners, that come from the fields of computer vision, natural language processing, and web search. Our ranks include alumni from the Stanford AI lab, Microsoft, Yahoo!, Powerset (an NLP startup), and various partially-completed PhD programs. One of us led the Stanford AI lab in the DARPA robotics challenge. Another created his own 12B search engine and launched real-time search before Google.
It's like being a member of the X-men, but without the inflated egos. We enjoy learning from each other, achieving technical breakthroughs, and hanging out together. We're in a great location adjacent to Stanford, and walking distance to restaurants and shops. We have a lot of the big-company perks: free food, excellent healthcare, and competitive salaries.
Nothing to be ashamed of, many of us are big-company refugees as well :-) At Diffbot, you'll be able to work on datasets that are just as large, and additionally have a high level of autonomy over your work, as well as take part in architecting the future of the web.
It depends. A good fit for us is someone that not only has the skills, but shares the same vision and sees how they can contribute to it.
You might not find it surprising that many machine learning techniques covered in textbooks and research papers are pedagogical, and don't necessarily work in real-world environments. At Diffbot, you'll be able to hone your real-world modeling skills, impact millions, while gaining valuable startup experience.
We're solving data problems that others have never seen, and so we often develop our own exotic machine learning algorithms when off-the-shelf techniques falter. We've built some unique infrastructure which you'll be able to leverage from day one, but much more remains to be built.
I think a conversation would be worthwhile. How do I get in touch?
Awesome, we would love to chat. Send a note to firstname.lastname@example.org telling us why you're interested in working at Diffbot, along with your CV and if you have it, a link to your homepage or Github. Our CEO will read every message with a note attached. Unfortunately, we cannot read all resumes sent without context. If we also think you'd be a good fit, we'll give you a call right away or ask you to come by in person.