Html Content / Article Extractor in Java open sourced from Gravity Labs – GitHub – Project Goose is an article extractor written in Java. Its goal is to take a webpage, perform calculations, and extract the main text of the article, as well as recommend which image on the page is most likely to be relevant.
June 8th: the day your phone won’t stop ringing – Users with broken IPv6 connectivity will experience long delays connecting to major public web sites. Their workstations will try to reach the content over IPv6 first and will have to wait for a TCP-level timeout before retrying the same content over IPv4.
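The delay described above comes from trying addresses strictly in sequence. A minimal sketch of that behavior, assuming a client that simply prefers IPv6 and waits out each failure before moving on (the names `order_ipv6_first` and `connect_with_fallback` and the injectable `connector` parameter are hypothetical, introduced here only for illustration):

```python
import socket


def order_ipv6_first(addrinfos):
    """Return getaddrinfo-style tuples with IPv6 entries ahead of IPv4 ones."""
    # sorted() is stable, so the original order is kept within each family.
    return sorted(addrinfos, key=lambda info: info[0] != socket.AF_INET6)


def connect_with_fallback(addrinfos, timeout=3.0, connector=socket.create_connection):
    """Try each address in turn; return (family, connection) for the first success.

    A broken IPv6 path costs up to `timeout` seconds before IPv4 is attempted,
    which is the user-visible delay the article warns about.
    """
    last_error = None
    for family, _type, _proto, _canon, sockaddr in order_ipv6_first(addrinfos):
        try:
            # sockaddr[:2] is (host, port) for both IPv4 and IPv6 entries.
            return family, connector(sockaddr[:2], timeout)
        except OSError as exc:  # includes socket.timeout
            last_error = exc
    if last_error is None:
        raise OSError("no addresses to try")
    raise last_error
```

In real use, the `addrinfos` argument would come from `socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)`. A shorter timeout reduces the stall but does not remove it.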
CouchApp: Standalone CouchDB Application Development Made Simple – GitHub – CouchApp is designed to structure standalone CouchDB application development for maximum application portability. CouchApp is a set of scripts and a jQuery plugin designed to bring clarity and order to the freedom of CouchDB's document-based approach.
Why there are so many engineers in India – TNW India – Becoming an engineer in India virtually guarantees financial security for life, and this blinds parents into forcing their children to choose engineering. It is a romanticized notion, but I often wonder how many David Beckhams, Quentin Tarantinos and Carlos Santanas are currently pulling off 9-5 shifts for IT firms.
LevelDB – a fast and lightweight key/value database library – LevelDB is a library that implements a fast key-value store. Keys and values are arbitrary byte arrays, data is stored sorted by key, and callers can provide a custom comparison function to override the sort order.
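To make the three properties above concrete, here is a small in-memory sketch of a store with byte-array keys, key-sorted storage, and a caller-supplied comparator. It is not LevelDB's API or implementation; the class `SortedKV` and its methods are hypothetical names used only to illustrate the idea.

```python
import bisect
from functools import cmp_to_key


def bytewise(a: bytes, b: bytes) -> int:
    """Default comparator: plain lexicographic byte order."""
    return (a > b) - (a < b)


class SortedKV:
    """Toy key/value store that keeps entries ordered by a comparator."""

    def __init__(self, comparator=bytewise):
        self._cmp = comparator
        self._wrap = cmp_to_key(comparator)
        self._wrapped = []  # comparator-aware keys, kept sorted for bisect
        self._keys = []     # raw byte keys, parallel to _wrapped
        self._values = []   # values, parallel to _keys

    def _find(self, key: bytes):
        """Return (index, found) for key under the store's ordering."""
        i = bisect.bisect_left(self._wrapped, self._wrap(key))
        found = i < len(self._keys) and self._cmp(self._keys[i], key) == 0
        return i, found

    def put(self, key: bytes, value: bytes) -> None:
        i, found = self._find(key)
        if found:
            self._values[i] = value  # overwrite an existing key
        else:
            self._wrapped.insert(i, self._wrap(key))
            self._keys.insert(i, key)
            self._values.insert(i, value)

    def get(self, key: bytes):
        i, found = self._find(key)
        return self._values[i] if found else None

    def items(self):
        """Iterate over (key, value) pairs in comparator order."""
        return zip(self._keys, self._values)
```

Passing a different comparator, for example one that reverses byte order, changes the iteration order without touching the calling code.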
Three helpful SSH tips for developers – Atlassian Developer Blog – If you're a developer who deploys stuff to Unix systems, then one of the most common tools you interact with is SSH. It never ceases to amaze me, in spite of this, how little developers really know about SSH.