links for 2011-03-28

  • HTSQL is a URI-based high-level query language for relational databases. HTSQL wraps your database with a web service layer, translating HTTP requests into SQL and returning results as HTML, JSON, etc.

    HTSQL is designed for someone who is not a SQL expert, but needs a usable, comprehensive query tool for data access and reporting.

  • jStat is a statistical library written in JavaScript that allows you to perform advanced statistical operations without the need of a dedicated statistical language (i.e. MATLAB or R).
  • Media Converter. Simple but advanced converting for Mac OS X

    Convert almost every input file.

    Since Media Converter uses ffmpeg a lot of file formats are supported. Convert avi, wmv, mkv, rm, mov and more to other formats.

    Some files aren't supported by ffmpeg, but can be decoded with QuickTime®. Media Converter uses movtowav and movtoy4m to decode them.

    Convert to a lot of formats.

    Media Converter comes with presets to convert to popular video and audio formats. These presets can be fine-tuned in the Preferences.

    You also can create your own presets in the Preferences. Send them to us to share them with the world. A bit too Advance for you? No problem, this site contains presets.

    Smart converting thanks to its father.

    Media Converter shares a lot of internals with Burn (disc burning application). This way both can be improved based on user experiences.

  • The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page.

    The library already provides specific strategies for common tasks (for example: news article extraction) and may also be easily extended for individual problem settings.

    Extracting content is very fast (milliseconds), just needs the input document (no global or site-level information required) and is usually quite accurate.

    Boilerpipe is a Java library written by Christian Kohlschütter. It is released under the Apache License 2.0.

    The algorithms used by the library are based on (and extending) some concepts of the paper "Boilerplate Detection using Shallow Text Features" by Christian Kohlschütter et al., presented at WSDM 2010 — The Third ACM International Conference on Web Search and Data Mining New York City, NY USA.

  • Ruby port of Protovis library According to the protovis site

    Protovis composes custom views of data with simple marks such as bars and dots. Unlike low-level graphics libraries that quickly become tedious for visualization, Protovis defines marks through dynamic properties that encode data, allowing inheritance, scales and layouts to simplify construction.

  • A collection of the best open data sets and open-source tools for data science