Skip to content

Highlights

  • Pro

Organizations

@ESIPFed

Popular repositories

  1. Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

    Python 846 185

  2. Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.

    Python 84 48

  3. ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to ext…

    Java 84 39

  4. Nutch-Python is a Python binding to the Apache Nutch™ REST services allowing Nutch to be called natively in the Python community. — Edit

    Python 32 21

  5. Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.

    Java 29 19

  6. This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.

    Python 15 30

360 contributions in the last year

Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Mon Wed Fri

Contribution activity

July 1, 2020

chrismattmann has no activity yet for this period.

Seeing something unexpected? Take a look at the GitHub profile guide.

You can’t perform that action at this time.