-
Fast, multi-threaded malloc() and nifty performance analysis tools.
-
Protocol Buffers - Google's data interchange format
-
Google C++ Testing Framework
-
Commandline flags module for C++
-
Logging library for C++
-
An extremely memory-efficient hash_map implementation
-
Google C++ Mocking Framework
-
an efficient, principled regular expression library
-
The PCRE library is a set of functions that implement regular expression pattern matching using the same syntax and semantics as Perl 5.
-
jemalloc is a general-purpose scalable concurrent malloc(3) implementation.
-
A clone of the Darts (Double-ARray Trie System)
-
Static Double Array Trie.
-
Double-ARray Trie System. Chinese Document.
-
JSON data format manipulation library
-
The libevent API provides a mechanism to execute a callback function when a specific event occurs on a file descriptor or after a timeout has been reached. Furthermore, libevent also support callbacks due to signals or regular timeouts.
-
A collection of high performance c-string transformations, frequently 2x faster than standard implementations (if they exist at all).
-
Folly is an open-source C++ library developed and used at Facebook.
-
Intel Threading Building Blocks (Intel TBB) is a widely used, award-winning C++ template library for creating reliable, portable, and scalable parallel applications.
-
The Apache Portable Runtime Library provides a predictable and consistent interface to underlying platform-specific implementations, with an API to which software developers may code and be assured of predictable if not identical behavior regardless of the platform on which their software is built, relieving them of the need to code special-case conditions to work around or take advantage of platform-specific deficiencies or features.
-
The Apache Portable Runtime Utility Library provides a predictable and consistent interface to underlying client library interfaces. This API assures predictable if not identical behavior regardless of which libraries are available on a given platform.
-
Breakpad is a set of client and server components which implement a crash-reporting system.
-
The Universal Chart Component.
-
A XML parser library written in C.
-
TinyXML-2 is a simple, small, efficient, C++ XML parser that can be easily integrating into other programs.
-
HDFS c++ interface.
-
FUSE (Filesystem in Userspace) is a simple interface for userspace programs to export a virtual filesystem to the Linux kernel.
-
The Readline library provides a set of functions for use by applications that allow users to edit command lines as they are typed in.
-
Snappy is a compression/decompression library. It does not aim for maximum compression, or compatibility with any other compression library; instead, it aims for very high speeds and reasonable compression.
-
CityHash provides hash functions for strings.
-
IPython provides a rich architecture for interactive computing.
-
The Universal Chart Component.
-
matplotlib is a python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms.
-
Python-bindings for CityHash (http://code.google.com/p/cityhash/).
-
Go is an expressive, concurrent, garbage-collected programming language.
-
Go bindings for libhdfs, for manipulating files on Hadoop distributed file system.
-
The world's most popular open source database
-
MySQL++ is a C++ wrapper for MySQL’s C API.
-
A fast and lightweight key/value database library by Google.
-
MongoDB (from "humongous") is an open source document database, and the leading NoSQL database.
-
Kyoto Cabinet is a library of routines for managing a database. The database is a simple data file containing records, each is a pair of a key and a value.
-
A handy cache/storage server
-
Free & open source, high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load.
-
libMemcached is an open source C/C++ client library and tools for the memcached server.
-
Redis is an open source, advanced key-value store. It is often referred to as a data structure server since keys can contain strings, hashes, lists, sets and sorted sets.
-
SQLite is a software library that implements a self-contained, serverless, zero-configuration, transactional SQL database engine.
-
Mod_python is an Apache module that embeds the Python interpreter within the server.
-
the CGI kit and HTML templating system
-
Powerful but simple template language for C++
-
This is a template engine for C++.
-
HipHop is a source code transformer which transforms PHP source code into highly optimized C++ and then compiles it using g++.
-
An open extension to CGI that provides high performance for all Internet applications without any of the limitations of existing Web server APIs.
-
neon is an HTTP and WebDAV client library, with a C language API.
-
The OpenSSL Project is a collaborative effort to develop a robust, commercial-grade, fully featured, and Open Source toolkit implementing the Secure Sockets Layer (SSL v2/v3) and Transport Layer Security (TLS v1) protocols as well as a full-strength general purpose cryptography library.
-
Protobuf for PHP is an implementation of Google's Protocol Buffers for the PHP language, supporting its binary data serialization and including a
protoc
plugin to generate PHP classes from .proto files.
-
The project includes these modules:
- Hadoop Common: The common utilities that support the other Hadoop modules.
- Hadoop Distributed File System (HDFS™): A distributed file system that provides high-throughput access to application data.
- Hadoop YARN: A framework for job scheduling and cluster resource management.
- Hadoop MapReduce: A YARN-based system for parallel processing of large data sets.
-
ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services.
-
The Apache Thrift software framework, for scalable cross-language services development, combines a software stack with a code generation engine to build services that work efficiently and seamlessly between C++, Java, Python, PHP, Ruby, Erlang, Perl, Haskell, C#, Cocoa, JavaScript, Node.js, Smalltalk, OCaml and Delphi and other languages.
-
With a redesigned fully distributed API, HDFS integration and a wide range of new machine learning toolkits, GraphLab is now faster, more scalable, and more powerful than ever before.
-
MPICH is a high performance and widely portable implementation of the Message Passing Interface (MPI) standard.
-
Dynamic Resource Sharing for Clusters. Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed applications, or frameworks. It can run Hadoop, MPI, Hypertable, Spark (a new framework for low-latency interactive and iterative jobs), and other applications.
-
Our goal is to develop, implement, deploy, and evaluate mechanisms and policies that support High Throughput Computing (HTC) on large collections of distributively owned computing resources.
-
The RAMCloud project is creating a new class of storage, based entirely in DRAM, that is 2-3 orders of magnitude faster than existing storage systems.
-
The Internet Communications Engine (Ice)
-
A C++ implementaton of MapReduce without distributed filesystem
-
GSL - GNU Scientific Library
-
The Apache Mahout machine learning library's goal is to build scalable machine learning libraries.
-
The Vowpal Wabbit (VW) project is a fast out-of-core learning system sponsored by Yahoo! Research.
-
SVMlight is an implementation of Support Vector Machines (SVMs) in C.
-
Support Vector Machine for Ranking.
-
Support Vector Machine for Complex Outputs.
-
A Library for Support Vector Machines by Chih-Chung Chang and Chih-Jen Lin.
-
A Library for Large Linear Classification.
-
RankLib is a library of learning to rank algorithms.
-
Suite of Fast Incremental Algorithms for Machine Learning. Includes methods for learning classification and ranking models, using Pegasos SVM, SGD-SVM, ROMMA, Passive-Aggressive Perceptron, Perceptron with Margins, and Logistic Regression.
-
libLBFGS: a library of Limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS).
-
Orthant-Wise Limited-memory Quasi-Newton Optimizer for L1-regularized Objectives.
-
Maximum Entropy Modeling Toolkit for Python and C++
This package provides a (Conditional) Maximum Entropy Modeling Toolkit for Python and C++.
-
A simple C++ library for maximum entropy classification
This is a simple C++ library for maximum entropy classifiers (also known as "multinomial logistic regression"). The main features of this library are:
- supporting L1/L2 regularization [1,2]
- supporting fast parameter estimation algorithms (LBFGS [3], OWLQN [4], and SGD [5])
- supporting real-valued features
-
A parallel C++ implementation of fast Gibbs sampling of Latent Dirichlet Allocation
-
Yahoo!'s topic modelling framework using Latent Dirichlet Allocation
-
Mr. LDA is a Latent Dirichlet Allocation topic modeling package based on Variational Bayesian learning approach using MapReduce and Hadoop, developed by a Cloud Computing Research Team in University of Maryland, College Park.
-
OpenCV (Open Source Computer Vision) is a library of programming functions for real time computer vision.
-
A simple multithreaded OpenCV example application using the Qt framework.
-
C++ Libraries for Computer Vision
-
The VLFeat open source library implements popular computer vision algorithms including HOG, SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, SLIC superpixels, and quick shift.
-
Mahotas is a set of functions for image processing and computer vision in Python.
-
Caltech Large Scale Image Search Toolbox
This Matlab package implements several algorithms used for large scale image search, eg. kd-tree/lsh/akm/hkm/Hamming-Embedding/min-hash...
-
Object bank representation is a novel image representation for high-level visual tasks, which encodes semantic and spatial information of the objects within an image.
-
Modeling the shape of the scene: a holistic representation of the spatial envelope, IJCV2001.
-
LIBPMK is a C++ implementation of Grauman and Darrell's Pyramid Match algorithm.
-
face recognition via Sparse Representation-based Classification (SRC)
face recognition via Sparse Representation-based Classification (SRC).
-
A collection of tools and libraries to read, write, and manipulate an image in any of the more popular image formats.
-
An OCR Engine that was developed at HP Labs between 1985 and 1995... and now at Google.
-
The Apache LuceneTM project develops open-source search software.
-
The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software.
-
MAchine Learning for LanguagE Toolkit.
-
Japanese morphological analyzer. Chinese Document.
-
an efficient, scalable search engine with a robust query language.
-
The Chromium projects include Chromium and Chromium OS, the open-source projects behind the Google Chrome browser and Google Chrome OS, respectively.
-
Extensions are small software programs that can modify and enhance the functionality of the Chrome browser.
-
Code review.
-
SWIG is a software development tool that connects programs written in C and C++ with a variety of high-level programming languages.
-
Blade is designed to be a modernize building system.
-
Gnuplot is a portable command-line driven graphing utility for Linux, OS/2, MS Windows, OSX, VMS, and many other platforms.
by xueminzhao 3/23/2013