-
Notifications
You must be signed in to change notification settings - Fork 37
Persister API
The following are the essential elements of the API.
Method Persister4TaggerAPI.generateCSVFromLastestJSON() - generate CSV for latest Json
Method Persister4TaggerAPI.generateTweetsIDSCSVFromAllJSON() - generate Tweets IDs CSV From All JSON
Method Persister4TaggerAPI.generateJSONFromLastestJSON() - generate JSON From Lastest JSON
Method Persister4TaggerAPI.generateTweetsIDSJSONFromAllJSON() - generate Tweets IDs JSON From All JSON
Method Persister4TaggerAPI.generateCSVFromLastestJSONFiltered() - generate CSV From Lastest JSON Filtered
Method Persister4TaggerAPI.generateTweetsIDSCSVFromAllJSONFiltered() - generate Tweets IDs CSV From All JSON Filtered
Method Persister4TaggerAPI.generateJSONFromLastestJSONFiltered() - generate JSON From Lastest JSON Filtered
Method Persister4TaggerAPI.generateTweetsIDSJSONFromAllJSONFiltered() - generate Tweets IDs JSON From All JSON Filtered
Method Persister4Collector.generateCSVFromLastestJSON() - generate CSV file
Method Persister4Collector.generateJSONFromLastestJSON() - generate Json file
Method Persister4Collector.generateTweetsIDSCSVFromAllJSON() - generate Json file for tweet Ids
Method Persister4Collector.generateTweetsIDSJSONFromAllJSON() - generate Json file for tweet Ids
The following are additional elements of the API and/or obsolete documentation prior to 2015-04-28. These should be integrated into the documentation above or moved to the Javadoc-based documentation.
Base URI: http://localhost:port/aidr-persister/webresources
GET .../persister/start?file="file_location"&collectionCode=XXX
file
parameter represents the location where persister should persist the content of collection.
collectionCode
represents the collectionCode for which the persister should start persistance.
GET .../persister/stop?collectionCode
collectionCode
represents the code of collection for which the persistance should be stopped.
GET .../persister/genCSV?collectionCode=XXX&exportLimit=34234
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
Note: this export service uses full tweet content. Use the next service if only tweet-ids needed to be exported.
GET .../persister/genJson?collectionCode=XXX&exportLimit=34234&jsonType=JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
jsonType
instructs `aidr-persister to create one JSON array of all tweets to be included.
GET .../persister/genJson?collectionCode=XXX&exportLimit=34234&jsonType=TEXT_JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
jsonType
instructs `aidr-persister to create a text file with one JSON object per line.
GET .../persister/genTweetIds?collectionCode=XXX
collectionCode
represents the code of collection for which a CSV file should be generated.
GET .../persister/genJsonTweetIds?collectionCode=XXX&jsonType=JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
jsonType
instructs `aidr-persister to create one JSON array of all tweet IDs to be included.
GET .../persister/genJsonTweetIds?collectionCode=XXX&jsonType=TEXT_JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
jsonType
instructs `aidr-persister to create a text file with one JSON object per line.
For each of the above persister download REST APIs, replacing /persister/...
by /taggerPersister/...
will generate a downloadable file in the corresponding format but only for classified tweets, i.e., with AIDR classification data appended to each classified tweet.
POST .../taggerPersister/filter/genCSV?collectionCode=XXX&exportLimit=34234
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
POST .../taggerPersister/filter/genCSV?collectionCode=XXX&exportLimit=34234
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
POST request payload example:
Request Headers: Content-Type: application/json
Accept
: application/json
Note: If no filtering required, then the payload will be as follows:
{ "constraints": [] }
Otherwise, if filtering required, then:
{ "constraints": [ { "queryType": "date_query", "comparator": "is_before", "timestamp": 1495339860 }, { "queryType": "date_query", "comparator": "is_after", "timestamp": 1272339860 }, { "queryType": "classifier_query", "classifier_code": "informative_pray_personal", "label_code": "praying", "comparator": "is", "min_confidence": 0.8 }, { "queryType": "classifier_query", "classifier_code": "informative_pray_personal", "label_code": "030_info", "comparator": "is_not" }, { "queryType": "classifier_query", "classifier_code": "informative_pray_personal", "label_code": null, "comparator": "has_confidence", "min_confidence": 0.5 } ] }
Note: Only those documents that satisfy ALL the constraints are returned.
Parameter details:
-
queryType
: indicates type of query. Currently can take only two values -"date_query"
and"classifier_query"
. -
comparator
: query predicate evaluation criterion. Fordate_query
it can be eitheris_after
oris_before
depending on whether to filter documents that occurred after or before the specifiedtimestamp
value, respectively. Forclassifier_query
it can be eitheris
oris_not
orhas_confidence
. -
timestamp
: unix time specified indate_query
as Java typelong
. -
classifier_code
: corresponds to theattribute_code
. -
label_code
: corresponds to a valid label_code for a given classifier_code. -
min_confidence
: Include only those documents for which theconfidence
of the specifiedclassifier_code
is greater thanmin_confidence
. Is Javafloat
type.
POST .../taggerPersister/filter/genJson?collectionCode=XXX&exportLimit=34234&jsonType=JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
jsonType
instructs `aidr-persister to create one JSON array of all tweet IDs to be included.
For the POST request payload details, refer to the POST .../taggerPersister/filter/genCSV
documentation above.
POST .../taggerPersister/filter/genJson?collectionCode=XXX&exportLimit=34234&jsonType=TEXT_JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
exportLimit
represents limit of tweets to be exported. Datatype of this parameter is number.
jsonType
instructs `aidr-persister to create a text file with one JSON object per line.
For the POST request payload details, refer to the POST .../taggerPersister/filter/genCSV
documentation above.
POST .../taggerPersister/filter/genTweetIds?collectionCode=XXX
collectionCode
represents the code of collection for which a CSV file should be generated.
For the POST request payload details, refer to the POST .../taggerPersister/filter/genCSV
documentation above.
POST .../taggerPersister/filter/genJsonTweetIds?collectionCode=XXX&jsonType=JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
jsonType
instructs `aidr-persister to create one JSON array of all tweet IDs to be included.
For the POST request payload details, refer to the POST .../taggerPersister/filter/genCSV
documentation above.
POST .../taggerPersister/filter/genJsonTweetIds?collectionCode=XXX&jsonType=TEXT_JSON
collectionCode
represents the code of collection for which a CSV file should be generated.
jsonType
instructs `aidr-persister to create a text file with one JSON object per line.
For the POST request payload details, refer to the POST .../taggerPersister/filter/genCSV
documentation above.
- Home
- [What is AIDR?](AIDR Overview)
- The science behind AIDR
- [Operator's manual](AIDR Operator's Manual)
- [Public API documentation](API documentation)
- High-level overview
- Common
- DB Manager
- DTO standards
- Database schema
- Manager
- Manager API
- Collector
- Collector API
- Reconnect strategy
- Collector Tester
- Output
- Output API
- Output Buffered
- Output Streaming
- Output Tester
- Persister
- Persister API
- Persister Tester
- Tagger
- Tagger Concepts
- Tagger API
- Tagger Tester
- Trainer
- Trainer API
- PyBossa Trainer