2015/06/11 version_1_0_9
Tokenize :
- new JSON filter
- streamlined the mbox filter a bit
- pinot-index looks for filters in the same locations as the UI
IndexSearch :
- sort documents by date, in ascending or descending order
2014/07/18 version_1_0_8
General :
- don't install the xdgmime files
- use libnotify if --enable-libnotify=yes is specified
- fixed check on library symbols that could cause a crash, added
support for LLVM-built filters
- general clean up
SQL :
- refactored the DB interface
2014/05/22 version_1_0_7
General :
- compilation fixes
- prefer off_t for file sizes, offsets, buffer lengths
Monitor :
- try and reapply watches on directories that have just been removed
Tokenize :
- refactored encoding conversion
IndexSearch :
- fixed build with libexttextcat
2013/05/26 version_1_0_6
IndexSearch :
- support boost 1.50's Spirit
SQL :
- better handling of potential errors while stepping through results
Collect :
- minor fixes to curl backend
2013/03/03 version_1_0_5
IndexSearch :
- fixes to abstract generation, CJKV tokenization
2013/02/11 version_1_0_4
IndexSearch :
- fix stripping of diacritics
- stem subject terms
2013/01/14 version_1_0_3
IndexSearch :
- fix a Unicode handling issue introduced in 1.01
UI :
- updated French translation by Eliovir
2012/11/04 version_1_0_2
General :
- turn memory pooling off by default to avoid issues with newer boost
UI :
- new Czech translation by Zbyněk Schwarz
- updated Japanese translation by Takafumi Arakaki
- updated Brazilian Portuguese translation by Adriano Steffler
2012/08/27 version_1_0_1
General :
- run rst2html on RST files, if available, and if RST files are detected as such.
See http://code.google.com/p/pinot-search/issues/detail?id=12
Tokenize :
- better mbox parts extraction
IndexSearch :
- dropped unac in favour of own code, resulting in faster indexing.
- pinot-index --override MIMETYPE:EXTENSION overrides MIME type detection
based on file extensions
2012/06/16 version_1_0_0
General :
- install headers and libraries
- prefer default programs that don't support URIs to view local files
Deskbar :
- dropped support for Deskbar since it's now dead
Tokenize :
- better mbox parsing. The internal part numbering scheme has changed;
reindexing email is recommended.
IndexSearch :
- support for LibreOffice's libexttextcat v3.2
- fixed the Google plugin
- renamed the Freshmeat plugin to Freecode
UI :
- updated Simplified Chinese translation by happymeng
- updated German translation by Gena Haltmair
- initial GTK+ 3 port. Enable with "./configure ... --enable-gtkmm3=yes"
2011/11/07 version_0_9_8
Tokenize :
- new exiv2-based filter
- new chmlib-based filter
IndexSearch :
- support for LibreOffice's libexttextcat v3.1, and possibly v3.1.1
- dropped plugins for Yahoo! REST API, Yahoo! BOSS, Google Code Search
and RollYO
UI :
- query results and view history are expired after 6 months
- updated Dutch translation by Martijn Verstrate and Tico
- updated German translation by Fitoschido
- updated Russian translation by Nikolay Kachanov
- updated Spanish translation by pkramerruiz
2011/01/09 version_0_9_7
General :
- replaced custom memory pool class with Boost's
- don't try and map more than 2Gb and use shared mappings
Deskbar :
- install the module where DeskBar > 2.28 expects it to be
IndexSearch :
- remove dots at the end of terms that don't look like acronyms
- index components of acronyms and dot-separated terms on their own
Daemon :
- index files as they are crawled, don't delegate indexing to other
threads, unless PINOT_MAXIMUM_INDEX_THREADS > 1
- fixed checking of symlinks against black-list
UI :
- the maximum number of results returned by the Query field and used
to initialize new stored queries follows PINOT_MAXIMUM_QUERY_RESULTS
- when a spelling suggestion is available, don't show the same revised
query multiple times
- updated Simplified Chinese translation by mike2718
- updated Dutch translation by Dirk Roos
- updated Italian translation by Davide Vidal and Simone Sandri
- updated Japanese translation by Mizuki-san
- updated Brazilian Portuguese translation by feen
- updated Portuguese translation by Almufadado
- updated Russian translation by Alexander Zinin and Nikolay Kachanov
- updated Spanish translation by Juan Miguel Boyero Corral, Matias Fonzo
and Fitoschido
2010/07/12 version_0_9_6
General :
- fixed "GIO can sniff PNG" program used at configure time
- README clarifies that operators should be upper-case
- builds with gmime-2.4 or 2.6
- link with the library that has dlopen() and fix Debian bug #556062
- merged Debian's patch for --as-needed support
- merged FreeBSD build patch
- dropped support for Xesam
SQL :
- fixed prepared statements interface to work with insertion and deletion
- sleep then retry operations if the database is busy
- better transaction support
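The "sleep then retry" behaviour noted in the SQL entries above can be sketched as follows. This is only an illustration, assuming an SQLite-style database that reports a busy or locked condition through an exception; the helper name run_with_retry and its parameters are hypothetical and are not taken from Pinot's code.

```python
import sqlite3
import time


def run_with_retry(operation, retries=5, delay=0.1, sleep=time.sleep):
    """Call operation(); if the database is busy or locked, sleep and retry.

    Hypothetical helper: `retries`, `delay` and `sleep` are illustrative only.
    """
    for attempt in range(retries + 1):
        try:
            return operation()
        except sqlite3.OperationalError as error:
            message = str(error).lower()
            busy = "locked" in message or "busy" in message
            # Give up on unrelated errors, or once the retry budget is spent.
            if not busy or attempt == retries:
                raise
            sleep(delay)
```

A caller would wrap a single statement or a whole transaction in a function and pass it to run_with_retry, so that a short-lived lock held by another process doesn't fail the operation outright.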
Tokenize :
- the mbox filter now supports messages of type "message/external-body"
IndexSearch :
- fixed possible crash at exit time when the textcat configuration file
points to non-existing model files
Xapian :
- prefer the Chert back-end if available. Applicable to Xapian >= 1.2.0.
Daemon :
- check symlinks against black-list
- the battery status can now be obtained from DeviceKit-power or upower
UI :
- updated Simplified Chinese translation by Eleanor Chen
- updated French translation by verdy_p and Fabrice Colin
- updated Hebrew translation by Yaron
- updated Brazilian Portuguese translation by andbelo
2009/11/14 version_0_9_5
General :
- OpenBSD support, thanks to the work of Antoine Jacoutot
- fixed build when HAVE_DBUS isn't set
SQL :
- use prepared statements on most common queries, transactions on mass updates
Tokenize :
- better handling of acronyms
IndexSearch :
- updated Bing plugin
- removed plugins for Exalead and IOI
Xapian :
- fixed the "path:" operator. Reindexing may be necessary
- rewrote Search This For feature
UI :
- if gtkmm >= 2.16 is available, the Find button is replaced with an icon
- updated Dutch translation by JW
- updated French translation by Thierry Thomas
- updated German translation by Fabian Affolter and Marco Jahn
- updated Hebrew translation by Ddorda
- updated Portuguese translation by Bernardo Lopes
- updated Spanish translation by Jesus Tramullas and DiegoJ
2009/06/27 version_0_9_4
General :
- set _FILE_OFFSET_BITS=64 and fix Debian bug #530572
- merged Funda Wang's linkage patch for Mandriva
- gmime 2.4 is required
Tokenize :
- mbox filter now fully works with gmime 2.4
- archives filter supports Debian packages
- set the close-on-exec flag on document files
- better MIME type detection removes superfluous calls to external
uncompressor programs when dealing with archives
- use file names as title for files attached to mbox messages
- fixed "quashing" of results titles
- fixed indexing of the last document's attachments in an mbox
IndexSearch :
- new search filter "inurl" allows finding files from an mbox or archive
at a given URL
- pinot-index --check on an mbox or archive will return the ID of the
first nested document
- pinot-search shows an estimate of the total number of results
- the Bing plugin replaces the MSN plugin
- fixed Freshmeat plugin
Daemon :
- send an IndexFlushed signal over D-Bus when the index changes on disk
- fixed restoring of user-set metadata
UI :
- better MIME type detection fixes cases where documents nested in
archives couldn't be opened and viewed
- reopen the index upon receiving the IndexFlushed signal
- show properties of external indices' documents read-only
- on exit, delete temporary files created for viewing some documents
- fixed More Like This on Web results
2009/04/13 version_0_9_3
Tokenize :
- moved the first 5Mb limit from the terms generator to the tokenizer
Daemon :
- fixed major bug that caused the daemon to reindex all files on each run,
unless started in full scan mode
2009/04/10 version_0_9_2
General :
- fixed successive initialization and cleanup of libxml2 that could lead to
a crash with libxml2 2.7.3
IndexSearch :
- work around invalid charset declarations in documents
- fixed pinot-index handling of black-listed documents
- redesigned how documents nested in other documents (eg mbox...) are indexed
- removed MozDex plugin
Tokenize :
- new filter for tar files and ISO images based on libarchive >= 2.6.2 to
index the content of those archive formats. Enable with "./configure ...
--enable-libarchive=yes"
Daemon :
- major changes to try and minimize memory usage
- PINOT_MAXIMUM_INDEX_THREADS sets the daemon's number of indexing threads and
defaults to 4
- indexes created with version < 0.92 will be automatically upgraded
UI :
- can open/view files within indexed archives
2009/03/07 version_0_9_1
General :
- patch by Adel Gadllah for gcc 4.4
- removed obsolete Encoding field from .desktop files
Tokenize :
- new HTML filter based on Xapian Omega's HTML parser
- prevent rpm from choking on files with the ".rpm" extension that are not RPMs
- look for the ROBOTS metatag in remote documents only
- only consider the first 5Mb of documents
IndexSearch :
- pinot-search can run stored queries created by the UI
- pinot-index can deal with relative paths, index directories and their contents,
open My Web Pages, My Documents or other UI-configured index by name
Daemon :
- fixed issue where symlinks would get unindexed every second run
- fixed memory leak in time to timestamp conversions
- fixed memory leak when reapplying user-set metadata
- fixed memory leak when skipping the download of a local file
- don't flush the index while files are being indexed
UI :
- fixed spelling suggestions on the live query
- the Import URL option can import local directories and their contents in
My Web Pages
- updated German translation by Gena Haltmair
- updated Portuguese translation by Flávio Martins
2009/01/29 version_0_9_0
General :
- builds with MingW
- patch by Martin Michlmayr to fix gcc 4.4 build errors
Tokenize :
- skip mbox messages flagged deleted by Evolution
- patch by Adel Gadllah to build with gmime 2.4
- catch conversion errors to/from unsupported charsets
- fixed handling of Unicode space and punctuation code points
Collect :
- look for the extended attribute user.mime_type on local files
- HTTP downloaders can do POST
Monitor :
- check /proc/sys/fs/inotify/max_user_watches and set aside 1k watches for
other applications if possible
- patch by Adrian Bunk to fix inotify support with recent kernel headers
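The max_user_watches check in the Monitor entries above could look roughly like this. It is a sketch under assumptions: "1k" is read as 1024, "if possible" is read as "only when the limit is larger than the reserve", and the function name watch_budget is hypothetical.

```python
def watch_budget(limit_path="/proc/sys/fs/inotify/max_user_watches", reserve=1024):
    """Return how many inotify watches to use, leaving `reserve` for other programs.

    Returns None when the limit can't be read (for instance on a non-Linux system).
    """
    try:
        with open(limit_path) as limit_file:
            limit = int(limit_file.read().strip())
    except (OSError, ValueError):
        return None
    # Assumed reading of "if possible": only set watches aside when the
    # system-wide limit is larger than the reserve itself.
    return limit - reserve if limit > reserve else limit
```

With a limit of 8192, this leaves 7168 watches for the monitor and 1024 for everything else.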
Search :
- better rebasing of results' URLs
- dropped A9 and BitTorrent plugins
- added plugin for the Internet Open Index
- fixed various issues with getting more than one results page from Web engines
- support for HTML extracts in OpenSearch Response
Xapian :
- some terms were not always highlighted in the abstract
- support for the "path:" operator
- fix to always return the total results estimate
- don't index the title without prefix as if it were in the text body, but
let queries search across both text body and title. The "title" filter
still allows searching titles exclusively. Requires Xapian >= 1.0.4.
- always add a term for "dir:/"
- MIME type terms don't include any charset specification, as intended
- remove the original query's terms, stopwords, infrequent terms or similar
terms if the stemming language is set from More Like queries
- close all databases in an orderly manner
Search :
- pinot-search has a "sort by date first" mode
Scripts :
- pinot-cd.sh implements a "tagged cd"
- pinot-check-file.sh simplifies determining if a file is in My Documents
Deskbar :
- pinot-module.py uses the new Query method, supports deskbar v2.24 snippets
Daemon :
- user-set metadata, including labels, is saved and restored when reindexing
- D-Bus method RenameLabel obsoleted
- D-Bus method Query replaces SimpleQuery and allows querying the same engines
as the UI
- skip symlinks that refer to places that have been crawled or will be crawled
- if the daemon was interrupted while crawling, some files would never be indexed
- indexes created with version < 0.90 will be automatically upgraded
UI :
- spelling suggestions are shown inline, above results. Upon selecting one and
clicking the Yes button, a new query based on the selected suggestion will be
created.
- restructured menus
- queries' Index Results option updates documents already in My Web Pages. This
doesn't apply to those in My Documents
- on More Like This, selected documents are indexed to My Web Pages if necessary
- both results and index lists can be exported to CSV or XML
- Open Parent opens the directory a file is in
- say "No results" instead of showing a blank results list
- when viewing a document and GIO is used, make sure we consider the default
application(s) first
- support for the "path:" operator
- preferences can be opened independently with "pinot -p" or "pinot-prefs"
- smoother status window
- the List Contents Of menu wasn't refreshed after editing an index's name
- automatic migration of pre-0.90 configuration
- larger default blacklist
- updated Simplified Chinese translation by rainofchaos
- updated Dutch translation by JW
- updated French translation by Frédéric Grosshans
- updated German translation by Gena Haltmair
- updated Japanese translation by Takeo Mizuki
- updated Brazilian Portuguese translation by Henrique P. Machado
- updated Portuguese translation by _PN_boy
- updated Swedish translation by Daniel Nylander
- new Hebrew translation by Yaron
2008/09/20 version_0_8_9
Xapian :
- indexing and searching are now diacritics insensitive by default, thanks
to Unac 1.7.0 by Loic Dachary
- support for removal of stopwords at query time. Language specific lists
should be installed in $PREFIX/share/pinot/stopwords and be named
stopwords.language_code
- better abstracts for short queries
Daemon :
- fixed indexing of plain text and XML files, following changes made in 0.88
- indexes created with version < 0.89 will be automatically upgraded
UI :
- fixed boolean operators in spelling suggestions, broken in 0.88
- spelling suggestion doesn't suggest the same thing over and over again
- dehyphenate queries on line breaks, useful with text pasted from an external
document
- updated Simplified Chinese translation by Aron Xu
- updated Brazilian Portuguese translation by André Gondim
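The handling of hyphens at line breaks mentioned in the UI entries above can be illustrated with a short sketch. The regular expression and the function name dehyphenate are assumptions for illustration; the exact rules Pinot applies are not described here.

```python
import re


def dehyphenate(text):
    """Join words split by a hyphen at a line break, then flatten whitespace."""
    # "index-\ning" becomes "indexing"; hyphens inside a line are left alone.
    joined = re.sub(r"(\w)-\s*\n\s*(\w)", r"\1\2", text)
    return " ".join(joined.split())
```

For example, text pasted as "full-text index-" followed by "ing and search" on the next line would become the single-line query "full-text indexing and search".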
2008/08/30 version_0_8_8
General :
- replace xdgmime with GIO if it can sniff PNG at configure time
- with gcc 4.x, set symbol visibility to hidden by default
Tokenize :
- for unknown text formats, don't be too quick to fall back on the plain
text filter
- the output of external filters can be scanned if need be
- some support for OpenXML formats
Search :
- obsolete Google API engine now built as a dynamic backend
- backends provide slightly more information
- updated several plugins
- removed the CreativeCommons plugin
Index :
- pinot-index --showinfo shows which actions are associated with a MIME type
Xapian :
- only support boolean operators in upper-case. This helps fixing issues
with queries made of text pasted from elsewhere.
Daemon :
- extended GetStatistics to return the flags "low disk space", "on battery"
and "crawling"
UI :
- on first runs, create useful stored queries
- display extended status in the Status window
- updated Simplified Chinese translation by rainofchaos
- updated German translation by Gena Haltmair
- updated Brazilian Portuguese translation by Rafael Porto Rodrigues
- updated Swedish translation by Daniel Nylander
2008/07/20 version_0_8_7
General :
- install the Amazon API plugin with other plugins
Collect :
- obey META REFRESH if set
Search :
- basic Xesam back-end based on xesam-glib. Enable with configure's option
--enable-xesam-glib
- in plugins, setting a value to "EDIT:description" makes it editable and
allows assigning it a value at search time
- pinot-search can set editable parameters with -e/--seteditable
- plugin for the new Yahoo! BOSS API
Xapian :
- skip very short non-CJKV terms when expanding queries.
- fixed mangling of some CJKV queries
- abstract generation is less skewed towards common terms
UI :
- support for drag-n-drop to the stored queries list. Dropping a file will
create a query to look for similar documents. Dropping text will create a new
query set to that text.
- preferences let the user edit all editable parameters defined in the plugins.
They are saved to the configuration file.
- extracts can be selected, copied and dropped onto the queries list to create
new queries
- let foreground threads run for a minute max
- search-only backends (such as Xesam) will appear in the Current User channel
2008/06/21 version_0_8_6
General :
- "make manuals" generates the manuals with help2man
- dropped date parser for curl's/neon's
Tokenize :
- decode email subject lines properly
- the HTML filter skips HTDig's no_index block
SQL :
- query history can keep more than one results set
Collect :
- use Last-Modified header as document's date
Search :
- don't run queries consisting exclusively of spaces
- fixed A9 plugin, removed Accoona
Xapian :
- don't attempt offering suggestions for CJKV terms
Daemon :
- in ignore-version mode, reapply labels too
- documents from directories removed from indexing/monitoring should now be
unindexed on full scans
- SIGTERM wasn't caught !
UI :
- use buttons on notebook tabs
- the live query text field doesn't offer suggestions for filters and ranges
- after a query edit, lists of documents are refreshed correctly
- fixed date displayed when viewing query history
- query history keeps the last two sets
- .desktop file was missing Japanese and Simplified Chinese comments
- SIGTERM wasn't caught !
2008/05/11 version_0_8_5
Build :
- removed reference to m4 directory
General :
- synced with gtk+'s xdgmime
Tokenize :
- new libexif-based filter to extract image metadata
- better conversion of mbox messages and HTML documents to UTF-8
- tweaks to the CJKV tokenizer
SQL :
- more abstract database interface
Search :
- fixed CJKV queries on Web engines
- fixed repetition of CJKV characters in abstracts
Xapian :
- mixed CJKV queries should be processed correctly. See README.
- fixed repetition of CJKV characters in abstracts
- if the document specifies a language, double check it's valid
Daemon :
- check whether there's already a daemon process running and exit if there is
- fixed concurrency issues
- fixed crawling and monitoring of new directories
UI :
- if the global configuration file can't be opened, don't reset the configuration
- new Japanese translation by Takeo Mizuki
2008/03/27 version_0_8_4
Build :
- patch by Adel Gadllah to fix gcc 4.3 build errors
General :
- updated FAQ with how to compact the index
Monitor :
- patch by Michael Biebl for inotify on m68k, mips, mipsel and hppa
Search :
- new plugin for UNdata
Index :
- fixed possible crash when pinot-index exits
Xapian :
- faster CJKV indexing
- the spelling database is populated with CJKV terms too
Daemon :
- fixed possible crash when pinot-dbus-daemon exits
- added option --ignore-version to deal with compacted indexes
UI :
- fixed possible crash when pinot exits
- fixed issue with signaling between crawler and indexer
- stored queries can index all, or only new, results
- in Preferences, patterns can be reset to default values
- the Status window shows whether the daemon was stopped by, or
disconnected from D-Bus
- new simplified Chinese translation by Ashlee Ma
2008/02/28 version_0_8_3
Build :
- fixed build errors with gcc 4.3, thanks to Adel Gadllah
- fixed backend and non-backend flags mismatch
General :
- dropped deprecated Encoding keys in .desktop files, as pointed out by
David Paleino
Tokenize :
- filter definitions in external-filters.xml can specify what charset
the text output is in
- convert documents into UTF-8 prior to indexing
Search :
- pinot-search supports option "--stemming LANGUAGE_NAME"
- updated results parsing in Google.src
Xapian :
- initial support for CJKV. See README for details
- consider stemmed terms when building extracts
Daemon :
- fixed options parsing
- don't stop the directory crawler thread after 5 minutes
UI :
- fixed extract display, broken in the previous release
- tabs can be reordered, notebook is scrollable
- make sure the Status window doesn't miss crawler errors
- updated Spanish translation by Jesus Tramullas
2008/01/26 version_0_8_2
Build :
- don't link to unnecessary libraries
Search :
- removed the WiseNut plugin
- fixed the Sherlock plugin parser's handling of input items, thanks to
Claudio Bustos Navarrete
- support for Xesam RC1
- don't build the Xesam UL parser if Spirit is not available, thanks to
Reuben Thomas
Xapian :
- back-end moved into a dynamic library
- fixed several issues with query stemming
- generate terms for the MIME class
Daemon :
- log an error when there's no inotify watch left
- export HasDocument over D-Bus
UI :
- when the index needs updating, tell the user on every run until he clicks
the "Don't warn me again" checkbox
- stemming is now configured separately and not driven by the "lang" filter
- don't correct spelling of auto-generated and previously corrected queries
- defer importing to the main window
- Status window shows which engines are available
- larger default blacklist
- viewed documents are added to the list of recently used files. Requires
gtkmm >= 2.10
- updated Dutch translation by JW
- updated Spanish translation by Jesus Tramullas
- updated Swedish translation by Zirro
2007/11/24 version_0_8_1
Build :
- misc fixes
General :
- updated FAQ and README
- fixed Icon field in desktop files
Index :
- workaround for broken shared-mime-info rules that identify HTML files as
Mozilla bookmarks
- files whose name includes a question mark were not indexed correctly
- pinot-label would loop forever if the supplied file name wasn't in the index
- don't build the spelling table if the env var PINOT_SPELLING_DB is set to NO
Deskbar :
- new plugin compatible with Deskbar 2.20
Daemon :
- stop crawling and indexing if the partition on which the index resides is
getting full. By default, that means less than 50 Mb. This can be overridden
with the env var PINOT_MINIMUM_DISK_SPACE, eg PINOT_MINIMUM_DISK_SPACE=100
for 100 Mb
- stop crawling when the system goes on battery and restart when on AC. This
requires support for the freedesktop.org's Power Management spec, or pre-spec
gnome-power-manager.
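The disk space threshold described in the Daemon entries above can be sketched like this. Only the variable name PINOT_MINIMUM_DISK_SPACE and the 50 Mb default come from the entry; the function names, the fallback on unparsable values, and the reading of "Mb" as 1024 * 1024 bytes are assumptions.

```python
import os
import shutil


def minimum_disk_space_mb(environ=os.environ, default_mb=50):
    """Read the threshold in Mb from PINOT_MINIMUM_DISK_SPACE, else use the default."""
    value = environ.get("PINOT_MINIMUM_DISK_SPACE", "")
    try:
        return int(value)
    except ValueError:
        # Unset or not a number: fall back to the default (assumed behaviour).
        return default_mb


def should_stop_indexing(index_dir, environ=os.environ):
    """Tell whether the partition holding index_dir has less free space than allowed."""
    free_mb = shutil.disk_usage(index_dir).free // (1024 * 1024)
    return free_mb < minimum_disk_space_mb(environ)
```

Running the daemon with PINOT_MINIMUM_DISK_SPACE=100 in its environment would, under these assumptions, raise the threshold from 50 Mb to 100 Mb.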
UI :
- fixed build against libsigc++ 2.1
- updated Portuguese translation by Tiago Silva
- updated Swedish translation by Daniel Nylander
2007/11/01 version_0_8_0
Build :
- SMP builds, thanks to Gabriel C
Index :
- unknown document types can be indexed if one of their parent types is known
- new pinot-label tool to get, set and list labels on indexed files from the
command-line
- limit external programs to 5 minutes of CPU time
Search :
- support for date (year, month, day), time (hours, minutes, seconds) and
size (in bytes) ranges
- attempt to correct the spelling of index queries that don't match anything
- log how long queries take
- keep connection to remote databases alive
- better query expansion
- the Yahoo! plugin was replaced with the Yahoo! API plugin
- pass queries to Web engines unmodified, without attempting to filter results
based on a filter or a range used in the query
Daemon :
- fixed Reload method
- new D-Bus methods to manage labels
- new --reindex option
UI :
- Search This For menu to search in results
- suggest spelling corrections for index queries that don't match anything
- all indices can be browsed
- inline URL completion in the import dialog box, based on previous results
- better support for user-specific MIME settings, thanks to Lee Marks
- reload MIME settings when they are edited
- results of stored queries can be sorted by relevance (default) or by date
- new History button to show previous results for a stored query
- documents' properties are updated in the background
- documents' terms can be saved to a file
- Status window shows description of errors
- send a Reload to the daemon only when the relevant preferences are modified
- updated Dutch translation by JW
- updated Brazilian Portuguese translation by Leonardo Melo
2007/08/23 version_0_7_6
Build :
- also look for textcat.h in libtextcat
Monitor :
- a deletion would deadlock the monitor and prevent it from processing any
further events
- unindex directories' contents when deleted and update when moved
Collect :
- try to open files with NO_ATIME if possible
- Neon-based downloader had not been brought up to date
Index :
- replaced na(t)ive tokenizer with Xapian's TermGenerator. No effort is
made to convert text to UTF-8 yet, so this depends on document formats
and encodings
- preliminary support for spelling corrections, without user feedback
- index directories are tagged as cache directories so that they are skipped
by "tar --exclude-caches"
- preserve documents' title if possible, use user-specified title on import
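Tagging an index directory so that "tar --exclude-caches" skips it, as noted in the Index entries above, usually means writing a CACHEDIR.TAG file that begins with a fixed signature. A minimal sketch, with a hypothetical function name and comment line:

```python
import os

# Fixed first line required of a CACHEDIR.TAG file.
CACHEDIR_SIGNATURE = "Signature: 8a477f597d28d172789f06886806bc55"


def tag_as_cache_directory(directory):
    """Write a CACHEDIR.TAG file into directory and return its path."""
    tag_path = os.path.join(directory, "CACHEDIR.TAG")
    with open(tag_path, "w", encoding="ascii") as tag_file:
        tag_file.write(CACHEDIR_SIGNATURE + "\n")
        # Anything after the signature line is free-form commentary.
        tag_file.write("# This directory holds a search index that can be rebuilt.\n")
    return tag_path
```

Backup tools that honour the tag treat the directory as disposable, which suits an index that can be regenerated by reindexing.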
Search :
- fixed Sherlock parser for boost 1.34
- support for gSOAP 2.7.9e
- when searching an index, don't resort to OR'ing all terms if the original
query doesn't match anything, this only confused users
Deskbar :
- removed unnecessary shebang in script
Daemon :
- new D-Bus method Reload, which makes it possible to reload the configuration
and act upon it whenever it is modified by the UI
- SimpleQuery still resorts to OR'ing all terms if the original query doesn't
match anything
UI :
- fixed some minor cosmetic bugs, tweaked a few things
- user-specific MIME settings in ~/.local have priority over system settings
- queries with at least a start date can be run, and filter a documents list
- the label specified on import was ignored
- new traditional Chinese translation by Yung-Chung Lin
- updated Portuguese translation by _PN_boy
- updated Swedish translation by Daniel Nylander
2007/07/28 version_0_7_5
General :
- install Dijon's ChangeLog
Index :
- files with nested documents (eg mbox) could sometimes not be fully indexed
and/or the wrong MIME type was reported
Search :
- updated Xesam Query Language parser to reflect current spec
- updated Sherlock plugin for Exalead, removed Ask
Daemon :
- mbox files are no longer configured separately : those found during
a crawl are indexed and monitored automatically
- patterns list can be used as a blacklist (default) or whitelist
UI :
- fixed a bug where filtering a documents list with an empty query would
prevent viewing the list with or without a filter query
- avoid a crash when viewing the properties of several documents one after
the other
- updated Dutch translation by Balaam's Miracle
- updated Portuguese translation by _PN_boy
2007/06/24 version_0_7_4
General :
- "make uninstall" actually uninstalls all files
Index :
- can detect Hungarian, Romanian and Turkish with libtextcat 2.2 and stem
with Xapian 1.0
- adopted Xapian 1.0's new indexing strategy
Search :
- basic support for the Xesam Query and User Language in pinot-search
Daemon :
- history database is separate from the UI's
UI :
- in index list tabs, replaced labels filtering with stored queries filtering,
so that one can find out which and how many documents in the index being
shown match a query (the query's maximum number of results is ignored)
- Hungarian, Romanian and Turkish are valid document languages
2007/05/23 version_0_7_3
Tokenize :
- mbox filter was broken in previous release
- TagLib filter returns the file's name as title if no tag is found
SQL :
- don't force opening and closing the database on every request
Index :
- overwrite index on upgrades, it's faster than deleting all its documents
- support for Xapian 1.0 API
Search :
- stemming wasn't activated when a language is set
- support for Xapian 1.0 API
Daemon :
- don't scan for deleted files on every run
UI :
- fixed crash when unindexing documents
- updated Dutch translation by JW
- updated Italian translation by Marco Bazzani
2007/04/28 version_0_7_2
General :
- synced with gtk+'s xdgmime
Index :
- add extra term to documents so that filter "dir:/" can be applied
Search :
- results lists can be saved as CSV or OpenSearch response XML/RSS
Daemon :
- indexes created with version < 0.72 will be automatically upgraded
- fixed problem where too much time spent upgrading would cause the daemon
to be killed
UI :
- open documents on HTTP/HTTPS with the default Web browser
- updated German translation by Christian Dywan
- updated Italian translation by Vincenzo Consales
2007/03/31 version_0_7_1
Tokenize :
- HTML filter wasn't always properly initialized, which could cause a crash
Monitor :
- file creation wasn't acted upon
- moves to unmonitored locations are treated as deletions
Collect :
- fixed building with a version of curl that doesn't depend on OpenSSL
- dropped pinot-collect tool
Index :
- added versioning
- label names, file names, directory names and URLs are escaped
- directories are indexed as separate documents
Search :
- filters with spaces or control characters should be double-quoted, eg :
file:"Cats & Dogs.txt"
Daemon :
- extended D-Bus methods Set and GetDocumentInfo
- fixed off-by-one error in numbering of mail messages parts
- blacklist wasn't applied to files for which the monitor reports events
- an index created with Pinot < 0.71 is automatically upgraded
Deskbar :
- use deskbar.Utils.url_show() if available
UI :
- Status window is now live
- View on a plain text message shows the mail headers
- added proxy support for Web engines queries and collection of documents on HTTP
- better work-around for desktop files that attempt setting env variables in Exec
- use Gtk::ComboBoxText where appropriate
- German translation by Christian Dywan
- Italian translation by Michele Angrisano
- Portuguese translation by _PN_boy
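
Note on the 0.7.1 Search entry about quoting filters: a minimal Python sketch of
how a caller might build such a filter string. The helper name quote_filter is
hypothetical and not part of Pinot. These notes do not say how embedded double
quotes are handled, so the sketch rejects them instead of guessing.

```python
def quote_filter(name, value):
    """Build a filter such as file:"Cats & Dogs.txt" (hypothetical helper)."""
    if '"' in value:
        # Escaping rules for embedded quotes are not stated in these notes.
        raise ValueError("embedded double quotes are not handled in this sketch")
    # Quote only when the value contains whitespace or a control character.
    needs_quotes = any(
        ch.isspace() or ord(ch) < 32 or ord(ch) == 127 for ch in value
    )
    return f'{name}:"{value}"' if needs_quotes else f"{name}:{value}"
```

For instance, quote_filter("file", "Cats & Dogs.txt") yields the form shown in
the entry, while a value without spaces is left unquoted.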
2007/03/06 version_0_7_0
Tokenize :
- now use Dijon's filters (http://dijon.berlios.de/)
Index :
- support for remote indexes served by xapian-progsrv+ssh
- index to use by pinot-index specified with --db
Search :
- date range filtering applies to index searches
- limit the number of results returned by pinot-search with --max
Daemon :
- log the daemon's PID
- sped up unindexing of documents after a directory is deleted
- caught up with changes in D-Bus 1.0
UI :
- stored queries can do date range filtering. If dates don't make sense
(eg From >= To), they are ignored
- under the Session menu, Status shows various bits of information about
the indexes and the daemon's crawler
- prompt for command to use to open documents of a type for which no
application is defined
- revamped configuration dialog for external indexes
- Index > Properties now shows a document's size and number of unique terms
- most operations that involve peeking at the index are done in the background
- work-around for desktop files that attempt setting env variables in Exec
- caught up with changes in D-Bus 1.0
- Brazilian Portuguese translation by Leonardo Melo
- Russian translation by Sergey Vostrikov
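
Note on the 0.7.0 UI entry about date range filtering: the rule that dates which
don't make sense (eg From >= To) are ignored could be sketched as below. The
function name is hypothetical, and treating a missing bound as "no filtering"
is an assumption not stated in these notes.

```python
from datetime import date


def effective_date_range(start, end):
    """Return (start, end) if the range is usable, else None (no date filter)."""
    # Assumption: a missing bound disables date filtering altogether.
    if start is None or end is None:
        return None
    # From >= To does not make sense, so the range is ignored.
    if start >= end:
        return None
    return (start, end)
```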
2006/12/21 version_0_6_5
Build :
- added option --enable-debug to configure, --enable-soap replaces --with-soap
- complain bitterly if libtextcat header is not found
- Pthreads may be provided by a library other than libpthread
- install configuration files in sysconfdir, libraries in libdir
Tokenize :
- optimized Ogg/MP3 filter
Index :
- fixed extraction of language and size from document data
- store date terms for future date range filtering
- fixed argument checking in pinot-index
Daemon :
- new DBus method SetDocumentsLabels to relabel several documents at once
- process DBus messages in a separate thread, not in the main thread
UI :
- fixed expansion of .desktop's Exec
- initialize D-Bus ! Not sure why this didn't cause problems before.
- fixes for when the locale is not UTF-8
- on first run, open the Preferences box and show the Indexing tab
- for stored queries set to index and label results, only apply the new label
to results that already are in one of the indexes, don't do a full update
- refresh labelled documents list correctly after properties are changed
2006/12/05 version_0_6_4
Collect :
- don't needlessly load files that are going to be handled by a helper application
Index :
- store documents size and file extension, if any
- fixed concurrency bug that could thrash the index on SMP systems
- very long capitalized terms could lead to document loss
- file names were always lower-cased
Search :
- regenerated the Google SOAP API stubs with gsoap 2.7.8c
UI :
- added filter on file extension
- fixed crash on SMP systems when listing an index
- fixed deadlock when indexing a query's results. Oddly enough, it seems it
happened only on FreeBSD !
- Dutch translation by Tikkel
- Swedish translation by Daniel Nylander
Daemon :
- set a lower scheduling priority
- queue events in the database, not in memory
- clean exit when signalled/stopped while crawling
2006/11/18 version_0_6_3
Collect :
- watch out for NULL characters in data
Tokenize :
- fixed memory leak. Temporary documents were not deleted most of the time
UI :
- prettified results list
- better abstract highlighting
- fixed clipboard copy of results list and abstract
Daemon :
- autostart the daemon process
- with dbus < 0.70, close the connection
2006/11/04 version_0_6_2
General :
- query shared-mime-info prefix, so that the applications database can be
loaded even when Pinot is installed under a different prefix
- copyright notice was missing in source
Index :
- detect and support libtextcat 3.0 peculiarities
- can skip files based on glob pattern
Search :
- fixed issue where label and directory filters were not applied correctly
when the filter doesn't start with an upper-case letter. Directory filters
starting with a non-alphanumeric character only work with Xapian >= 0.9.8.
- fixed A9, Accoona and Exalead plugins
UI :
- file patterns to skip can be set in Preferences, Indexing
- columns showing a timestamp were sorted alphabetically
- refresh index lists correctly when exiting Preferences
Daemon :
- fixed major bug where the daemon would loop endlessly reindexing mp3/ogg
files. When notified that a writable file was closed, check the file was
actually modified before reindexing it.
- fixed D-Bus warning about closing the connection when exiting
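
Note on the 0.6.2 Daemon entry about endless reindexing: the idea of checking
that a file was actually modified before reindexing it can be sketched as
follows. How Pinot performs this check is not stated here; comparing the
modification time and size against the last values seen is an assumption, and
the class name is hypothetical.

```python
import os


class ModificationTracker:
    """Remember each file's last seen (mtime, size) to skip no-op close events."""

    def __init__(self):
        self._seen = {}  # path -> (mtime in nanoseconds, size in bytes)

    def needs_reindex(self, path):
        """Return True if the file changed since the last call for this path."""
        st = os.stat(path)
        stamp = (st.st_mtime_ns, st.st_size)
        if self._seen.get(path) == stamp:
            # Closed after being opened for writing, but nothing changed.
            return False
        self._seen[path] = stamp
        return True
```

A handler for "writable file closed" events would call needs_reindex(path) and
only queue the file for indexing when it returns True.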
2006/10/18 version_0_6_1
General :
- switched to gtk+'s version of xdgmime
Index :
- can now run queries like "type:text/html and lang:en and (tcp near ip)".
See README for more information about the syntax and a list of filters.
Search :
- don't reject MozSearch plugins
- added plugin for Google code search service
- filters 'site' and 'file' (host name, file name) apply to Web engines
- fixed abstract generation
UI :
- revamped the stored queries editor to allow any number of terms and filters
- preferences relative to My Documents and My Web Pages gathered under the
same tab
- hide the stored queries and engines lists by default to avoid scaring
people used to Beagle too much ;-)
- mail accounts configuration wasn't always saved correctly
- updating a document from the My Web Pages index messed up the abstract
Daemon :
- index attachments in mbox files
- documents deleted since last crawl were not removed from the index if the
corresponding location wasn't monitored
2006/09/25 version_0_6_0
Build :
- modifications for building on Cygwin, contributed by Reini Urban
Monitor :
- inotify monitor more flexible
SQL :
- retry if the database is busy
- save the daemon's crawler history
Index :
- able to open remote indexes that were not initially available
- indexes are opened in write mode only when necessary
- prefer Xapian's Flint back-end to Quartz
- terms starting with an upper-case letter are not ignored by terms suggestion
- refresh terms generated from the previous title, location etc... when updating a document
- MIME type and directory hierarchy are stored as terms
- store documents last modification date in a way compatible with Xapian Omega
Search :
- fixed abstract generation with remote indexes, and queries with OR'ed terms !
- brought MozDex plugin back
UI :
- show results scores
- fixed language shown in the properties box when language is unknown
for one or more of the documents selected
- better at queuing document indexing
- My Email index is replaced with the index managed by the daemon
- search terms suggestion, More Like This and the index status icon on results
rely on both internal indexes
- only URLs can be imported now as the daemon takes care of local files
Daemon :
- crawls and indexes user-defined locations on the filesystem, mbox files included
- recursively monitors these locations for changes and updates the index
- provides a DBus interface suitable for querying and document inspection
- no dependency on gtkmm
Deskbar Applet :
- plugin for Deskbar Applet that searches documents indexed by the daemon
2006/07/05 version_0_5_0
Tokenize :
- switched to pdftotext, which seems to support more files than pdftohtml
SQL :
- remove history items older than one month at startup
Search :
- can search for documents with language "Unknown" (ie those for which
language detection failed)
UI :
- multiple user-configurable cache services. They are defined in
$PREFIX/share/pinot/globalconfig.xml and listed under the View Cache menu.
- allow changing the language of several documents at once
- after editing a query, don't change selection in the queries list
- More Like This extracts representative terms from the selected documents
(if they are indexed) and creates a "More Like ..." query
- set the indexed status icon when results are indexed
- view results on double clicks
- show a message if there is no application defined for a file's type
- most tree columns can be sorted at last ! :-)
- work-around for bizarre bug that causes a segfault when creating a query
that indexes and labels results based on a language filter
2006/06/11 version_0_4_9
General :
- able to find the default application for a given MIME type
Tokenize :
- better HTML tokenizer
UI :
- titles in documents lists are no longer truncated
- if import fails, the progress bar displays the error message
- dropped the internal viewer, default applications are launched on View
- View Cache no longer dependent on Google SOAP API. It just points to the
Google Cache for http[s] results
- remember whether the queries list is expanded
2006/05/25 version_0_4_8
Monitor :
- replaced FAM/Gamin with inotify
Search :
- fixed BitTorrent source
UI :
- fixed menuitems inconsistencies
- always use user-provided language on update !
- all query terms are highlighted in the extract !
- threads that finish while the import dialog is up are processed when the
dialog box is closed
2006/05/12 version_0_4_7
General :
- all programs have man pages and support --help and --version
Search :
- the Google API engine is no longer unnecessarily picky about query parameters
- updated Creative Commons plugin, based on the one shipped with Firefox
UI :
- merged channel Web Services with The Web
- the state of engine channels is saved and restored