DocFetcher

Grandalf · 16 June 2018

DocFetcher is an open-source program designed to help us search for files on our computer while also letting us preview their contents. The operating system's built-in search can of course be used instead, but it takes much longer.
Indexing and searching in the program are based on Apache Lucene, a widely used open-source search engine.
One of the application's many advantages is that it is available in a portable version, which lets you build a fully indexed, portable document repository. You can carry it on a USB drive, burn it to a CD-ROM for archiving, place it in an encrypted volume, or synchronize it between multiple computers through a cloud service (e.g. Dropbox).

The screenshot below shows the main user interface. Queries are entered in the text field (1); to search for a phrase, insert a “+” between the words, without spaces. Search results are displayed in the result pane (2). The preview pane (3) shows a text-only preview of the file currently selected in the result pane. All matches are highlighted in yellow.
Results can be filtered by minimum and/or maximum file size (4), file type (5), and location (6). The buttons (7) open the help, open the preferences, and minimize the program to the system tray.

DocFetcher requires so-called indexes to be created for the folders we want to search. An index makes it possible to find out very quickly which files contain a given set of words, which speeds up searching considerably. To open the “Indexing Queue” window, right-click in the “Search Scope” pane, choose a folder (or archive), and click “Run”. Indexing may take a while, depending on the number and size of the files to be indexed. This step is done only once per folder. The program updates such indexes the next time it starts.
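
To illustrate why an index speeds up searching, here is a minimal Python sketch of an inverted index. It is not DocFetcher's code (DocFetcher relies on Apache Lucene, which is far more sophisticated); the function names and the sample documents are made up for this example.

```python
from collections import defaultdict
import re


def build_index(documents):
    """Map each lowercase word to the set of document names containing it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for word in re.findall(r"\w+", text.lower()):
            index[word].add(name)
    return index


def search(index, query):
    """Return the names of documents that contain every word of the query."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return set()
    # Start with the documents for the first word, then intersect with the rest.
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical sample data standing in for files in an indexed folder.
documents = {
    "report.txt": "Quarterly sales report for the northern region",
    "notes.txt": "Meeting notes about the sales forecast",
    "todo.txt": "Buy milk and call the plumber",
}
index = build_index(documents)
matches = search(index, "sales report")
```

The index is built once, which is the slow part. After that, each query only looks up a few dictionary entries instead of re-reading every file.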

Document formats supported by the program:

Microsoft Office (doc, xls, ppt)

Microsoft Office 2007 and newer (docx, xlsx, pptx, docm, xlsm, pptm)

Microsoft Outlook (pst)

OpenOffice.org (odt, ods, odg, odp, ott, ots, otg, otp)

Portable Document Format (pdf)

EPUB (epub)

HTML (html, xhtml, …)

TXT and other plain text formats (customizable)

Rich Text Format (rtf)

AbiWord (abw, abw.gz, zabw)

Microsoft Compiled HTML Help (chm)

MP3 Metadata (mp3)

FLAC Metadata (flac)

JPEG Exif Metadata (jpg, jpeg)

Microsoft Visio (vsd)

Scalable Vector Graphics (svg).

To work properly, the program requires a runtime environment to be installed on the system.

Grandalf · 22 June 2018

DocFetcher 1.1.21

version 1.1.21 2018-06-22

Bugfixes

The previous version of DocFetcher could not be started on OS X when running Java 8 or older.

Crash with “root cannot be null” message on certain malformed PDF files (bug #1443).

Crash with “ClassCastException” error on certain PDF files (bug #1459).

Crash on certain EPUB files (bug #1463).

Crash when trying to set strings like “(zip jar” in the plain text or zip extension field on the indexing dialog (bug #1457).

Grandalf · 25 August 2018

DocFetcher 1.1.22

2018-07-30 - DocFetcher 1.1.22

Bugfixes

DocFetcher could not be started on OS X with Java 9 or newer.

Crash on Windows due to hotkey issues.

Crash on some PDF files.

MS Office files containing very large amounts of text could not be read.

DocFetcher could not read the metadata of certain JPEG files.

Grandalf · 15 May 2021

DocFetcher 1.1.24

2021-05-10 - DocFetcher 1.1.24

Bugfixes

Emergency bugfix: Index saving was broken because one of the index files, called "tree-index.ser", was not properly written to disk. As a result, the program would later fail to load newly created indexes and warn about index incompatibility. This release fixes the index saving issue. Note that you can manually repair indexes that fail to load by closing DocFetcher and then renaming all "tree-index.ser.temp" files on your computer to "tree-index.ser".
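
If there are many index folders, the manual repair described above could be scripted. The following is only a sketch: the function name `repair_indexes` and the folder you pass to it are hypothetical, and skipping folders that already contain a "tree-index.ser" is a cautious assumption of this example, not something the release notes prescribe. Close DocFetcher and back up the index folder before trying anything like this.

```python
from pathlib import Path


def repair_indexes(root):
    """Rename every tree-index.ser.temp below root to tree-index.ser.

    Returns the list of files created by the rename. A folder that already
    has a tree-index.ser is skipped (assumption: never overwrite an
    existing file).
    """
    repaired = []
    # Collect the matches first so that renaming does not disturb the directory walk.
    for temp_file in list(Path(root).rglob("tree-index.ser.temp")):
        if not temp_file.is_file():
            continue
        target = temp_file.with_name("tree-index.ser")
        if target.exists():
            continue
        temp_file.rename(target)
        repaired.append(target)
    return repaired
```

Calling `repair_indexes(index_root)`, where `index_root` is a placeholder for the folder that holds your DocFetcher indexes, returns the paths that were repaired so they can be reviewed afterwards.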

2021-05-07 - DocFetcher 1.1.23

Features

The preview pane now displays line numbers for plain text file formats, such as source code.

Improved MS PowerPoint text extraction: Now notes, comments and master text are extracted.

Improved MS Excel support: Now cell comments are extracted.

MS Office support: Added various file extensions. (Word 2007+: dotm; Excel 2007+: xltm; PowerPoint 2007+: ppsm, potx, potm; PowerPoint before 2007: pot; Visio before 2007: vss, vst, vsw.)

Added GUI and manual translations for Chinese Simplified, Turkish and Ukrainian.

Advanced setting "CheckSingleInstance" to suppress the warning message that DocFetcher displays when it detects after launch that another DocFetcher instance is running or was not cleanly terminated.

Advanced setting "ShowPathsDuringIndexing" to display file paths instead of filenames during indexing. This is useful for locating files that cause DocFetcher to hang.

Advanced setting "WriteIndexingLog" to write the paths of files being indexed to a log file. This helps with identifying problematic files DocFetcher chokes and crashes on during indexing.

Advanced setting "PdfPreviewVisualOrder": If the preview pane shows the words in PDF files in a jumbled order, experimenting with this setting may help.

Advanced setting "OpenLimit" to adjust the maximum number of files that can be opened all at once from the search results.

Any changes you've previously made to the values in the advanced settings file (program-conf.txt) will now be preserved when upgrading to this DocFetcher version or a later version. In earlier versions, such changes were lost when the upgrade added new entries to the advanced settings file.

Bugfixes

The global hotkey (by default Ctrl + F8) is now disabled by default due to known stability issues (e.g., bug #1514). You can enable the hotkey at your own risk via the new advanced setting "HotkeyEnabled".

Upgraded the GUI library SWT for 64-bit systems (SWT 4.9 → 4.19). This fixes a NullPointerException crash on macOS that prevented the program from being started.

Added an extra macOS launch script in portable DocFetcher, as a fallback in case of launch problems.

On macOS, DocFetcher can now be run on the current Java runtimes from Oracle. The legacy Java 6 runtime from Apple is not required anymore.

On Java 9 and later, the -Duser.home variable in the various launcher scripts was ignored.

The startup message about launching another DocFetcher instance was opened under all other windows.

On Windows, the taskbar pinning of the program did not work entirely correctly.

Previously, it was possible to add new indexes via the indexing dialog despite index creation being disabled in the advanced settings.

ClassCastException crash during indexing of PDF files.

On Java 10, DocFetcher was unable to read certain EPUB files, displaying a "Use a Path constructor or method instead!" error (bug #1559).

Fixed an issue with the indexing of old MS Word files.

When writing the tree-index.ser file (a vital part of the index files) to disk, the program will now first write to a temporary file instead of overwriting the old file directly. This is a safeguard against potential corruption of the tree-index.ser file.

Since DocFetcher 1.1.20, the wildcard '?' did not match numbers preceded by a dot anymore, due to changes in the underlying Lucene search engine. As a workaround, there's now a "Whitespace" word segmentation option in the preferences to somewhat restore the old behavior (bug #1558).

On KDE-based Linux distributions such as Kubuntu, double-clicking files in the search results did not open them.

The icon in the top right of the file size filter was always in the state "minimize" after program launch, even if the file size filter was already minimized.

On macOS, the program erroneously showed the Ctrl+C keyboard shortcut instead of ⌘C in several places, e.g., in the context menu of the result pane.
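
The tree-index.ser safeguard mentioned in this list (write to a temporary file first, then replace the old file) is a common pattern. Below is a minimal Python sketch of the general idea. DocFetcher itself is a Java program, so this is not its implementation, and the function name `write_atomically` is made up for the example.

```python
import os
import tempfile


def write_atomically(path, data):
    """Write bytes to path via a temporary file in the same directory.

    The old file stays intact until the new content is fully on disk;
    only then is it swapped in with os.replace.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, temp_path = tempfile.mkstemp(dir=directory, suffix=".temp")
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(data)
            handle.flush()
            os.fsync(handle.fileno())  # push the bytes to disk before the swap
        os.replace(temp_path, path)
    except BaseException:
        # Clean up the temporary file if anything went wrong before the swap.
        if os.path.exists(temp_path):
            os.remove(temp_path)
        raise
```

If the program crashes in the middle of the write, only the temporary file is incomplete and the previous version of the target file is still readable.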

Changes

New DocFetcher*.exe launchers with 12 GB and 16 GB memory limit.

Added documentation in the DocFetcher.bat file.

Increased default memory limit from 512 MB to 1 GB.

Upgraded PDFBox library for reading PDF files (PDFBox 2.0.9 → 2.0.13).

Added default exclusion rules for .git and .svn folders on the indexing dialog.

DocFetcher no longer considers the file extensions "php", "asp" and "jsp" as HTML file extensions.

Indexing: The keep-discard dialog now takes the "platform dismissal alignment" into account, meaning the order of the dialog's buttons now follows the conventions of the platform.

Slight design change with respect to the borders of the filter controls in the left part of the GUI.

Added a link to the DocFetcher Pro website in the status bar, and an info message about DocFetcher Pro.
