Locally-run neural-network tagging for images. The goal is to eventually cover faces, emotion, quality, and more, but as of August 2020 it only recognizes faces.
LITS generates encodings of known people, scans all pictures found in a specified folder, applies face recognition to every supported file format, and keeps track of which people appear in which images by writing the names of known, found people to each picture's metadata tags. This lets those pictures be searched from photography library management tools such as Photoshop Lightroom. _Note: If you are running LITS on a library that's already managed with something like Lightroom/Darkroom/Digikam, please be sure to have that tool write its metadata to your files prior to running LITS against your dataset, and to force a metadata read after LITS completes._
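As a rough illustration of the tagging mechanism, the sketch below uses the pyexiv2 dependency to append a name to an image's IPTC keyword list. The file path, function name, and the choice of the standard Iptc.Application2.Keywords tag are illustrative assumptions, not necessarily the exact field or code LITS uses internally.

```python
import pyexiv2  # one of LITS' dependencies

def add_keyword(path: str, name: str) -> None:
    """Append a person's name to an image's IPTC keyword list (illustrative sketch only)."""
    img = pyexiv2.Image(path)
    try:
        keywords = img.read_iptc().get("Iptc.Application2.Keywords", [])
        if isinstance(keywords, str):  # a single existing keyword comes back as a plain string
            keywords = [keywords]
        if name not in keywords:
            img.modify_iptc({"Iptc.Application2.Keywords": keywords + [name]})
    finally:
        img.close()

add_keyword(r"c:\pictures\example.jpg", "John Doe")
```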
Initially, LITS only tags images to enable search elsewhere, but future versions will provide a facility for querying the internal database. In the interim, tools such as DBBrowser allow you to query the internal database. Take a look at the queries dictionary in dashboard.py for some sample queries to get started.
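If you'd rather poke at the database from Python instead of DBBrowser, the standard-library sqlite3 module can open the same file. Since the schema isn't documented here, the snippet below only lists the tables that exist; the path is whatever you pass with -db.

```python
import sqlite3

# Open the database LITS built and list its tables (no assumptions about the schema).
con = sqlite3.connect(r"c:\pictures\lits.db")
for (table,) in con.execute("SELECT name FROM sqlite_master WHERE type='table' ORDER BY name"):
    print(table)
con.close()
```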
JPEG is the only fully supported file type, as other formats either lack appropriate metadata fields or use different ones.
Untested... but probably functional file formats: .png, .bmp, .gif and other simple raster image formats should work, but their lack of JPEG-like metadata fields limits the utility of doing so. Future versions may support more metadata types than IPTC. By default, LITS filters these out, but you can modify the valid_extensions variable in lits.py to test a specific file type (a sketch of that filter appears after this list).
Untested... and unlikely-to-be-functional file formats: everything else Pillow can read. By default, LITS filters these out as well; the same valid_extensions variable in lits.py controls which extensions are attempted.
Unsupported file formats: DNG and other raw files. Anything that's not a picture.
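The filtering itself is just an extension check. Below is a minimal sketch of the idea; the actual list and logic live in the valid_extensions variable in lits.py and may differ.

```python
from pathlib import Path

# Illustrative only: the real list lives in the valid_extensions variable in lits.py.
valid_extensions = {".jpg", ".jpeg"}

def is_supported(path: Path) -> bool:
    return path.suffix.lower() in valid_extensions

# Collect candidate files under the scanroot, skipping unsupported extensions.
candidates = [p for p in Path(r"c:\pictures").rglob("*") if p.is_file() and is_supported(p)]
```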
Prior to running LITS, you'll need to do the following:
- Create a known set. This is a folder of image files named for the people in them, with one person per image. The filename portion (minus the extension) of each file in this folder will be used as the person's name when LITS later applies tags.
- Create an unknown set in its own folder, or choose an existing folder (an example folder setup follows below).
Effect: A file in your known folder called John Doe.jpg (a picture of John Doe and only John Doe) will cause LITS to check your scanroot for other pictures containing John Doe's face.
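The layout below is purely illustrative (the file names are invented for the example); the folder paths match those used in the example command later in this README.

```
c:\pictures                 <- scanroot (the unknown set)
|   PANO_20151029_143515.jpg
|   vacation-day-1.jpg
|
\---lits-people             <- known set
    |   John Doe.jpg
    |
    \---William Lockwood
            beach-trip.jpg
            office-party.jpg
```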
After installation, run LITS from a command line in the following fashion:
py lits.py -scanroot c:\pictures -known c:\pictures\lits-people [-db cache.db -tolerance 0.5]
Future versions may include a graphical interface.
-scanroot specifies the root of the folder structure where pictures will be scanned for matches to the known people and, later, other features.
Not yet implemented: if the scanroot path includes the known image root, the known image root and all of its subfolders will be ignored.
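As a generic sketch of how that exclusion could work once implemented (this is the usual os.walk pruning pattern, not LITS code):

```python
import os

scanroot = r"c:\pictures"
known_root = os.path.abspath(r"c:\pictures\lits-people")

for dirpath, dirnames, filenames in os.walk(scanroot):
    # Prune the known-people folder so os.walk never descends into it.
    dirnames[:] = [d for d in dirnames
                   if os.path.abspath(os.path.join(dirpath, d)) != known_root]
    for name in filenames:
        print(os.path.join(dirpath, name))
```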
-known specifies the root of the folder structure where identified people can be found. Inside should be any combination of:
- single pictures labeled with a name, such as William Lockwood.jpg
- folders labeled with a name, such as William Lockwood, under which one or more pictures of that person can be found. File names inside a named folder don't need to follow any special logic.
-db specifies where the SQLite database will be stored. Defaults to lits.db in the unknown image root.
In addition to writing directly to each image's metadata, LITS will build a database with much of this information in it so that it can quickly identify which files it's already encoded and not need to reprocess them. This significantly decreases the time it takes to update the tagging data when doing a delta after adding more pictures to the collection.
Future versions will enable the user to search the database directly.
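The real schema lives in Controllers/Database.py and isn't documented here; the snippet below is only a hypothetical illustration of the general caching idea (remembering which files were already processed so a later run can skip them), not LITS' actual tables or columns.

```python
import sqlite3

con = sqlite3.connect("cache-demo.db")  # hypothetical demo database, not lits.db's real schema
con.execute("CREATE TABLE IF NOT EXISTS processed (path TEXT PRIMARY KEY, mtime REAL)")

def needs_processing(path: str, mtime: float) -> bool:
    """True if the file is new or has changed since it was last recorded."""
    row = con.execute("SELECT mtime FROM processed WHERE path = ?", (path,)).fetchone()
    return row is None or row[0] != mtime

def mark_processed(path: str, mtime: float) -> None:
    con.execute("INSERT OR REPLACE INTO processed (path, mtime) VALUES (?, ?)", (path, mtime))
    con.commit()
```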
-tolerance is optional and adjusts how closely a face must match a known encoding to be considered a match. It defaults to 0.6; lower values (e.g. 0.2) force stricter matches at the cost of more false negatives.
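The face_recognition dependency exposes the same notion of tolerance: a face counts as a match when its distance to a known encoding falls below the threshold. The sketch below shows how a lower value tightens matching; the image paths are placeholders, and this is not necessarily how LITS calls the library internally.

```python
import face_recognition

# Placeholder paths: substitute files from your own known folder and scanroot.
known_image = face_recognition.load_image_file(r"c:\pictures\lits-people\John Doe.jpg")
unknown_image = face_recognition.load_image_file(r"c:\pictures\group-photo.jpg")

known_encoding = face_recognition.face_encodings(known_image)[0]  # assumes exactly one face

for encoding in face_recognition.face_encodings(unknown_image):
    distance = face_recognition.face_distance([known_encoding], encoding)[0]
    # Lower tolerance accepts fewer (but more certain) matches than the default 0.6.
    print(f"distance={distance:.3f}  match@0.6={distance <= 0.6}  match@0.2={distance <= 0.2}")
```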
The development environment is Windows, so the installation instructions assume it. Installing in other environments should be doable with slight modifications that are left as an exercise to the Linux-using reader.
- If you don't already have it, install Python3 from https://www.python.org/downloads/.
- This should already come with pip and the venv module.
- Clone LITS:
git clone https://github.com/wlockwood/lits.git
- Navigate into LITS' folder that Git created:
cd lits
- Create a virtual environment for your LITS install:
py -m venv env
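- Activate the environment so the dependency installs below land inside it (standard venv usage on Windows, nothing LITS-specific):
env\Scripts\activate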
- Install dependencies:
pip install -r requirements
or, if you prefer, install the packages individually:
pip install face_recognition
pip install pyexiv2
pip install matplotlib
- [Optional] Download SQLite Tools or DBBrowser for SQLite if you want to run queries against the database that is built. This may be integrated in future versions. SQLite Tools has a CLI, and DBBrowser has a quite nice GUI.
Root of installation
| lits.py
| dashboard.py
| tests.py
| lits.db
+---Model
| | FaceEncoding.py
| | ImageFile.py
| | Person.py
|
+---Controllers
| | Database.py
| | FaceRecognizer.py
|
+---unittest-data
| +---known
| | alyssa.jpg
| | bruce.jpg
| | will.jpg
| |
| \---unknown
| | alyssa dean heidi.jpg
| | mecha art.jpg
| | mushroom.jpg
| | PANO_20151029_143515.jpg
| | work group will.jpg
| |
| +---Test subfolder
| | | catbus.jpg