Triplet Embeddings in Deeplearning4j

Thanks for reading crockpotveggies.

Triplet Embeddings in Deeplearning4j - Adapting FaceNet

Inspired by Google’s FaceNet project and OpenFace (and some help of Skymind.io), I implemented a java-based verion of triplet embeddings for the purpose of better computer vision at Bernie AI.

An example of facial landmark detection

Thanks to the help of Alex Black, one of Skymind’s most knowledgeable engineers, I was able to write an implementation of triplet embeddings for Deeplearning4j and create a variant of FaceNet for Bernie AI. This work was prompted by performance issues from our fork of OpenFace to add a custom image pipeline, and a desire to continue improving the accuracy of facial comparisons for our users. As Bernie grew, the combination of Python and LUA became troublesome and we saw resource lock-ups - our fork was not meant to develop high-concurrency mmicroservices. Hence, it was time to port our facial comparison service to the JVM.

If you haven’t read the FaceNet Paper, it revealed a very interesting usage of neural network embeddings for the purpose of generating a representation of an input (in this case, a face). In simple terms, embeddings are mappings of relationships in data passed through a neural network. Word2Vec is an excellent example of using embeddings to map the relationships between words and use them for clustering, classification, and comparison. For the purpose of FaceNet, the embeddings are a mapping of facial features.

Illustrative example of triplet training loss via OpenFace paper

To properly train a neural network to create vision embeddings, you must split your data into triplets before passing through a network (there are also other ideas such as center loss that I won’t describe here). This involves passing three examples through the neural network, and examining the “loss” at the end. 3 images are passed, where the first x is the anchor class, and x+ is a positive example of that image, and x- is a negative example of that image. This leads to 2D representations of each image placed on a hypersphere, and each subsequent representation can be compared, linearly. You no longer have to re-train the neural network in order to identify new objects or faces.

Diagram of DL4J Triplet Embedding Model

What’s special about the implementation for Deeplearning4j is that the pieces required for loss calculation are more modular, and the vertices we created for DL4J’s ComputationGraph can be re-used for other setups. The implementation is JVM - a breath of fresh air when doing research and production implementations in the same language (portability!). In the immediate future I hope to use triplet embeddings to discover the differences between objects and find a way to speed up identification of scene markers for our context detection at Bernie.

I’m in the process of finishing up a FaceNet variant on my Github which I will merge into the DL4J model zoo in hopes that others can also explore and use the work. Many thanks to Alex Black, the Skymind team, and brief feedback from Brandon Amos for their help on this implementation.

Notes on Nothingness in a Physics and Philosphy Perspective

Since two years ago when I was heavily researching advanced propulsion, physics, and the Alcubierre solutions I’ve come to realize just how inadequate our physics theory is at explaining the underlying universe.

Triplet Embeddings in Deeplearning4j - Adapting FaceNet

Inspired by Google’s FaceNet project and OpenFace (and some help of Skymind.io), I implemented a java-based verion of triplet embeddings for the purpose of better computer vision at Bernie AI.

Automating Tinder with Artificial Intelligence

Back in 2014 I was absolutely frustrated with online dating and tried to automate everything. It was average, so I came back to the project, increased feedback accuracy to 99%, and found a girlfriend and built a team.

Neutrons, Light, Gravity, and Unexplored Warp Research

Our little warp research team has been making incremental progress towards reaching our goal of engineering an experimental setup. We discovered some mind-blowing concepts and some overlooked research.

Feature Sharing in Classes and Convolutional Neural Network Troubles

I’ve been taking a stab at computer vision and convolutional neural networks. Interestingly, I discovered a hard-to-solve problem when some classes contained features that indicated to multiple classes.

What's Missed in NASA's Warp Drive Research

Much skepticism surrounds NASA’s Eagleworks Lab and its Warp drive experiments. Spending the last six months playing catch-up on the field, we discovered a potentially large oversight in the early-stage research.