https://github.com/ivreo/sift_anatomy_extra
Tip revision: 0f6cc7d60ae4355bdfeb0b70cbb79a60806f1d0d authored by ivreo on 08 July 2016, 18:31:44 UTC
Update README.md
Update README.md
Tip revision: 0f6cc7d
README.txt
IPOL SIFT
Copyright (C) 2014, Ives Rey-Otero, CMLA ENS-Cachan
<ives.rey-otero@cmla.ens-cachan.fr>
Version 20140911 (September 11th, 2014)
===============================================================================
== Overview ===================================================================
This C/C++ ANSI source code is related to the article
[1] "An Analysis of the scale-space sampling in SIFT."
I. Rey Otero, J.M. Morel, M. Delbracio
ICIP 2014.
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7025982&tag=1
Part of the code is also related to the article
[2] "Anatomy of the SIFT method."
I. Rey Otero, M. Delbracio
IPOL (Image Processing On Line - 2014)
http://www.ipol.im/pub/art/2014/82/
http://dx.doi.org/10.5201/ipol.2014.82
It adds extra functionalities for the numerical analysis of the SIFT method.
- Exact scale-space computation using DFT
- spline interpolation
- extrema interpolation satisfying the least square error
- EXR input/output
- define the DoG differential operator independently of the scale-space
sampling
===============================================================================
== Patent Warning and License =================================================
The methods implemented here are extensions of the SIFT method.
This method is patented
[3] "Method and apparatus for identifying scale invariant features
in an image."
David G. Lowe
Patent number: 6711293
Filing date: Mar 6, 2000
Issue date: Mar 23, 2004
Application number: 09/519,89
These source codes are made available for the exclusive aim of serving as
scientific tool to verify the soundness and completeness of the algorithm
description. Compilation, execution and redistribution of this file may
violate patents rights in certain countries. The situation being different
for every country and changing over time, it is your responsibility to
determine which patent rights restrictions apply to you before you compile,
use, modify, or redistribute this file. A patent lawyer is qualified to make
this determination. If and only if they don't conflict with any patent terms,
you can benefit from the following license terms attached to this file.
This program is free software: you can use, modify and/or
redistribute it under the terms of the simplified BSD
License. You should have received a copy of this license along
this program. If not, see
<http://www.opensource.org/licenses/bsd-license.html>.
===============================================================================
== Compiling (Linux) ==========================================================
To compile
Type
make
in the directory where the Makefile is located.
The code relies of the following common libraries:
- libpng12-dev
- libjpeg-dev
- libtiff5-dev
- libfftw3-dev
- libopenexr-dev
The compilation of the source code provides three executables:
1) sift_cli: (original algorithm) applies the SIFT method to a PNG image.
Parameters are documented in [2].
2) match_cli: matches the SIFT keypoints extracted from two images.
3) extra_sift_cli: Compute the Gaussian scale-space using an exact
implementation of the Gaussian convolution based on the
Fourier interpolation.
The DoG operator is defined independently of the
scale-space sampling.
4) gradual: same as extra_sift_cli, computes the scale-space gradually to
reduce the memory usage.
===============================================================================
== SIFT executable - Usage ====================================================
$ ./sift_cli image [options...] [> keys]
options :
-ss_noct (8) number of octaves
-ss_nspo (3) number of scales per octaves
-ss_dmin (0.5) the sampling distance in the first octave
-ss_smin (0.8) blur level on the seed image
-ss_sin (0.5) assumed level of blur in the input image
-thresh_dog (0.0133) threshold over the DoG response
-thresh_edge (10) threshold over the ratio of principal curvature
-ori_nbins (36) number of bins in the orientation histogram
-ori_thresh (0.8) threhsold for considering local maxima in
the orientation histogram
-ori_lambda (1.5) sets how local is the analysis of the gradient
distribution
-descr_nhist (4) number of histograms per dimension
-descr_nori (8) number of bins in each histogram
-descr_lambda (6) sets how local the descriptor is
-verb_keys label flag to output the intermediary sets of keypoints
-verb_ss label flag to output the scalespaces (Gaussian and DoG)
EXTRA parameters
-ss_fnspo (3.00) float number of scales per octaves (-ss_nspo not used)
-flag_semigroup BOOL (1) semigroup (1) or direct (0)
-flag_dct BOOL (0) dct (1) or discrete (0)
-flag_log BOOL (0) normalized Laplacian (1) or DoG (0)
-flag_interp (0) bilin (0) / DCT (1)/ bsplines (3,5,..,11)
-itermax 5 max number of iterations
-epsilon FLT_EPSILON (for _myfloat comparison)
-dog_nspo 3 number of scales per octaves for DoG operator definition
-ofstMax_X (0.5) interpolation validity domain definition in space
-ofstMax_S (0.5) ... in scale
-flag_jumpinscale (0 in gradual / not an option in gradual)
-------------------------------------------------------------------------------
Parameter std value Definition
-------------------------------------------------------------------------------
n_oct 8 number of octaves in the scale-space,
n_spo 3 number of scales per octave,
sigma_min 0.8 minimal level of blur featured in the scale-space,
delta_min 0.5 minimal inter-pixel distance featured in the
scale-space,
sigma_in 0.5 assumed level of blur in the input image,
C_DoG 1.33 (0.04/3) threshold on DoG operator (expressed for n_spo = 3),
C_edge 10 threshold on the ratio of principal curvatures,
n_bins 36 number of bins in the orientation histogram,
lambda_ori 1.5 sets the width of the orientation patch,
t 0.8 reference orientation minimal value in the histogram,
n_hist 4 the descriptor is composed of n_histXnhist weighted
histograms,
n_ori 8 each weighted histogram is composed of n_ori bins,
lambda_descr 6.0 sets the width of the descriptor patch.
label string used to label all the extra output files.
-------------------------------------------------------------------------------
List of files produced by sift_cli with the '-verb_keys' option.
-------------------------------------------------------------------------------
Filename Content
-------------------------------------------------------------------------------
1) extra_NES_[label].txt discrete 3D extrema of DoG,
2) extra_DoGSoftThresh_[label].txt discrete 3D extrema passing a
conservative threshold on DoG,
3) extra_ExtrInterp_[label].txt interpolated 3D extrema,
4) extra_DoGThresh_[label].txt interpolated extrema passing the
threshold on DoG,
5) extra_OnEdgeResp_[label].txt interpolated extrema passing the
Harris-Stephen edgeness test,
6) extra_FarFromBorder_[label].txt keypoints with reference orientation,
Each line of theses files follows the data formatting
x y sigma theta octa sca
where (octa) is the octave index and (sca) is the scale index.
10) extra_keypoints_[label].txt keypoints with descriptors
Each line of this file follows the data formatting
x y sigma theta octa sca fv[1] fv[2] ... fv[d] ...
... orihist[1] ... orihist[n_bins]
where (fv) is the feature vector of dimension d=n_hist*n_hist*n_ori
and (orihist) is the orientation histogram of n_bins bins.
===============================================================================
== MATCHING executable - Usage ================================================
Usage: match_cli keys1 keys2 [options...]
-ori_nbins (36) number of bins in the orientation histogram
(used only for keypoints input/output)
-descr_nhist (4) number of histograms per dimension
-descr_nori (8) number of bins in each histogram
-absolute thresh (250) threshold applied on the euclidean distance
-relative thresh (0.6) threshold applied on the ratio of distance
-verb label flag for output
The output is a list of matches with the following formatting
x1 y1 sigma1 theta1 x2 y2 sigma2 theta 2
-------------------------------------------------------------------------------
List of files produced by match_cli with the 'verb' option
-------------------------------------------------------------------------------
Filename Content
-------------------------------------------------------------------------------
1) OUTmatches.txt The pairs matches,
2) [label]_im0.txt The subset of matching keypoints in the first image
3) [label]_im1.txt The subset of matching keypoints in the second image
-------------------------------------------------------------------------------
File 1) has the following formatting:
key1 key2a key2b
where (key1) designates a keypoint in image1, (key2a) and (key2b) designate
respectively the nearest and the second nearest neighbors in image 2.
The data relative to each keypoint is formatted as follows
x y sigma theta fv[1] fv[2] ... fv[d]
octa sca orihist[1] ... orihist[n_bins]
where (fv) is the feature vector of dimension d=n_hist*n_hist*n_ori and
(orihist) is the orientation histogram of n_bins bins.
Files 3) 4) and 5) have the same formatting:
x y sigma theta fv[1] fv[2] ... fv[d]
octa sca orihist[1] ... orihist[n_bins]
===============================================================================
== lib_sift.h ================================================================
File lib_sift.h provides a simplified interface to the sift library.
To extract the keypoint from the SIFT scale-space.
struct sift_keypoint_std* sift_compute_points(double* x, int w,
int h, int* n);
To compute the feature descriptors for oriented keypoints provided by the user:
void sift_fill_descriptors(double *x, int w, int h,
struct sift_keypoint_std *k, int n);
To compute orientations and feature descriptors for keypoints provided
by the user:
void sift_fill_descriptors(double *x, int w, int h,
struct sift_keypoint_std *k, int n);
To run the standard sift algorithm:
struct sift_keypoint_std *sift_compute_features(double *x,
int w, int h, int *n);
For input/output:
struct sift_keypoint_std *sift_read_from_file(char *filename, int *n);
void sift_write_to_file(char *filename, struct sift_keypoint_std *k, int n);
===============================================================================
== How to link your code to lib_sift.h =====================================
These are the steps to follow in order to use the library lib_sift.h
in a code.
1) add #include "lib_sift.h"
2) compile object files:
lib_sift.o
lib_sift_anatomy.o
lib_scalespace.o
lib_keypoint.o
lib_description.o
lib_discrete.o
3) link
Here a two short examples of source code with their respective compilation
commands.
//--------------------- example.c -----------------------------------
#include <stdlib.h>
#include "lib_sift.h"
int main(void)
{
// create input image
int w = 300;
int h = 200;
float *x = malloc(w*h*sizeof(*x));
for (int i = 0; i < w*h; i++)
x[i] = rand();
// compute sift keypoints
int n;
struct sift_keypoint_std *k = sift_compute_points(x, w, h, &n);
// write to standard output
sift_write_to_file("/dev/stdout", k, n);
// cleanup
free(k);
free(x);
return 0;
}
//-------------------------------------------------------------------------
gcc -std=c99 -c -o lib_keypoint.o lib_keypoint.c
gcc -std=c99 -c -o lib_discrete.o lib_discrete.c
gcc -std=c99 -c -o lib_scalespace.o lib_scalespace.c
gcc -std=c99 -c -o lib_sift_anatomy.o lib_sift_anatomy.c
gcc -std=c99 -c -o lib_description.o lib_description.c
gcc -std=c99 -c -o lib_sift.o lib_sift.c
gcc -std=c99 -c -o lib_util.o lib_util.c
gcc -std=c99 -o example example.c lib_sift.o lib_sift_anatomy.o \
lib_keypoint.o lib_scalespace.o lib_description.o \
lib_discrete.o lib_util.o -lm
//--------------------- example2.c -----------------------------------
#include <stdlib.h>
#include <stdio.h>
#include "lib_sift.h"
#include "io_png.h"
int main(int argc, char **argv)
{
if(arg != 2){
fprintf(stderr, "usage:\n./exemple2 image\n");
return -1;
}
// Loading image
size_t w, h;
float* x = io_png_read_f32_gray(argv[1], &w, &h);
for(int i=0; i < w*h; i++)
x[i] /=256.;
// compute sift keypoints
int n;
struct sift_keypoint_std *k = sift_compute_features(x, w, h, &n);
// write to standard output
sift_write_to_file("/dev/stdout", k, n);
// cleanup
free(k);
free(x);
return 0;
}
//-------------------------------------------------------------------------
gcc -std=c99 -c -o lib_keypoint.o lib_keypoint.c
gcc -std=c99 -c -o lib_discrete.o lib_discrete.c
gcc -std=c99 -c -o lib_scalespace.o lib_scalespace.c
gcc -std=c99 -c -o lib_sift_anatomy.o lib_sift_anatomy.c
gcc -std=c99 -c -o lib_description.o lib_description.c
gcc -std=c99 -c -o lib_sift.o lib_sift.c
gcc -std=c99 -c -o lib_util.o lib_util.c
gcc -std=c99 -c -o io_png.o io_png.c
gcc -std=c99 -o example example.c lib_sift.o lib_sift_anatomy.o \
lib_keypoint.o lib_scalespace.o lib_description.o \
lib_discrete.o lib_util.o io_png.o -lm -lpng
===============================================================================
== Comparison with other implementations ======================================
The executable provided by D.Lowe
(http://www.cs.ubc.ca/~lowe/keypoints/, retrieved on September 11th,2014)
uses a different coordinate system.
This results in different orientation and different feature vectors.
In Lowe's executable, the x component increases to the right and the y component
increases upward and the coordinate system adopted in the description phase.
In this code, the x component increases downward and the y component increases
to the right. This is consistent with the coordinate system used during
detection.
A conversion tool is provided in the source called anatomy2lowe.c to convert
To compile this tool
gcc -o anatomy2lowe anatomy2lowe.c -std=c99
===============================================================================
= Generating the doxygen
In the src/ directory type :
doxygen -g
doxygen Doxyfile
doxygen documentation in directory ./html/
===============================================================================
== Acknowledgements ===========================================================
Work partially supported by
Centre National d’Etudes Spatiales (CNES, MISS Project),
European Research Council (Advanced Grant Twelve Labours),
Office of Naval Research (Grant N00014-97-1-0839),
Direction Generale de l’Armement (DGA),
Fondation Mathematique Jacques Hadamard,
Agence Nationale de la Recherche (Stereo project).
The author would like to thank Enric Meinhardt-Llopis for fruitful comments
and discussions.
===============================================================================