Miners¶

Mining functions take a batch of n embeddings and return k pairs/triplets to be used for calculating the loss:

Pair miners output a tuple of size 4: (anchors, positives, anchors, negatives).
Triplet miners output a tuple of size 3: (anchors, positives, negatives).
Without a tuple miner, loss functions will by default use all possible pairs/triplets in the batch.

You might be familiar with the terminology: "online" and "offline" miners. These miners are online. Offline miners should be implemented as a PyTorch Sampler.

Miners are used with loss functions as follows:

from pytorch_metric_learning import miners, losses
miner_func = miners.SomeMiner()
loss_func = losses.SomeLoss()
miner_output = miner_func(embeddings, labels)
losses = loss_func(embeddings, labels, miner_output)

AngularMiner¶

miners.AngularMiner(angle=20, **kwargs)

Parameters

angle: The miner will return triplets that form an angle greater than this input angle. The angle is computed as defined in the angular loss paper

Default distance:

LpDistance(p=2, power=1, normalize_embeddings=True)
- This is the only compatible distance.

BaseMiner¶

All miners extend this class and therefore inherit its __init__ parameters.

miners.BaseMiner(collect_stats=False, distance=None)

Every miner outputs a tuple of indices:

Pair miners output a tuple of size 4: (anchors, positives, anchors, negatives)
Triplet miners output a tuple of size 3: (anchors, positives, negatives)

See custom miners for details on how to write your own miner.

Parameters:

collect_stats: If True, will collect various statistics that may be useful to analyze during experiments. If False, these computations will be skipped. Want to make True the default? Set the global COLLECT_STATS flag.
distance: A distance object. If None, then the default distance will be used.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

Required Implementations:

# Return indices of some form
def mine(self, embeddings, labels, ref_emb, ref_labels):
    raise NotImplementedError

Note: by default, embeddings == ref_emb and labels == ref_labels.

# Validate the output of the miner. 
def output_assertion(self, output):
    raise NotImplementedError

BatchEasyHardMiner¶

Improved Embeddings with Easy Positive Triplet Mining

Returns positive and negative pairs according to the specified pos_strategy and neg_strategy.

To implement the loss function described in the paper, use this miner in combination with NTXentLoss(temperature=0.1).

miners.BatchEasyHardMiner(
        pos_strategy=BatchEasyHardMiner.EASY,
        neg_strategy=BatchEasyHardMiner.SEMIHARD,
        allowed_pos_range=None,
        allowed_neg_range=None,
        **kwargs
    ):

Parameters

pos_strategy: one of the following:
- BatchEasyHardMiner.HARD or "hard": returns the hardest positive sample per anchor.
- BatchEasyHardMiner.SEMIHARD or "semihard": returns the hardest positive sample per anchor, such that it is closer than the selected negative.
- BatchEasyHardMiner.EASY or "easy": returns the easiest positive sample per anchor
- BatchEasyHardMiner.ALL or "all": returns all possible positive pairs
neg_strategy: one of the following:
- BatchEasyHardMiner.HARD or "hard": returns the hardest negative sample per anchor.
- BatchEasyHardMiner.SEMIHARD or "semihard": returns the hardest negative sample per anchor, such that it is further than the selected positive.
- BatchEasyHardMiner.EASY or "easy": returns the easiest negative sample per anchor
- BatchEasyHardMiner.ALL or "all": returns all possible negative pairs
allowed_pos_range: Optional tuple containing the allowed range of anchor-positive distances/similarties. For example, allowed_pos_range = (0.2, 1). If None, then a range is not applied.
allowed_neg_range: Optional tuple containing the allowed range of anchor-negative distances/similarties. For example, allowed_neg_range = (0.2, 1). If None, then a range is not applied.

Restrictions

pos_strategy and neg_strategy cannot both be set to "semihard"
If pos_strategy is set to "semihard", then neg_strategy cannot be set to "all", and vice versa.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

BatchHardMiner¶

In Defense of the Triplet Loss for Person Re-Identification

For each element in the batch, this miner will find the hardest positive and hardest negative, and use those to form a single triplet. So for a batch size of N, this miner will output N triplets.

This miner is equivalent to using miners.BatchEasyHardMiner(pos_strategy="hard", neg_strategy="hard"), and converting the output pairs to triplets.

miners.BatchHardMiner(**kwargs)

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

DistanceWeightedMiner¶

Implementation of the miner from Sampling Matters in Deep Embedding Learning.

miners.DistanceWeightedMiner(cutoff=0.5, nonzero_loss_cutoff=1.4, **kwargs)

Parameters

cutoff: Pairwise distances are clipped to this value if they fall below it.
nonzero_loss_cutoff: Pairs that have distance greater than this are discarded.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)
- This is the only compatible distance.

Important note:

This miner works well only with low dimensionality embeddings (e.g 64-dim) and L2-normalized distances. Check out UniformHistogramMiner for a miner that is roughly equivalent, but works with embeddings of any dimensionality and any distance metric.

EmbeddingsAlreadyPackagedAsTriplets¶

If your embeddings are already ordered sequentially as triplets, then use this miner to force your loss function to use the already-formed triplets.

miners.EmbeddingsAlreadyPackagedAsTriplets()

For example, here's what a batch size of size 6 should look like:

torch.stack([anchor1, positive1, negative1, anchor2, positive2, negative2], dim=0)

HDCMiner¶

Hard-Aware Deeply Cascaded Embedding

miners.HDCMiner(filter_percentage=0.5, **kwargs)

Parameters:

filter_percentage: The percentage of pairs that will be returned. For example, if filter_percentage is 0.25, then the hardest 25% of pairs will be returned. The pool of pairs is either externally or internally set. See the important methods below for details.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

Important methods:

# Pairs or triplets extracted from another miner, 
# and then passed in to HDCMiner using this function
def set_idx_externally(self, external_indices_tuple, labels):
    self.a1, self.p, self.a2, self.n = lmu.convert_to_pairs(external_indices_tuple, labels)
    self.was_set_externally = True

# Reset the internal state of the HDCMiner
def reset_idx(self):
    self.a1, self.p, self.a2, self.n = None, None, None, None
    self.was_set_externally = False

Example of passing another miner output to HDCMiner:

minerA = miners.MultiSimilarityMiner(epsilon=0.1)
minerB = miners.HDCMiner(filter_percentage=0.25)

hard_pairs = minerA(embeddings, labels)
minerB.set_idx_externally(hard_pairs, labels)
very_hard_pairs = minerB(embeddings, labels)

MultiSimilarityMiner¶

miners.MultiSimilarityMiner(epsilon=0.1, **kwargs)

Parameters

epsilon:
- Negative pairs are chosen if they have similarity greater than the hardest positive pair, minus this margin (epsilon).
- Positive pairs are chosen if they have similarity less than the hardest negative pair, plus this margin (epsilon).

Default distance:

CosineSimilarity()

PairMarginMiner¶

Returns positive and negative pairs that violate the specified margins.

miners.PairMarginMiner(pos_margin=0.2, neg_margin=0.8, **kwargs)

Parameters

pos_margin: The distance (or similarity) over (under) which positive pairs will be chosen.
neg_margin: The distance (or similarity) under (over) which negative pairs will be chosen.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

TripletMarginMiner¶

Returns hard, semihard, or all triplets.

miners.TripletMarginMiner(margin=0.2, type_of_triplets="all", **kwargs)

Parameters

margin: The difference between the anchor-positive distance and the anchor-negative distance.
type_of_triplets:
- "all" means all triplets that violate the margin
- "hard" is a subset of "all", but the negative is closer to the anchor than the positive
- "semihard" is a subset of "all", but the negative is further from the anchor than the positive
- "easy" means all triplets that do not violate the margin.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

UniformHistogramMiner¶

Returns pairs that have uniformly distributed distances. This is like DistanceWeightedMiner, except that it works well with high dimension embeddings, and works with any distance metric (not just L2 normalized distance).

miners.UniformHistogramMiner(num_bins=100, 
                            pos_per_bin=10, 
                            neg_per_bin=10, 
                            **kwargs):

Parameters

num_bins: The number of bins to divide the distances into. For example, if the distances for the current batch range from 0 to 2, and num_bins = 100, then each bin will have a width of 0.02.
pos_per_bin: The number of positive pairs to mine for each bin.
neg_per_bin: The number of negative pairs to mine for each bin.

Default distance:

LpDistance(normalize_embeddings=True, p=2, power=1)

Example:

from pytorch_metric_learning.miners import UniformHistogramMiner
from pytorch_metric_learning.distances import SNRDistance

miner = UniformHistogramMiner(
    num_bins=100,
    pos_per_bin=25,
    neg_per_bin=33,
    distance=SNRDistance(),
)

In a given batch, this will:

Divide up the positive distances into 100 bins, and return 25 positive pairs per bin, or 0 if no pairs exist in that bin
Divide up the negative distances into 100 bins, and return 33 negative pairs per bin, or 0 if no pairs exist in that bin