Nearest Neighbor Model #158
base: master
Conversation
LGTM, but it would be great if we could unify it with the Oracle model, in this PR if feasible.
for (const auto &pair : indexer) {
  assert(preds.at(pair.first).size() == pair.second.size());
  set_subset(preds.at(pair.first).mean, pair.second, &mean);
  set_subset(preds.at(pair.first).covariance.diagonal(), pair.second,
             &variance);
  if (preds.at(pair.first).has_covariance()) {
Do we want to do this, or just add 1e6 as variances where we currently have none?
In this method we're taking a pair of MarginalDistributions and concatenating them, so if neither of them has a defined covariance then we want to preserve that in the concatenation. Somewhere on my list of want-to-dos is to remove the optional behavior for covariances in favor of a third distribution type, something like:
using MeanOnlyDistribution = Distribution<Empty>;
using MarginalDistribution = Distribution<DiagonalMatrixXd>;
using JointDistribution = Distribution<Eigen::MatrixXd>;
or something along those lines, but that's out of scope here.
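For illustration, here's a rough standalone sketch of the concatenation behavior described above, with hypothetical names rather than the actual albatross types; the point is just that the result only carries a covariance when both inputs do:

#include <Eigen/Dense>

// Hypothetical stand-in for a marginal distribution whose covariance may
// be absent; not the actual albatross definition.
struct SketchMarginal {
  Eigen::VectorXd mean;
  bool has_covariance;
  Eigen::VectorXd variance; // diagonal of the covariance, when present
};

// Concatenate two marginals, preserving "no covariance" when either input
// lacks one.
SketchMarginal concatenate(const SketchMarginal &a, const SketchMarginal &b) {
  SketchMarginal out;
  out.mean.resize(a.mean.size() + b.mean.size());
  out.mean << a.mean, b.mean;
  out.has_covariance = a.has_covariance && b.has_covariance;
  if (out.has_covariance) {
    out.variance.resize(a.variance.size() + b.variance.size());
    out.variance << a.variance, b.variance;
  }
  return out;
}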
const JointDistribution &prediction) const {
  const NearestNeighborModel<DistanceMetric> m(*this);
  MarginalDistribution marginal_pred(
      prediction.mean, prediction.covariance.diagonal().asDiagonal());
Wouldn't prediction.covariance work here? Or are you looking to zero the non-diagonal elements?
Yeah exactly, I need to zero the non-diagonal elements since the NearestNeighbor model can never actually predict off-diagonal terms.
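As a side note, a minimal Eigen snippet (standalone, not albatross code) showing what diagonal().asDiagonal() does to the off-diagonal terms:

#include <Eigen/Dense>
#include <iostream>

int main() {
  // A small joint covariance with non-zero off-diagonal terms.
  Eigen::MatrixXd joint(2, 2);
  joint << 1.0, 0.5,
           0.5, 2.0;
  // Taking the diagonal and re-expanding it keeps only the variances,
  // zeroing the off-diagonal elements.
  Eigen::MatrixXd marginal = joint.diagonal().asDiagonal();
  std::cout << marginal << std::endl; // 1 0
                                      // 0 2
}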
std::size_t min_index = 0;
double min_distance = distance_metric(ref, features[0]);

for (std::size_t i = 1; i < features.size(); ++i) {
Could we turn min_distance into an optional so the loop can start at 0? The only difference in the loop would be a !min_distance check at the start of the if.
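A rough sketch of what that could look like, using C++17's std::optional purely for illustration (a hypothetical free function, not the PR's actual code):

#include <cstddef>
#include <optional>
#include <vector>

// Hypothetical helper: an empty optional stands in for "no distance seen
// yet", so the loop can start at index 0.
template <typename DistanceMetric, typename FeatureType>
std::size_t nearest_index(const DistanceMetric &distance_metric,
                          const FeatureType &ref,
                          const std::vector<FeatureType> &features) {
  std::size_t min_index = 0;
  std::optional<double> min_distance;
  for (std::size_t i = 0; i < features.size(); ++i) {
    const double distance = distance_metric(ref, features[i]);
    if (!min_distance || distance < *min_distance) {
      min_distance = distance;
      min_index = i;
    }
  }
  return min_index;
}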
I like that pattern better too ... but so far albatross doesn't use any optionals! So we'd have to add a third-party lib for it, which I've been avoiding (though perhaps the time has come).
An alternative is to initialize min_distance to DBL_MAX or some such, so it will always be replaced by the first distance.
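For comparison, the same hypothetical helper with that initialization (again a sketch, not the PR's actual code):

#include <cfloat>
#include <cstddef>
#include <vector>

// Hypothetical helper: seeding min_distance with DBL_MAX guarantees the
// first real distance replaces it, so this loop can also start at index 0.
template <typename DistanceMetric, typename FeatureType>
std::size_t nearest_index(const DistanceMetric &distance_metric,
                          const FeatureType &ref,
                          const std::vector<FeatureType> &features) {
  std::size_t min_index = 0;
  double min_distance = DBL_MAX;
  for (std::size_t i = 0; i < features.size(); ++i) {
    const double distance = distance_metric(ref, features[i]);
    if (distance < min_distance) {
      min_distance = distance;
      min_index = i;
    }
  }
  return min_index;
}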
Adds a model which (given a distance metric) will produce predictions for the nearest neighbor.
https://en.wikipedia.org/wiki/Nearest-neighbor_interpolation
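In rough terms, and independent of the albatross API, nearest-neighbor interpolation just returns the training target whose feature is closest to the query under the chosen metric; a minimal hypothetical sketch:

#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical illustration with scalar features and absolute distance:
// the prediction for a query is the target of its closest training point.
double nearest_neighbor_predict(const std::vector<double> &train_features,
                                const std::vector<double> &train_targets,
                                double query) {
  std::size_t min_index = 0;
  double min_distance = std::fabs(query - train_features[0]);
  for (std::size_t i = 1; i < train_features.size(); ++i) {
    const double distance = std::fabs(query - train_features[i]);
    if (distance < min_distance) {
      min_distance = distance;
      min_index = i;
    }
  }
  return train_targets[min_index];
}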