Multiple Device Roadmap

u_cap 2014-06-09 12:26:45 UTC #1

Thread branched from general V2 tracking:
https://community.leapmotion.com/t/v2-tracking-now-in-public-developer-beta/1202/23

jdonald: No, we're not determined to ship the overlapping interaction space case before the multiple interaction spaces. However, the latter isn't as easy as one might think. Even our quick-and-dirty implementations raise questions of differing device framerates, dropped frames, or multiple instances in our API. Some of these challenges are absent in the single interaction space case.

So we're working on both.

codemercenary: I'm the lead on multi-device support right now.

Broadly speaking, the processing chain for the Leap is broken up into four steps: Image acquisition, image processing, geometry reconstruction, and then tracking. For multi-device support to work, we intend to run the first three stages in parallel--it's only when we get to Tracking that a question arises about what should be done with the simultaneous inputs.

Internally, our services uses a context-based dependency injection framework as the higher level architecture. The benefit of this approach is that we can insert filter layers between the components at runtime in order to alter their behavior, which means that the approach we choose for implementing multi-device support doesn't lock us in to any specific architecture decisions. As you intuited, it's a lot simpler to assume multiple devices with non-overlapping fields of view than it is to attempt to perform any kind of integration.

The problem, however, doesn't necessarily lie with how we intend to operate the Tracking module. Instead, it's more focused on the API that we're providing the to you, the developer. Once we add a second device, we invalidate the implicit assumption that a single frame equals a single user or that the objects in that frame are globally unique.

We could, of course, generalize our API. We could provide a secondary interface that allows you to enumerate the set of available devices and then you create a controller object for those devices--but the complexity of this gets to runaway levels awfully fast. How do we tell you when a device is detached? What response should you make when this happens? Does a controller abstraction make sense anymore when it's bound to a specific device? What should be done in the event of reattachment of a previously identified device? It's not that these questions don't have answers; rather, it's that the answers involve invalidating prior assumptions about how those interfaces are supposed to work.

*So, if we want to preserve existing interfaces, we have a few options about how to proceed:
The most obvious and easiest is Strict Redundant Nonoverlapping. This means that the Leaps are used in settings where their fields of view strictly do not overlap, but we treat each as a redundant source of user input. This means that a hand positioned squarely over one Leap will generate the same API output as a hand positioned squarely over the other. For testing this is fine, but obviously this is going to yield a really bad user experience if the devices are almost nonoverlapping rather than strictly nonoverlapping.*

The next option is Fixed Spatial Relationship. The user (or a tool, or an OEM, etc) is responsible for configuring in the spatial relationship between two devices directly into a configuration file, and then we read the difference and apply a correction between the reconstruction and tracking stages. Assuming that the spatial relationship is very precisely defined and doesn't change while the devices are in use, this is a pretty easy way to use the devices to achieve the aim of extending the field of view. Unfortunately, the relationship is going to have to be very precisely defined, and practically would require some kind of custom-made jig to hold the devices, or the user would have to epoxy them to some surface and then run calibration in a separate step. Again, useful for testing, not so much for anything but niche applications.

The final option, and the one we ultimately want to achieve, is Dynamic Spatial Relationship. In this model, overlapping fields of view are detected dynamically at runtime by a Recombinator stage, and then we update the relative offsets on a frame-by-frame basis. This is perhaps the only implementation that would actually satisfy the requirements of a viable product, but each of the previously mentioned stages are all waypoints that get us closer to this final objective. I expect that we will implement each in the order listed, though whether or not we release each mode or support them as a public setting will depend on timelines and community desire.

V2 Tracking Now in Public Developer Beta

u_cap 2014-06-09 12:40:10 UTC #2

DavidH: Thus far we're only thinking of it in terms of the following options:

Option 1: Two controllers looking at the same space (and thus our software needs to combine things into one calibrated space)

Option 2: Two controllers looking at different spaces (thus people need to be able to access api data of each controller separately).

Option 1 can only be supported with our new tracking software (which is in private beta) and thus was held up on that. Option 2 in theory could be done anywhere, but we would prefer to also only support it in the newer software.

From:
https://community.leapmotion.com/t/multiple-leap-motion-support/770

u_cap 2014-06-09 12:43:24 UTC #3

DavidH:
The different modes use the same amount of USB bandwidth except for 'low resource' mode.

Multiple devices will require multiple USB 2 buses or a USB 3 bus with a USB 3 hub (or multiple USB 3 ports).

From:
https://community.leapmotion.com/t/leap-motion-bandwidth-use-modes-e-g-multiple-devices/1265

u_cap 2014-06-09 13:10:00 UTC #4

Hello Jason,

Quick feedback, per earlier discussion with DavidH - the device enumeration issue has some similarities with supporting multiple controllers on a game console. As a developer, my preference is to go with the simplest solution for the initial (beta/developer) releases, which is that any removal will abort tracking, and any addition will be ignored. If device enumeration is complicated by not being able to persistently ID the devices, always abort.

Further, I still suggest that developers will benefit from an initial release in which all four stages of your processing chain are run in parallel, i.e. no tracking integration is attempted at all. I'd be willing to arbitrate between 2+ completely separate tracking results myself to get started.

Being able to run two instances of the processing chain would also facilitate a release prior to resolving issues of differing device framerates and dropped frames.

My rationale for a "two chains" approach is that some of your developers are currently working across LAN or with VM instances just to work with two devices.

u_cap 2014-06-09 14:03:19 UTC #5

For simplicity of discussion, here's a sketch of an extension of the existing API, starting with modified version of

DeviceList Controller::devices()

How about adding

void Controller::makeCurrent( int64_t )

assuming a unique, persistent device ID. Subsequent calls to

Frame Controller::frame(int history = 0)
bool addListener(Listener & listener)

etc. apply to only the current device. You could add a legacy mode - active by default - to automatically make the first device found current, or an only device, and a config that can disable this legacy mode.

This approach would be less awkward if the API had

void Listener::onFrame(const Frame &)

instead of

void onFrame(const Controller &)

which has to use

 Frame Controller::frame(int history = 0)

That call is not thread safe - see my discussion with Raffi

https://developer.leapmotion.com/forums/forums/10/topics/1919

ten months ago. Further, Controller HasA Devices, extended to Has All Devices, implies Controller is a singleton anyway. The Controller::Frame() call should be driven by

  int64_t Frame::id()

not a queue index. I'd also pitch again for

  int64_t Frame::timestamp()

Adding

  int64_t Device::id()
  int64_t Frame::getDeviceId()

would allow the app to register a single listener for all devices. Using an Id instead of a const reference avoids addition/removal issues (unique, increment on every change). In turn, it might be worth adding

  Device& Controller::getDevice::( int64_t )

to avoid list sorting issues.

I understand that this raises legacy issues as this API does not have a notion of Frames generated from the data from multiple devices. You could deprecate Controller::devices() - no significant legacy- and change from Controller HasA Device to attaching Controller to Device. You could then explicitly distinguish between Controller and MultiController, the latter able to attach to multiple Devices and capable of integration:

void Controller::setDevice( int64_t )
void MultiController::addDevice( int64_t )
void MultiController::removeDevice( int64_t )

I am not saying that this is a good approach. I also have no idea how this translates into web and other non-C++ APIs. Maybe you could sketch in a similar manner how you intend to extend the API. I'd also be curious to know whether you will have to extend the firmware to support persistent controller IDs. The LMCs are - for now - interchangeable, but the same is not true e.g. for USB input devices such as joysticks and gamepads, and add/remove scenarios are easier to handle if devices have some kind of persistent ID - even if set from the app.

hackerpoet 2014-06-10 20:43:12 UTC #6

To be clear, the only benefit of non-overlapping devices is simply a larger overall field of view for tracking.

The strongest advantage of multiple devices is actually when the field of views overlap because the multiple vantage points help dramatically reduce occlusion and increase robustness and accuracy. In addition, running a unified tracking mean that the API remains unchanged and there is more shared computation, meaning less CPU overhead compared to running 2 completely independent instances of Tracking.

tylerz 2014-06-10 21:06:52 UTC #7

I believe there is a great benefit of non-overlapping devices in their ability to provide multiple distinct contexts with maximum useable interaction space. We do this all the time with multiple touchscreen devices, multiple monitor setups, multiple game controllers, etc. Separating the context based on devices can allow an immediate change of gesture contexts associated with each device. This is useful for productivity applications, but also for other uses, such as music and VJ'ing, where you may want the maximum range of interaction and you can conceptualize different instruments set at the location of the distinct device locations.

So, I second, simplest multiple device support as a beginning place. Solving the more complex interest of overlapping devices is not coming in any near-term, so why not release a simple solution as a first step? As a benefit this let's the community also start working at their own solutions that may aid the process of experimenting and discovering the best solution for an overlapping setup.

zalo 2014-06-16 15:19:05 UTC #8

I "third" the idea of independent streams of data coming from each device. So long as they don't interfere with each other, adding in support for overlap could be as "simple" as finding the hands or tools that have the same magnitude of velocity between the two device streams and calculating the device offset (position and orientation (perhaps assuming a level surface)) from that.

The developer could handle that (at first) since we need to be the ones that add that calibration step into our programs anyway. Maybe later the Leap tray app can add a hotkey that will bring up a calibration window, changing device.offsetpositions relative to a device[0]. Lower level image processing changes (hacker poet's idea) based on this would be sweet.

Separate streams also leaves open the possibility for non-traditional configurations such as covering the surface of an HMD with leaps.

EDIT: We could also write simple functions that could take multiple Handlists and "merge" them into a unified hand list (perhaps weightedly averaging the hand poses between overlaps based on confidence).

flipphillips 2014-06-24 20:33:37 UTC #9

I'm following this a little bit because I do research on human perception / action and I need to be able to detect object manipulation. (Imagine, say, a 'bumpy' tennis ball in your hand or two hands) We're not terribly successful with a single tracker- but I would think that multiple trackers would somehow be able to help us here. In a perfect world we'd be able to put reference marks on the object (UV ink, say) and be able to track -its- orientation also.

Does this make sense? Any potential seen here or should we be looking at a different solution?

vimaxus 2015-01-15 12:37:19 UTC #10

We would really like an update as to the state of multiple leaps on one PC.

@codemercenary Are you still on this? Has this been abandoned? I understand the new automotive and the dragonfly implementation take precedence but please, at least give us 2 separate frames (as in 2 interaction spaces) or at least 4 raw images.
I'm sure it's not that simple but it can't be years of work either. Maybe even an unstable raw pre-alpha that gives us something.

Sorry for bumping this but since this thread has no input from any leap staff I thought maybe this one has some chance.

u_cap 2015-01-15 15:01:53 UTC #11

I agree. Most of the thread discussions have been about the trade-off between getting Something, Anything Soon and getting Everything At Once - net result we have not gotten anything.

V2 is out of beta and released, so there is no issue of V1 vs. V2 support.

I have not seen any claims that support for two entirely separate devices - no SDK integration whatsoever - is more difficult, or more time-consuming, than trying to solve every problem before shipping anything.

I'd think that offering 2 separate frames (or even 4 raw images) would be a logical milestone on the way to a fully integrated overlap.

vimaxus 2015-01-16 23:53:56 UTC #12

Thank you Leap Motion! apparently with the introduction of serialNumber in 2.2.2 we finally get what we wanted! I'm trying it now.

kpietroszek 2016-01-07 01:59:11 UTC #13

I am following this topic for a while and I'm also really interested in chaining together many leap motions. I don't expect you to do an integrated approach. It would be enough if I can connect many Leaps to single PC and I will deal with the data stitching myself