Generalized dataset plugin by wermos · Pull Request #429 · ILLIXR/ILLIXR

wermos · 2024-11-05T14:10:46Z

A new, updated version of the old PR (#393), which incorporates the new changes in ILLIXR (spdlog, CMake).

wermos · 2024-11-05T14:10:59Z

I'm marking it as draft for now.

wermos · 2024-12-09T18:01:13Z

I want to unmark this PR as a draft and open it for code review. There are some parts of this that I need some feedback and guidance on, most notably the data format and CMake integration (and YAML parsing, which is related to the CMake thing). To that end, I will explain the design of the dataset plugin so that everyone is on the same page.

wermos · 2024-12-09T18:02:03Z

Plugin Design

The original proposal can be read here. I will rewrite the most important and pertinent parts here again.

The high level overview is that, the work for the plugin modifies the YAML parsing code, and implements a system to load in and store data in memory and publish it at certain intervals. It also has some customization points to be more future-proof and robust and accomodate more varied datasets.

Implementation Details

YAML Side

The proposed YAML syntax is discussed at length in the proposal. I will simply copy the full example from the end of the proposal:

delimiter: ',' # tells us what the delimiter for the data is.
# We can make the delimiter configurable and keep ',' as default.

root_path: /path/to/dataset # this tells us where the dataset is on the system
# we can have this path variable work just like the existing `data` option (as
# described here), with support for downloading files and stuff.

# All paths from this point forward are relative to `root_path`
imu:
  - timestamp_units: microseconds # somewhat fast IMU

  - path: /path/to/imu/data1
    format: true # this means that linear acceleration is first, followed by
                 # angular velocity
  - path: /path/to/imu/data2
    format: false

image:
  - timestamp_units: milliseconds
  - rgb:
      # rgb1 will be left eye, rgb2 will be right eye, etc.
      - path: /path/to/left/eye/rgb/images
      - path: /path/to/right/eye/rgb/images
      # There can be more if needed
  - depth:
      # depth1 will be left eye, depth2 will be right eye, etc.
      - path: /path/to/left/eye/depth/images
      - path: /path/to/right/eye/depth/images
      # There can be more if needed
  - grayscale:
      # grayscale1 will be Camera 1, grayscale2 will be Camera 2, etc.
      - path: /path/to/first/camera/grayscale/images
      - path: /path/to/second/camera/depth/images
      # There can be more if needed

pose:
  - timestamp_units: nanoseconds
  - path: /path/to/pose/data1
  - path: /path/to/pose/data2

ground_truth:
  - timestamp_units: nanoseconds
  - path: /path/to/groundtruth/data

(Note that there are some differences between the proposed YAML syntax in the proposal vs what the plugin currently supports. Where the two differ, this comment has the more up-to-date information.)

C++ Side

The dataset plugin has 3 main internal classes, and 1 ILLIXR-facing class.

The internal classes are:

Config: This reads data from environment variables and sets up internal flags and configuration data. For example, in the format of the IMU data, whether the linear acceleration or the angular acceleration is first, is something we have as a config flag.
DatasetLoader: Using the information retrieved by the config classes, it loads in all the data and stores it in a multimap with the timestamps as keys and the data as the values.
Emitter: Since there are 4 channels of data to write, the emitter collates all the information together and then emits the relevant data at the correct time. It also has a couple of helper that allow the Publisher class figure out how long to make the dataset plugin sleep for, and if there is more data to emit.

The Publisher is the class that ties together the internal classes and establishes the link to the ILLIXR system as a whole via the writers and the managed thread model (waking up every specified internal to emit some more test data).

Possible Improvements

The Config struct data does not need to live in memory after the dataset is loaded in via the DatasetLoader. Currently, it lives as a member variable in the DatasetLoader instance, due to the simplicity of that implementation.
DatasetLoader would probably be simpler as a POD struct instead of a singleton class. That simplifies a lot of the calls in the emitter and eliminates unnecessary hassle.
The emit function in the Emitter class is not that robust currently. Whenever it wakes up, it simply emits all the dataset items up to the current time. This was done because it's a simpler implementation.
- A far superior and smarter approach would be to construct an interval $[t_{\text{current}} - \varepsilon, t_{\text{current}}]$ (where $\varepsilon$ is a constant that is heuristically chosen), and drop anything in the list outside that interval (while informing the user), and emit everything in that interval.

All of these are enhancements that can be made after the basic system is up and running and testable.

CMake

The CMake build system integration is a bit difficult to understand and opaque to me, an outsider.

Specifically, I need help with what files to modify to make the YAML parsing code understand the meaning of the new config files.

I know that the YAML parsing is done in CMake, but where and how exactly is information passed from the config to the ILLIXR code? For example, the path to the dataset (the data config value).

I have written the dataset plugin's CMake code and it builds (or at least tries to). But I need some help with the YAML parsing part.

Since it looks to me like @astro-friedel did most of the CMake work, I am pinging you for help on this matter.

YAML to C++ Message-Passing

A key step in the pipeline for the dataset plugin is the passing of information from the YAML config file to the C++ code.

Among other things, I need to know where the data files are, some basic things about the format, etc.

My original design, pre-CMake integration, was to simply define a bunch of environment variables via the Makefile and the Python script. However, the issue with that approach is that it is somewhat brittle, hacky, and Linux-specific.

A more robust approach would be to use CMake's configure_file feature where it can write a bunch of #define macros during the configuration step. (See here for more details.) Then, my C++ code can simply read those #defines to get the dataset configuration info it needs. However, this configure_file step would also need to be done in the YAML parsing stage, which takes us back to my original question.

wermos · 2024-12-09T18:02:51Z

Questions

The plugin, as of now, is not fully compiling yet. There are a few bugs that I need to iron out, but they require some input from the ILLIXR core devs.

Channels

In the dataset plugin, I have taken extra care to ensure that every part (Image, IMU, Pose, Ground Truth) supports reading in data from multiple files. We have the option to also publish this information, via a channel parameter (1 means the first file, and so on). The current definitions of imu_type:

ILLIXR/include/illixr/data_format.hpp

Lines 18 to 27 in 1963c10

    
           struct imu_type : switchboard::event { 
        
               time_point      time; 
        
               Eigen::Vector3d angular_v; 
        
               Eigen::Vector3d linear_a; 
        
               imu_type(time_point time_, Eigen::Vector3d angular_v_, Eigen::Vector3d linear_a_) 
        
                   : time{time_} 
        
                   , angular_v{std::move(angular_v_)} 
        
                   , linear_a{std::move(linear_a_)} { } 
        
           };

and pose_type:

ILLIXR/include/illixr/data_format.hpp

Lines 99 to 113 in 1963c10

    
           struct pose_type : public switchboard::event { 
        
               time_point         sensor_time; // Recorded time of sensor data ingestion 
        
               Eigen::Vector3f    position; 
        
               Eigen::Quaternionf orientation; 
        
               pose_type() 
        
                   : sensor_time{time_point{}} 
        
                   , position{Eigen::Vector3f{0, 0, 0}} 
        
                   , orientation{Eigen::Quaternionf{1, 0, 0, 0}} { } 
        
               pose_type(time_point sensor_time_, Eigen::Vector3f position_, Eigen::Quaternionf orientation_) 
        
                   : sensor_time{sensor_time_} 
        
                   , position{std::move(position_)} 
        
                   , orientation{std::move(orientation_)} { } 
        
           };

don't have a channel parameter. If this feature is useful, then I would need to modify those structs as required. So, is this feature something we would be interested in?

Publishing Structs

There used to be a cam_type struct:

ILLIXR/common/data_format.hpp

Lines 22 to 31 in 5913619

    
           struct cam_type : switchboard::event { 
        
               time_point time; 
        
               cv::Mat    img0; 
        
               cv::Mat    img1; 
        
               cam_type(time_point _time, cv::Mat _img0, cv::Mat _img1) 
        
                   : time{_time} 
        
                   , img0{_img0} 
        
                   , img1{_img1} { } 
        
           };

which has since been removed, which I was using to publish image data. What should I use to publish it now?

The other data that I need to publish is the ground truth. We had decided (@jianxiapyh and I) that the simplest and most robust thing to do would be to perform no computations on the ground truth data, and to kind of just pass them through the dataset plugin and emit it at the correct times as a string.

However, we need a struct for that in data_format.hpp. So is it fine if I just add one in? The ground truth data struct would simply be a very thin wrapper around Eigen::VectorXd.

ILLIXR Profiles

There are many YAML files throughout the ILLIXR repo. There is a plugins/plugins.yaml and there's also a bunch of profiles in the profiles/ directory. I need some explanation of what plugins/plugins.yaml does, and which of the profiles plugins I should use to test my dataset plugin.

The end result I can envision of my work is that I can demonstrate one of the profiles using an actual dataset and moving the camera and showing the images from that dataset.

astro-friedel · 2024-12-10T17:49:23Z

First off, the struct cam_type has not been removed, just moved to a different headers file.

As part of the hand tracking work I have notable re-worked the pose_type and image type structs to be more general. You can find the most recent code here.

As far as the YAML parsing in CMake, there is not much done there. The input profile file (e.g. native_gl.yaml) is parsed to determine what plugins need to be built, and to generate the illixr.yaml file which can be used by the runtime. CMake reads the profiles.yaml file (which list all available profiles) and generates the individual profiles/*.yaml files. This is mostly done in cmake/HelperFunctions.cmake. CMake itself does not feed anything from the yaml files into the C++ code. The runtime (main.dbg.exe) reads the given profile yaml file and passes the parameters to the C++ code.

wermos added 5 commits October 6, 2024 23:54

wip commit, first pass at modernizing the old code.

099e911

Removed some publisher classes.

b484bd9

Removed some now-unnecessary source files.

b4038e4

Merge branch 'ILLIXR:master' into new-gen-plugin

4aee3cd

Minor changes.

2e3b766

[pre-commit.ci] Run clang-format

ce37840

wermos marked this pull request as draft November 5, 2024 14:11

This was referenced Nov 5, 2024

Generalized dataset plugin #393

Closed

Unify file io - issue #118 #416

Merged

wermos and others added 16 commits November 5, 2024 19:44

Switched to using spdlog instead of using iostream.

abec0aa

WIP commit for refactor.

85daff5

[pre-commit.ci] Run clang-format

e99e02e

Wrote up a full implementation of the DataEmitter class.

69b0a19

Removed useless cruft from the publisher class.

e7b8da7

[pre-commit.ci] Run clang-format

8c0a2b6

Modified the plugin YAML files to make it easier to work with for me.

c9e039c

Updated CMakeLists.txt.

62c3c7b

Updated include paths.

968f168

More include path fixes.

5593c88

Updated CMakeLists.txt to link to Eigen3 and OpenCV.

bb8f447

Commented a sanity check for now.

7da1cee

Fixed Eigen's include path.

d178175

[pre-commit.ci] Run clang-format

b528ae7

Fixed some build errors.

4514935

[pre-commit.ci] Run clang-format

b005c82

wermos marked this pull request as ready for review December 9, 2024 18:02

wermos and others added 3 commits December 9, 2024 23:43

Misc. fixes.

7a85579

[pre-commit.ci] Run clang-format

c394ac6

Fixed a time_point error.

7060ef4

Jebbly marked this pull request as draft December 18, 2024 17:19

wermos added 2 commits December 20, 2024 11:21

Fixed a typo.

afbaee2

Merge branch 'master' into new-gen-plugin

14c12b5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalized dataset plugin#429

Generalized dataset plugin#429
wermos wants to merge 27 commits intoILLIXR:masterfrom
wermos:new-gen-plugin

wermos commented Nov 5, 2024

Uh oh!

wermos commented Nov 5, 2024

Uh oh!

wermos commented Dec 9, 2024

Uh oh!

wermos commented Dec 9, 2024

Uh oh!

wermos commented Dec 9, 2024

Uh oh!

astro-friedel commented Dec 10, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

wermos commented Nov 5, 2024

Uh oh!

wermos commented Nov 5, 2024

Uh oh!

wermos commented Dec 9, 2024

Uh oh!

wermos commented Dec 9, 2024

Plugin Design

Implementation Details

YAML Side

C++ Side

Possible Improvements

CMake

YAML to C++ Message-Passing

Uh oh!

wermos commented Dec 9, 2024

Questions

Channels

Publishing Structs

ILLIXR Profiles

Uh oh!

astro-friedel commented Dec 10, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants