Images in the exported dataset differ from the originals #8771

Open
@imyhxy

Description

Actions before raising this issue

  • I searched the existing issues and did not find anything similar.
  • I read/searched the docs

Steps to Reproduce

Create a task with a JPEG image, then export the dataset with images. Use md5sum to compare the image in the exported dataset with the original. The checksums do not match. Sometimes an 8 MB JPEG becomes 16 MB in the exported dataset.
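
For anyone who wants to reproduce the comparison without md5sum, here is a minimal Python sketch of the same check. The function names `file_md5` and `compare_files` are made up for illustration and are not part of CVAT; the paths you pass in are whatever your original upload and extracted export happen to be.

```python
import hashlib
from pathlib import Path


def file_md5(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as fh:
        # Read fixed-size chunks so large images are not loaded at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_files(original: Path, exported: Path) -> dict:
    """Compare two files by size on disk and by MD5 digest."""
    return {
        "original_size": original.stat().st_size,
        "exported_size": exported.stat().st_size,
        "same_bytes": file_md5(original) == file_md5(exported),
    }
```

If `same_bytes` is false and `exported_size` is noticeably larger, that would be consistent with the image having been decoded and re-encoded somewhere along the export path, although this check alone cannot tell where.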

Expected Behavior

The original image and the exported image should be byte-for-byte identical.

Possible Solution

Do not decode the original image when exporting a dataset with images. Just copy the binary content.
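
A rough sketch of what "just copy the binary content" could look like. This is not CVAT's actual export code; `export_images_verbatim`, `source_dir` and `export_dir` are hypothetical names, and the sketch assumes the original files are available on disk and that `export_dir` is not located inside `source_dir`.

```python
import shutil
from pathlib import Path


def export_images_verbatim(source_dir: Path, export_dir: Path) -> list[Path]:
    """Copy every file under source_dir into export_dir without decoding it."""
    copied = []
    for src in sorted(source_dir.rglob("*")):
        if not src.is_file():
            continue
        # Preserve the relative layout of the source directory.
        dst = export_dir / src.relative_to(source_dir)
        dst.parent.mkdir(parents=True, exist_ok=True)
        # Byte-for-byte copy: the image is never opened as an image.
        shutil.copyfile(src, dst)
        copied.append(dst)
    return copied
```

Because the file contents are never parsed, the copied files have the same checksum and size as the originals.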

Context

No response

Environment

  • git hash log:
commit c737f083ac6d9eaf5013acef1392b5922105f3c6 (HEAD, tag: v2.22.0)
Merge: 8d990c986 333df3563
Author: cvat-bot[bot] <147643061+cvat-bot[bot]@users.noreply.github.com>
Date:   Mon Nov 11 13:57:35 2024 +0000

    Merge pull request #8678 from cvat-ai/release-2.22.0
    
    Release v2.22.0
  • docker version
Client: Docker Engine - Community
 Version:           25.0.2
 API version:       1.44
 Go version:        go1.21.6
 Git commit:        29cf629
 Built:             Thu Feb  1 00:23:03 2024
 OS/Arch:           linux/amd64
 Context:           default

Server: Docker Engine - Community
 Engine:
  Version:          25.0.2
  API version:      1.44 (minimum version 1.24)
  Go version:       go1.21.6
  Git commit:       fce6e0c
  Built:            Thu Feb  1 00:23:03 2024
  OS/Arch:          linux/amd64
  Experimental:     false
 containerd:
  Version:          1.6.28
  GitCommit:        ae07eda36dd25f8a1b98dfbf587313b99c0190bb
 runc:
  Version:          1.1.12
  GitCommit:        v1.1.12-0-g51d5e94
 docker-init:
  Version:          0.19.0
  GitCommit:        de40ad0
  • system version
Ubuntu 24.04.1 LTS

Labels: bug, dataset