{
    "data_id": "44974",
    "name": "video_transcoding",
    "exact_name": "video_transcoding",
    "version": 6,
    "version_label": null,
    "description": "**Data Description**\n\nThe dataset contains a million randomly sampled video instances listing 10 fundamental video characteristics along with the YouTube video ID.\n\nThe videos were all transcribed from one format into another, measuring the memory usage and the transcription time.\n\nThe goal is to predict the transcription time using the input information and the desired output format.\n\n**Attribute Description**\n\n1. *id* - Youtube video id (should be dropped for the analysis)\n2. *duration* - duration of video\n3. *codec* - coding standard used for the video (\"mpeg4\", \"h264\", \"vp8\", \"flv\")\n4. *width* - width of video in pixles\n5. *height* - height of video in pixles\n6. *bitrate* - video bitrate\n7. *framerate* - actual video frame rate\n8. *i* - number of i frames in the video\n9. *p* - number of p frames in the video\n10. *b* - number of b frames in the video\n11. *frames* - number of frames in video\n12. *i_size* - total size in byte of i videos\n13. *p_size* - total size in byte of p videos\n14. *b_size* - total size in byte of b videos\n15. *size* - total size of video\n16. *o_codec* - output codec used for transcoding (\"mpeg4\", \"h264\", \"vp8\", \"flv\")\n17. *o_bitrate* - output bitrate used for transcoding\n18. *o_framerate* - output framerate used for transcoding\n19. *o_width* - output width in pixel used for transcoding\n20. *o_height* - output height used in pixel for transcoding\n21. *umem* -  total codec allocated memory for transcoding, alternate target feature\n22. *utime* - total transcoding time for transcoding, target feature",
    "format": "arff",
    "uploader": "Sebastian Fischer",
    "uploader_id": 30127,
    "visibility": "public",
    "creator": null,
    "contributor": null,
    "date": "2022-12-22 16:13:45",
    "update_comment": null,
    "last_update": "2022-12-22 16:13:45",
    "licence": "CC BY 4.0",
    "status": "active",
    "error_message": null,
    "url": "https:\/\/api.openml.org\/data\/download\/22111838\/file22f1662968585.arff",
    "default_target_attribute": "utime",
    "row_id_attribute": null,
    "ignore_attribute": "\"umem\",\"id\"",
    "runs": 0,
    "suggest": {
        "input": [
            "video_transcoding",
            "The dataset contains a million randomly sampled video instances listing 10 fundamental video characteristics along with the YouTube video ID. The videos were all transcribed from one format into another, measuring the memory usage and the transcription time. The goal is to predict the transcription time using the input information and the desired output format. 1. *id* - Youtube video id (should be dropped for the analysis) 2. *duration* - duration of video 3. *codec* - coding standard used for  "
        ],
        "weight": 5
    },
    "qualities": {
        "NumberOfInstances": 68784,
        "NumberOfFeatures": 19,
        "NumberOfClasses": 0,
        "NumberOfMissingValues": 0,
        "NumberOfInstancesWithMissingValues": 0,
        "NumberOfNumericFeatures": 17,
        "NumberOfSymbolicFeatures": 2,
        "MajorityClassSize": null,
        "MinorityClassPercentage": null,
        "MinorityClassSize": null,
        "NumberOfBinaryFeatures": 0,
        "PercentageOfBinaryFeatures": 0,
        "PercentageOfInstancesWithMissingValues": 0,
        "AutoCorrelation": -7.180910748295344,
        "PercentageOfMissingValues": 0,
        "Dimensionality": 0.0002762270295417539,
        "PercentageOfNumericFeatures": 89.47368421052632,
        "MajorityClassPercentage": null,
        "PercentageOfSymbolicFeatures": 10.526315789473683
    },
    "tags": [
        {
            "tag": "Images",
            "uploader": "38960"
        },
        {
            "tag": "Machine Learning",
            "uploader": "38960"
        },
        {
            "tag": "study_353",
            "uploader": "0"
        }
    ],
    "features": [
        {
            "name": "utime",
            "index": "20",
            "type": "numeric",
            "distinct": "10960",
            "missing": "0",
            "target": "1",
            "min": "0",
            "max": "225",
            "mean": "10",
            "stdev": "16"
        },
        {
            "name": "id",
            "index": "0",
            "type": "string",
            "distinct": "1099",
            "missing": "0",
            "ignore": "1"
        },
        {
            "name": "duration",
            "index": "1",
            "type": "numeric",
            "distinct": "1086",
            "missing": "0",
            "min": "31",
            "max": "25844",
            "mean": "286",
            "stdev": "287"
        },
        {
            "name": "codec",
            "index": "2",
            "type": "nominal",
            "distinct": "4",
            "missing": "0",
            "distr": []
        },
        {
            "name": "width",
            "index": "3",
            "type": "numeric",
            "distinct": "6",
            "missing": "0",
            "min": "176",
            "max": "1920",
            "mean": "625",
            "stdev": "463"
        },
        {
            "name": "height",
            "index": "4",
            "type": "numeric",
            "distinct": "6",
            "missing": "0",
            "min": "144",
            "max": "1080",
            "mean": "413",
            "stdev": "241"
        },
        {
            "name": "bitrate",
            "index": "5",
            "type": "numeric",
            "distinct": "1095",
            "missing": "0",
            "min": "8384",
            "max": "7628466",
            "mean": "693702",
            "stdev": "1095628"
        },
        {
            "name": "framerate",
            "index": "6",
            "type": "numeric",
            "distinct": "261",
            "missing": "0",
            "min": "6",
            "max": "48",
            "mean": "23",
            "stdev": "7"
        },
        {
            "name": "i",
            "index": "7",
            "type": "numeric",
            "distinct": "306",
            "missing": "0",
            "min": "7",
            "max": "5170",
            "mean": "101",
            "stdev": "85"
        },
        {
            "name": "p",
            "index": "8",
            "type": "numeric",
            "distinct": "1042",
            "missing": "0",
            "min": "175",
            "max": "304959",
            "mean": "6532",
            "stdev": "6076"
        },
        {
            "name": "b",
            "index": "9",
            "type": "numeric",
            "distinct": "20",
            "missing": "0",
            "min": "0",
            "max": "9407",
            "mean": "9",
            "stdev": "93"
        },
        {
            "name": "frames",
            "index": "10",
            "type": "numeric",
            "distinct": "1044",
            "missing": "0",
            "min": "192",
            "max": "310129",
            "mean": "6642",
            "stdev": "6153"
        },
        {
            "name": "i_size",
            "index": "11",
            "type": "numeric",
            "distinct": "1099",
            "missing": "0",
            "min": "11648",
            "max": "90828552",
            "mean": "2838987",
            "stdev": "4325137"
        },
        {
            "name": "p_size",
            "index": "12",
            "type": "numeric",
            "distinct": "1099",
            "missing": "0",
            "min": "33845",
            "max": "768996980",
            "mean": "22180569",
            "stdev": "50973061"
        },
        {
            "name": "size",
            "index": "13",
            "type": "numeric",
            "distinct": "1099",
            "missing": "0",
            "min": "191879",
            "max": "806711069",
            "mean": "25022942",
            "stdev": "54144015"
        },
        {
            "name": "o_codec",
            "index": "14",
            "type": "nominal",
            "distinct": "4",
            "missing": "0",
            "distr": []
        },
        {
            "name": "o_bitrate",
            "index": "15",
            "type": "numeric",
            "distinct": "7",
            "missing": "0",
            "min": "56000",
            "max": "5000000",
            "mean": "1395036",
            "stdev": "1749352"
        },
        {
            "name": "o_framerate",
            "index": "16",
            "type": "numeric",
            "distinct": "5",
            "missing": "0",
            "min": "12",
            "max": "30",
            "mean": "21",
            "stdev": "7"
        },
        {
            "name": "o_width",
            "index": "17",
            "type": "numeric",
            "distinct": "6",
            "missing": "0",
            "min": "176",
            "max": "1920",
            "mean": "802",
            "stdev": "610"
        },
        {
            "name": "o_height",
            "index": "18",
            "type": "numeric",
            "distinct": "6",
            "missing": "0",
            "min": "144",
            "max": "1080",
            "mean": "504",
            "stdev": "316"
        },
        {
            "name": "umem",
            "index": "19",
            "type": "numeric",
            "distinct": "9395",
            "missing": "0",
            "ignore": "1",
            "min": "22508",
            "max": "711824",
            "mean": "228225",
            "stdev": "97431"
        }
    ],
    "nr_of_issues": 0,
    "nr_of_downvotes": 0,
    "nr_of_likes": 0,
    "nr_of_downloads": 0,
    "total_downloads": 0,
    "reach": 0,
    "reuse": 0,
    "impact_of_reuse": 0,
    "reach_of_reuse": 0,
    "impact": 0
}