Object Detection Annotation Generation

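I want to build a dataset that is easy to use for object detection. Right now I call the Qwen2.5 VL API to run detection, and for each image it returns zero, one, or several bounding boxes in JSON. I would like to save these in standard YOLO or COCO format so that I can later use YOLO or other algorithms for object detection. Do you have any suggestions?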

Based on your requirements, here are systematic suggestions for building a dataset in YOLO/COCO format (with a code skeleton and caveats):

I. Core steps

Directory layout (recommended)

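dataset/
├── images/
│   ├── train/  # training images
│   └── val/    # validation images
├── labels_yolo/  # YOLO-format labels (Ultralytics loaders look for a folder named labels/ next to images/)
│   ├── train/
│   └── val/
└── annotations/  # COCO-format JSON
    ├── train.json
    └── val.json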
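
The layout above can be created up front so that later conversion steps never fail on a missing folder. The following is a minimal sketch; the helper name `make_dataset_dirs` is hypothetical and the folder names simply mirror the tree shown above.

```python
from pathlib import Path


def make_dataset_dirs(root):
    """Create the images/, labels_yolo/ and annotations/ folders under root."""
    root = Path(root)
    # images/ and labels_yolo/ each get a train/ and a val/ subfolder
    for sub in ("images", "labels_yolo"):
        for split in ("train", "val"):
            (root / sub / split).mkdir(parents=True, exist_ok=True)
    # annotations/ holds one COCO JSON file per split
    (root / "annotations").mkdir(parents=True, exist_ok=True)
    return root
```

Calling `make_dataset_dirs("dataset")` twice is harmless because existing folders are left untouched.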

Class mapping file (required)
Create classes.txt with one class name per line:

person
car
dog
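
The conversion code needs a dictionary from class name to numeric ID. One way to build it, sketched here under the assumption that the line order in classes.txt defines the 0-based YOLO class IDs, is to read the file directly. The function name `load_class_map` is hypothetical.

```python
def load_class_map(path):
    """Return (classes, class_map) from a file with one class name per line."""
    with open(path, encoding="utf-8") as f:
        # Skip blank lines and strip surrounding whitespace
        classes = [line.strip() for line in f if line.strip()]
    # Line order defines the 0-based class ID
    class_map = {name: idx for idx, name in enumerate(classes)}
    return classes, class_map
```

With the three-line file above, `class_map` would be `{'person': 0, 'car': 1, 'dog': 2}`.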

YOLOv5 format conversion (core logic, for reference)

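import json
import os
from PIL import Image

# Class name -> 0-based YOLO class ID; example mapping in the same order as classes.txt
class_map = {'person': 0, 'car': 1, 'dog': 2}

def qwen_to_yolo(api_result, img_path, save_dir):
    # Read the image size, needed for normalization
    with Image.open(img_path) as img:
        img_w, img_h = img.size

    # Convert each detection box to "class x_center y_center width height"
    yolo_lines = []
    for box in api_result['boxes']:
        cls_id = class_map[box['class']]
        x_center = (box['xmin'] + box['xmax']) / 2 / img_w  # normalized to 0-1
        y_center = (box['ymin'] + box['ymax']) / 2 / img_h
        width = (box['xmax'] - box['xmin']) / img_w
        height = (box['ymax'] - box['ymin']) / img_h

        yolo_lines.append(f"{cls_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}")

    # Save to a .txt file with the same base name as the image
    # (an image with no boxes produces an empty label file)
    stem = os.path.splitext(os.path.basename(img_path))[0]
    txt_path = os.path.join(save_dir, stem + '.txt')
    with open(txt_path, 'w') as f:
        f.write('\n'.join(yolo_lines))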
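
As a worked example of the normalization arithmetic, the sketch below isolates the per-box math in a pure function. It also clamps coordinates to the image and drops degenerate boxes. That step is an addition, based on the assumption that model-generated boxes may occasionally fall outside the image. The name `box_to_yolo` is hypothetical.

```python
def box_to_yolo(xmin, ymin, xmax, ymax, img_w, img_h):
    """Convert absolute corner coordinates to normalized YOLO (xc, yc, w, h).

    Returns None when the box has no area after clamping to the image.
    """
    # Clamp every coordinate into the image bounds
    xmin = min(max(xmin, 0), img_w)
    xmax = min(max(xmax, 0), img_w)
    ymin = min(max(ymin, 0), img_h)
    ymax = min(max(ymax, 0), img_h)
    # Drop boxes that are empty or inverted
    if xmax <= xmin or ymax <= ymin:
        return None
    x_center = (xmin + xmax) / 2 / img_w
    y_center = (ymin + ymax) / 2 / img_h
    width = (xmax - xmin) / img_w
    height = (ymax - ymin) / img_h
    return x_center, y_center, width, height
```

For a box with corners (100, 50) and (300, 250) in a 640x480 image, the center is (200, 150) and the size is 200x200 pixels, so the normalized values are 200/640 = 0.3125, 150/480 = 0.3125, 200/640 = 0.3125, and 200/480 ≈ 0.416667.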

COCO format conversion (for reference)

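import json
import os
from PIL import Image

# Inputs that must be prepared beforehand:
#   classes     - list of class names, in classes.txt order
#   class_map   - dict: class name -> 0-based class ID
#   all_images  - list of image paths
#   api_results - dict: image path -> list of box dicts for that image

coco_template = {
    "images": [],
    "annotations": [],
    "categories": []
}

# Build categories
for idx, cls in enumerate(classes):
    coco_template['categories'].append({
        "id": idx+1,  # COCO category IDs start at 1
        "name": cls,
        "supercategory": "none"
    })

# Process each image
image_id = 0
annotation_id = 0
for img_path in all_images:
    # Add the image record
    with Image.open(img_path) as img:
        img_w, img_h = img.width, img.height
    coco_template['images'].append({
        "id": image_id,
        "file_name": os.path.basename(img_path),
        "width": img_w,
        "height": img_h
    })

    # Add one annotation per box; bbox is [x_min, y_min, width, height] in pixels
    for box in api_results[img_path]:
        coco_template['annotations'].append({
            "id": annotation_id,
            "image_id": image_id,
            "category_id": class_map[box['class']] + 1,  # COCO category IDs start at 1
            "bbox": [box['xmin'], box['ymin'], box['xmax']-box['xmin'], box['ymax']-box['ymin']],
            "area": (box['xmax']-box['xmin'])*(box['ymax']-box['ymin']),
            "iscrowd": 0
        })
        annotation_id += 1

    image_id += 1

# Save the JSON
os.makedirs('annotations', exist_ok=True)
with open('annotations/train.json', 'w') as f:
    json.dump(coco_template, f)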
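
Before writing the file, it can help to check that the dictionary is internally consistent. The sketch below is a lightweight standard-library check, not a replacement for loading the file with pycocotools. The function name `check_coco` is hypothetical, and it assumes the three top-level keys used in the template above.

```python
def check_coco(coco):
    """Return a list of human-readable problems found in a COCO-style dict."""
    problems = []
    image_ids = {img["id"] for img in coco["images"]}
    category_ids = {cat["id"] for cat in coco["categories"]}
    seen_ann_ids = set()
    for ann in coco["annotations"]:
        # Annotation IDs must be unique across the whole file
        if ann["id"] in seen_ann_ids:
            problems.append(f"duplicate annotation id {ann['id']}")
        seen_ann_ids.add(ann["id"])
        # Every annotation must point at an existing image and category
        if ann["image_id"] not in image_ids:
            problems.append(f"annotation {ann['id']}: unknown image_id {ann['image_id']}")
        if ann["category_id"] not in category_ids:
            problems.append(f"annotation {ann['id']}: unknown category_id {ann['category_id']}")
        # bbox is [x_min, y_min, width, height]; width and height must be positive
        _, _, w, h = ann["bbox"]
        if w <= 0 or h <= 0:
            problems.append(f"annotation {ann['id']}: non-positive bbox size")
    return problems
```

An empty list means none of these particular problems were found; it does not guarantee the file is valid for every downstream tool.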

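II. Key notes

Coordinate normalization: YOLO format stores the box center and size divided by the image width and height (range 0-1); COCO uses absolute pixel values.
ID start value: YOLO class IDs start at 0; for COCO, starting category IDs at 1 is recommended (better compatibility).
Data split: an 8:1:1 train/val/test split is recommended. sklearn.model_selection.train_test_split can do this when applied twice, since each call produces a two-way split.
Validation tools:
- YOLO format: use the Ultralytics yolo val command to verify that the data loads
- COCO format: use pycocotools to verify that the JSON is valid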
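
The 8:1:1 split mentioned above can be sketched as follows. To stay dependency-free, this version uses `random` from the standard library rather than sklearn's `train_test_split`. The function name `split_dataset`, the default ratios, and the seed are illustrative choices.

```python
import random


def split_dataset(items, ratios=(0.8, 0.1, 0.1), seed=42):
    """Shuffle items reproducibly and split them into train/val/test lists."""
    items = list(items)
    # A private Random instance keeps the split reproducible
    # without touching the global random state
    random.Random(seed).shuffle(items)
    n_train = int(len(items) * ratios[0])
    n_val = int(len(items) * ratios[1])
    return {
        "train": items[:n_train],
        "val": items[n_train:n_train + n_val],
        # Whatever remains after train and val goes to test
        "test": items[n_train + n_val:],
    }
```

Splitting the list of image paths, then copying each image together with its label file into the matching folder, keeps images and labels in the same split.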