update read image function #5053

WZMIAOMIAO · 2021-12-24T18:00:14Z

在参加OCR十讲课程中，有使用到官方指定的数据集det_data_lesson_demo。在使用过程中发现，其中有很多是gif格式的图片（但后缀是.jpg），通过PaddleOCR官方的代码读取这些数据后全部为None（例如：mtwi/train/TB1Zj7Un4rI8KJjy0FpXXb5hVXa_!!1-item_pic.gif.jpg），还有些数据opencv会报Corrupt JPEG data看着很难受（例如：xfun/train/zh_train_43.jpg）。详情可以查看 #5092。

PaddleOCR官方读取图片方式精简如下：

with open(img_path, 'rb') as f:
    img = f.read()
img = np.frombuffer(img, dtype='uint8')
img = cv2.imdecode(img, 1)

我将图片读取的方式修改成：

img = Image.open(img_path).convert('RGB')
img = cv2.cvtColor(np.asarray(img), cv2.COLOR_RGB2BGR)

通过我提供的方式读取gif图片时能够正常读取不会为None，并且也解决了opencv报Corrupt JPEG data警告的问题。

paddle-bot-old · 2021-12-24T18:00:17Z

Thanks for your contribution!

littletomatodonkey · 2021-12-25T09:15:10Z

您好，感谢反馈！这里是因为cv2的数据读取效率比PIL高一些，所以我们广泛使用了cv2的读图逻辑，这里不建议直接修改源码，您可以在自己的代码中添加PIL的数据读取~

WZMIAOMIAO · 2021-12-25T09:16:54Z

好吧

littletomatodonkey · 2021-12-25T09:17:48Z

我们记录下，总结在FAQ中~

littletomatodonkey · 2021-12-25T09:19:47Z

#4982

您可以先在这里记录一下，作为特殊图像处理的解决办法哈

WZMIAOMIAO · 2021-12-25T09:20:53Z

好滴

WenmuZhou · 2022-01-05T08:22:23Z

对于读取之后为None的图像，建议改为判断为None后使用Image读取

Corrupt JPEG data的问题不影响读取到的图片内容

update read image function

84088ea

update code

391154a

WZMIAOMIAO closed this Dec 25, 2021

WZMIAOMIAO mentioned this pull request Dec 25, 2021

PaddleOCR社区常规赛 #4982

Closed

WZMIAOMIAO deleted the wzmiaomiao branch December 25, 2021 14:45

WZMIAOMIAO mentioned this pull request Dec 28, 2021

Parsing line Error: list index out of range #5101

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update read image function #5053

update read image function #5053

WZMIAOMIAO commented Dec 24, 2021 •

edited

Loading

paddle-bot-old bot commented Dec 24, 2021

littletomatodonkey commented Dec 25, 2021

WZMIAOMIAO commented Dec 25, 2021

littletomatodonkey commented Dec 25, 2021

littletomatodonkey commented Dec 25, 2021 •

edited

Loading

WZMIAOMIAO commented Dec 25, 2021

WenmuZhou commented Jan 5, 2022

update read image function #5053

update read image function #5053

Conversation

WZMIAOMIAO commented Dec 24, 2021 • edited Loading

paddle-bot-old bot commented Dec 24, 2021

littletomatodonkey commented Dec 25, 2021

WZMIAOMIAO commented Dec 25, 2021

littletomatodonkey commented Dec 25, 2021

littletomatodonkey commented Dec 25, 2021 • edited Loading

WZMIAOMIAO commented Dec 25, 2021

WenmuZhou commented Jan 5, 2022

WZMIAOMIAO commented Dec 24, 2021 •

edited

Loading

littletomatodonkey commented Dec 25, 2021 •

edited

Loading