Multiple streams stopped generating new .ts files on the server #6486

Closed
amarantmeida opened this issue Jul 12, 2024 · 11 comments

@amarantmeida

Short description

Observed that several streams stopped generating new .ts files on the server. The .m3u8 files have remained unchanged for over 16 hours.

In the antmedia-error.log file, we found the following error repeated approximately 62,137,440 times, causing the log file to grow to around 8GB:
Error: [hls @ 0x7f4470079f40] Application provided invalid, non monotonically increasing dts to muxer in stream 1: 193273527510 >= 12606721920
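
To make the numbers in this message easier to read, here is a minimal sketch that pulls them out of a log line. In FFmpeg's muxing code, as far as I know, the first number is the last DTS accepted for the stream and the second is the DTS of the rejected packet. The conversion to milliseconds assumes the values use the 90 kHz MPEG-TS clock; that assumption, and the helper name `parse_dts_error`, are mine and not taken from the report.

```python
import re

# Matches the muxer message quoted in this report (spelling with or without hyphen).
DTS_ERROR = re.compile(
    r"non[ -]monotonically increasing dts to muxer in stream (\d+): (\d+) >= (\d+)",
    re.IGNORECASE,
)

# Assumption: DTS values are ticks of the 90 kHz clock used by MPEG-TS.
ASSUMED_TICKS_PER_MS = 90


def parse_dts_error(line):
    """Return (stream_index, previous_dts, rejected_dts), or None if no match."""
    match = DTS_ERROR.search(line)
    if match is None:
        return None
    stream, previous, rejected = (int(group) for group in match.groups())
    return stream, previous, rejected


line = ("[hls @ 0x7f4470079f40] Application provided invalid, non monotonically "
        "increasing dts to muxer in stream 1: 193273527510 >= 12606721920")
stream, previous, rejected = parse_dts_error(line)

# Convert the last accepted DTS to milliseconds and compare with the int32 limit.
previous_ms = previous // ASSUMED_TICKS_PER_MS
print(stream, previous_ms, (2**31 - 1) - previous_ms)
```

Under the 90 kHz assumption, the last accepted DTS works out to 2147483639 ms, which is 8 ms below the signed 32-bit maximum of 2147483647.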

Temporary fix: restarting the stream.

Environment

  • Operating system and version:
  • Java version:
  • Ant Media Server version:
  • Browser name and version:

Steps to reproduce

  1. NA

Expected behavior

Stream should not stop generating .ts files

Actual behavior

Several streams stopped generating new .ts files on the server. The .m3u8 files have remained unchanged for over 16 hours.

Logs

(https://drive.google.com/drive/folders/1TbL80vblMyNZXy02MCD1cgdT6Lzpzk5R)


@oleul05

oleul05 commented Jul 16, 2024

@amarantmeida, @burak-58
We have 10 camera streams running on the same server. We noticed that when the ts file number for any camera stream reaches 1073740, it stops generating new ts files. The camera streams that had not reached 1073740 kept generating new ts files.
Ant Media Server version: Enterprise Edition 2.8.2 20240201_1142
Running on Vultr Kubernetes Engine with clustering.
Database: MongoDB
Server configuration: 8vCPU and 16GB of System Memory
Please let me know if you need any more information.
image

@mekya
Contributor

mekya commented Jul 16, 2024

Hi @oleul05,

Thank you for the insight. It helped a lot in getting a basic understanding. My guess is that the timestamps are overflowing.

1073740 * 2000 (assuming a key frame interval of 2000 milliseconds) = 2147480000.
That is close to half of the 32-bit integer range (4294967296 / 2 = 2147483648). A signed 32-bit value overflows once it passes 2147483647.
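
The arithmetic above can be checked with a short sketch. It assumes, as the comment does, a 2000 ms segment duration and a millisecond timestamp held in a signed 32-bit field; neither has been confirmed for Ant Media Server, and `to_int32` is only an illustrative helper.

```python
INT32_MAX = 2**31 - 1              # 2147483647, largest signed 32-bit value
SEGMENT_MS = 2000                  # assumed segment duration in milliseconds
MS_PER_DAY = 24 * 60 * 60 * 1000


def to_int32(value):
    """Wrap an integer the way a signed 32-bit field would."""
    return (value + 2**31) % 2**32 - 2**31


# Highest segment index whose millisecond timestamp still fits in int32.
last_safe_segment = INT32_MAX // SEGMENT_MS
# Continuous uptime after which a millisecond counter passes INT32_MAX.
days_until_overflow = INT32_MAX / MS_PER_DAY

print(last_safe_segment)                       # 1073741
print(round(days_until_overflow, 2))           # 24.86
print(to_int32(1073741 * SEGMENT_MS))          # 2147482000, still positive
print(to_int32(1073742 * SEGMENT_MS))          # -2147483296, wrapped negative
```

Under these assumptions the wrap would happen right after segment 1073741, roughly 24.86 days into a continuous stream, which is in the same range as the segment number reported in this thread.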

We can check whether this problem is caused by Ant Media Server.

Cheers
Oguz

@mekya mekya moved this to 📋 Backlog in Ant Media Server Jul 16, 2024
@oleul05

oleul05 commented Jul 16, 2024

@mekya Good afternoon.
Thank you for your investigation. Please confirm whether this is a problem inside Ant Media Server. If so, please let us know the exact root cause and a possible fix.

@lastpeony
Contributor

@oleul05 Is this happening with all cameras or some particular cameras?

@oleul05

oleul05 commented Jul 16, 2024

@lastpeony We had 10 cameras running on one server. The issue occurred only when a camera's .ts file count exceeded 1,073,740. In our case, this happened with 6 cameras, which had been running continuously for around 26 days. The other 4 cameras were restarted during this period.

@lastpeony
Contributor

> @lastpeony We had 10 cameras running on one server. The issue occurred only when a camera's .ts file count exceeded 1,073,740. In our case, this happened with 6 cameras, which had been running continuously for around 26 days. The other 4 cameras were restarted during this period.

Are they all the same camera models? I checked the server and found that we do have some protection for overflow cases. I suspect that the camera might be sending faulty values, which we have observed in some other instances.

@oleul05

oleul05 commented Jul 18, 2024

@lastpeony Yes, all the encoders are the same model.

@burak-58 burak-58 moved this from 📋 Backlog to 🔖 Sprint in Ant Media Server Jul 24, 2024
@oleul05

oleul05 commented Jul 26, 2024

@burak-58, @lastpeony Are there any updates for this bug?
@burak-58, were you able to schedule the meeting you mentioned yesterday in the AMS tech talk?

@lastpeony
Contributor

lastpeony commented Jul 26, 2024

@oleul05 Hi Ole, I sent you a test server IP by email on Wednesday so that we can debug this issue further. Could you please check your emails/tickets?

@oleul05

oleul05 commented Jul 26, 2024

@lastpeony The issue occurred in the production environment, where the client was streaming from encoders, so it is really difficult for us to stream from those encoders to a test server. Could you please investigate from the log files or try to reproduce the same error on your side?
We will also try to reproduce the error on our side with another camera/encoder. If we manage to reproduce the issue in our dev or QA environment, we will let you know.
I already replied to that email.

@oleul05

oleul05 commented Jul 29, 2024

@lastpeony, @burak-58
We have observed the same error in the QA environment:
Application provided invalid, non-monotonically increasing DTS to muxer in stream 1: 4374767250 >= 4374757710.
However, after disabling the camera audio, this error no longer appears.
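
A minimal sketch of why removing one stream can silence the error: the monotonic DTS check is applied per stream, so only packets of the offending stream are rejected. This is an illustration, not Ant Media Server or FFmpeg code. `MonotonicDtsChecker` is a hypothetical name, the stream 0 values are made up, and treating stream 1 as the audio track is an assumption; only the stream 1 pair comes from the QA log above.

```python
class MonotonicDtsChecker:
    """Tracks the last accepted DTS per stream and rejects non-increasing ones."""

    def __init__(self):
        self.last_dts = {}

    def accept(self, stream_index, dts):
        previous = self.last_dts.get(stream_index)
        if previous is not None and previous >= dts:
            return False  # this is the case that produces the muxer error
        self.last_dts[stream_index] = dts
        return True


def rejected_packets(packets):
    """Return the (stream_index, dts) pairs a strict muxer would refuse."""
    checker = MonotonicDtsChecker()
    return [(s, d) for s, d in packets if not checker.accept(s, d)]


packets = [
    (0, 4374750000),   # hypothetical video packet
    (1, 4374767250),   # stream 1, value from the QA log
    (1, 4374757710),   # stream 1 goes backwards, value from the QA log
    (0, 4374753000),   # hypothetical video packet
]

print(rejected_packets(packets))
# Dropping stream 1 (assumed to be audio) leaves nothing to reject.
print(rejected_packets([p for p in packets if p[0] != 1]))
```

In this toy input only the stream 1 packet is refused, and filtering stream 1 out leaves no rejections, which matches the observation that the error disappears once audio is disabled.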

@burak-58 burak-58 moved this from 🔖 Sprint to 🏗 In progress in Ant Media Server Aug 5, 2024
@mekya mekya moved this from 🏗 In progress to After sprint in Ant Media Server Aug 12, 2024
@mekya mekya self-assigned this Aug 12, 2024
@burak-58 burak-58 closed this as completed Sep 2, 2024
@github-project-automation github-project-automation bot moved this from After sprint to ✅ Done in Ant Media Server Sep 2, 2024