Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

output files made by SquigglePull.py containing nothing #43

Open
liu-pinnng opened this issue Dec 16, 2020 · 12 comments
Open

output files made by SquigglePull.py containing nothing #43

liu-pinnng opened this issue Dec 16, 2020 · 12 comments
Assignees
Labels
bug Something isn't working

Comments

@liu-pinnng
Copy link

Hi,

Thank you for developing and maintaining this project!
I am trying to extract the raw signal using SquigglePull.py.
It seems to work, but the .tsv files were containing nothing.
Can you help me with this problem?
Thank you in advance.

Ping

@Psy-Fer
Copy link
Owner

Psy-Fer commented Dec 16, 2020

Hello Ping,

Sure. Could you please tell me a few things?

Python version?
Single or multi fast5 files?
Command being used?
Operating system?

Thanks!

@liu-pinnng
Copy link
Author

Hi,

Thank you for your quick reply.
Python version: Python 3.6.10
Single fast5 file
Command as python SquigglePull.py -r -p downloads/SquiggleKit/example/test.fast5/ > data6.tsv
Operating system: macOS 10.14.6

Ping

@Psy-Fer
Copy link
Owner

Psy-Fer commented Dec 16, 2020

Thanks for that Ping.

Try chaning the command to just the top folder, rather than the individual file

python SquigglePull.py -r -p downloads/SquiggleKit/example/ > data6.tsv

The -p is meant to take a path as an argument. It then finds every fast5 file in that folder after that.

Also, when I asked if it was single or multi. I didn't mean "multiple files" or a "single file" I mean the format of the fast5 file itself. Does the file contain onle 1 read, or does it contain multiple reads. The latest sequencing runs will have multi-fast5 format, usually containing ~4000 reads per file.

@liu-pinnng
Copy link
Author

Thanks for your kind comment.

I used this comment.
python SquigglePull.py -r -p downloads/SquiggleKit/example/ > data6.tsv
But still, nothing containing in the .tsv.
The data I used was downloaded from SquiggleKit.

The data I want to use is the multi-fast5 format.

@Psy-Fer
Copy link
Owner

Psy-Fer commented Dec 16, 2020

Ahh, try adding the --multi flag

@liu-pinnng
Copy link
Author

Still does not work.

@liu-pinnng
Copy link
Author

I worked in the following command.
(envs) PINGdeMacBook-Pro:SquiggleKit pingliu$ python SquigglePull.py -r -p downloads/SquiggleKit/example/ --multi > data.tsv

@liu-pinnng
Copy link
Author

It seems to work by adding ~/ in the front of the path.
However, when I tried to use my own .fast5 with --multi, the key error showed as follow:
KeyError: 'Unable to open object (component not found)'
extract_fast5():failed to read readID: UniqueGlobalKey(envs) PINGdeMacBook-Pro:SquiggleKit pingliu$

@Psy-Fer
Copy link
Owner

Psy-Fer commented Dec 16, 2020

Ahh. That is weird.

Let me do some tests to se if I can figure it out.

@Psy-Fer Psy-Fer self-assigned this Dec 16, 2020
@Psy-Fer Psy-Fer added the bug Something isn't working label Dec 16, 2020
@TTT16
Copy link

TTT16 commented Mar 20, 2024

Hi, Has this issue got fixed? I ran into the same problem. Please help to provide the solution. Thank you!

@Psy-Fer
Copy link
Owner

Psy-Fer commented Mar 21, 2024

I have narrowed this down to be an issue with various fast5 versions. and they way the files are written by ONT. Given slow5 and pod5 are the dominant file formats with working converters, I don't plan on fixing things like this with fast5 as the development time is not worth it.

SquigglePull is not really needed anymore, as you should be able to use the raw files directly with the downstream tools. The tsv intermediate format was an early prototype of what turned into the SLOW5 file format.

James

@TTT16
Copy link

TTT16 commented Mar 22, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants