-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VK video not archived #272
Comments
Thanks for the detailed report @edsu 😄 I haven't been able to reproduce that issue. The video should be under the "Extracted video data" summary page, which is not present in your screenshots. I see in the logs you shared that the capture timeout was hit during the browser scripts phase, which doesn't match what I have observed during my tests. Would you mind trying again and letting us know if you see variability here? That might help us deal with edge cases such as this one more efficiently in the future. Alternatively, for that capture specifically, you might want to:
An aside -- I see in your logs that you are using Node 18.x. While we are still testing on Node 18 and it's unlikely to have an effect, our focus is currently on Node 20 and 21 and I'd recommend upgrading if possible. |
Thanks for the quick response @matteocargnelutti -- in this case I wasn't specifically interested in the attached video, but was hoping that the embedded video would be part of the WARC data and would play when viewing the web page:
Does the video play in the web page replay for you? I upgraded to node v21.5.0 and ran with a longer
|
Nice 😄 !
It doesn't. Based on what I've seen so far: I think making this possible would require either a custom browser behavior or some ad-hoc HTTP request / response rewriting.
Cheers 👋 |
Ok I'll look into adding a behavior. Browsertrix-crawler has an option to run custom behaviors now. Have you considered adding anything like that to scoop? |
I happened to notice that the video in https://vk.com/wall-1113595_548588 is not archived?
I loaded the
vk-video.wacz
that was created into ReplayWeb.PageAnd then clicked on the video:
It doesn't seem to work with browsertrix-crawler either, so maybe this is an issue with the video behavior? Or maybe the problems are unrelated. I thought it was worth reporting though...
The text was updated successfully, but these errors were encountered: