Topic: Big data #7

Open
ragharwal opened this issue Feb 15, 2022 · 2 comments
Labels
blog; documentation (Improvements or additions to documentation); good first issue (Good for newcomers); up-for-grabs (Issues open for contributions from around the world)

Comments

@ragharwal
No description provided.

@ragharwal added the documentation, good first issue, up-for-grabs, and blog labels on Feb 15, 2022
@alexaustin007
Fault tolerance in Spark: Spark automatically recovers lost data and reruns failed tasks using lineage information. Because transformations are lazy, Spark records the sequence of transformations applied to each resilient distributed dataset (RDD) instead of computing results immediately. If a node fails, Spark replays that recorded lineage to recompute only the lost partitions, so the job can still complete.
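
To make the lineage idea concrete, here is a toy sketch in plain Python. It is not Spark's API: `ToyRDD`, `lose_partition`, and the sample data are hypothetical names made up for illustration, and the root data is assumed to live in durable storage. Each dataset stores only its parent and the function that derives it, so a lost partition can be rebuilt by replaying those functions.

```python
from typing import Any, Callable, Dict, List, Optional


class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark's API).

    It stores lineage (a parent plus a per-partition function), and only
    computes rows when asked.
    """

    def __init__(
        self,
        source: Optional[List[List[Any]]] = None,
        parent: Optional["ToyRDD"] = None,
        fn: Optional[Callable[[List[Any]], List[Any]]] = None,
    ) -> None:
        self._source = source    # root partitions; assumed to be durable storage
        self._parent = parent    # lineage: the dataset this one derives from
        self._fn = fn            # lineage: how to derive one partition
        self._cache: Dict[int, List[Any]] = {}  # computed partitions ("node memory")

    @property
    def num_partitions(self) -> int:
        if self._parent is None:
            return len(self._source)
        return self._parent.num_partitions

    def map(self, f: Callable[[Any], Any]) -> "ToyRDD":
        # Lazy: records the transformation, computes nothing yet.
        return ToyRDD(parent=self, fn=lambda rows: [f(r) for r in rows])

    def filter(self, pred: Callable[[Any], bool]) -> "ToyRDD":
        # Lazy: records the transformation, computes nothing yet.
        return ToyRDD(parent=self, fn=lambda rows: [r for r in rows if pred(r)])

    def compute(self, i: int) -> List[Any]:
        # Build partition i from lineage if it is not already in memory.
        if i not in self._cache:
            if self._parent is None:
                rows = list(self._source[i])
            else:
                rows = self._fn(self._parent.compute(i))
            self._cache[i] = rows
        return self._cache[i]

    def collect(self) -> List[Any]:
        # Action: triggers computation of every partition.
        return [row for i in range(self.num_partitions) for row in self.compute(i)]

    def lose_partition(self, i: int) -> None:
        # Simulate a node failure: drop partition i here and in all ancestors.
        node: Optional["ToyRDD"] = self
        while node is not None:
            node._cache.pop(i, None)
            node = node._parent


base = ToyRDD(source=[[1, 2, 3], [4, 5, 6]])
result = base.map(lambda x: x * 10).filter(lambda x: x > 20)

before = result.collect()   # first action computes both partitions
result.lose_partition(1)    # pretend the node holding partition 1 died
after = result.collect()    # partition 1 is recomputed from lineage
print(before == after)
```

Only the dropped partition is rebuilt on the second `collect()`; partition 0 is served from the in-memory copy. Real Spark layers scheduling, shuffles, and task retry limits on top of this, none of which the sketch models.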

contact - [email protected]

@alexaustin007
Can the maintainers review this?
