Skip to content

Conversation

@jhcipar
Copy link
Contributor

@jhcipar jhcipar commented Sep 10, 2025

No description provided.

Copy link
Contributor

@pandyamarut pandyamarut left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for adding this example, I am not much aware of the Nvidia Rapid APIs. Just to confirm- Everything we have considered here has latest version and is relevant. correct>

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's definitely the latest version, but after making this I do wonder if this is something we're really targeting. I'm not sure how often Pandas/dataframe ops are used for llm workloads 🤷

I'm fine with not including it if we don't think it's something people would be interested in!

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I will let you make a call here. Thanks

@deanq deanq requested a review from Copilot October 6, 2025 02:32
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces a new example demonstrating GPU-accelerated data processing using NVIDIA's cuDF library with pandas for analyzing NYC taxi data. The example showcases how to leverage GPU acceleration for common data analytics operations like groupby aggregations.

Key changes:

  • Creates a complete cuDF/pandas integration example with GPU acceleration
  • Implements persistent storage using network volumes to cache downloaded taxi data
  • Demonstrates performance-optimized data analytics workflows on GPU infrastructure

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

# This example shows some simple accelerated data analytics functionality using cudf and pandas.

# [cuDF](https://github.com/rapidsai/cudf) is part of the [NVIDIA RAPIDs](https://rapids.ai/) project.
# RAPIDs provides simple APIs to accelerate common Python data analytics functions with GPUs.
Copy link

Copilot AI Oct 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Corrected spelling of 'RAPIDs' to 'RAPIDS'. The official project name is 'RAPIDS' not 'RAPIDs'.

Suggested change
# RAPIDs provides simple APIs to accelerate common Python data analytics functions with GPUs.
# RAPIDS provides simple APIs to accelerate common Python data analytics functions with GPUs.

Copilot uses AI. Check for mistakes.
Comment on lines +40 to +41
def __init__(self):
return
Copy link

Copilot AI Oct 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Empty init method with explicit return statement is unnecessary. Either implement initialization logic or remove the method entirely.

Suggested change
def __init__(self):
return

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants