-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-18103] [SQL] Rename *FileCatalog to *FileIndex #15634
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #67544 has finished for PR 15634 at commit
|
|
This is okay, but note that "Provider" is equally overloaded in the Data Source API. |
|
Hmm, any other suggestions? |
|
FileLister? FileListing? |
|
How about |
|
Concretely, I propose renaming I think this would make the differences between these classes more clear. Previously, the name was only loosely tied with their behavior. |
|
FileIndex sounds good to me. I wouldn't call it "Metastore" though, since that is a Hive specific term. I'd call it Catalog. |
0776537 to
b6654f1
Compare
|
Test build #67717 has finished for PR 15634 at commit
|
|
Test build #67718 has finished for PR 15634 at commit
|
|
LGTM pending tests. |
|
Test build #67747 has finished for PR 15634 at commit
|
|
Merging in master. |
## What changes were proposed in this pull request? To reduce the number of components in SQL named *Catalog, rename *FileCatalog to *FileIndex. A FileIndex is responsible for returning the list of partitions / files to scan given a filtering expression. ``` TableFileCatalog => CatalogFileIndex FileCatalog => FileIndex ListingFileCatalog => InMemoryFileIndex MetadataLogFileCatalog => MetadataLogFileIndex PrunedTableFileCatalog => PrunedInMemoryFileIndex ``` cc yhuai marmbrus ## How was this patch tested? N/A Author: Eric Liang <ekl@databricks.com> Author: Eric Liang <ekhliang@gmail.com> Closes apache#15634 from ericl/rename-file-provider.
…to `MetadataLogFileIndex` ## What changes were proposed in this pull request? This is a follow-up to apache#15634. ## How was this patch tested? N/A Author: Liwei Lin <lwlin7@gmail.com> Closes apache#15712 from lw-lin/18103.
## What changes were proposed in this pull request? To reduce the number of components in SQL named *Catalog, rename *FileCatalog to *FileIndex. A FileIndex is responsible for returning the list of partitions / files to scan given a filtering expression. ``` TableFileCatalog => CatalogFileIndex FileCatalog => FileIndex ListingFileCatalog => InMemoryFileIndex MetadataLogFileCatalog => MetadataLogFileIndex PrunedTableFileCatalog => PrunedInMemoryFileIndex ``` cc yhuai marmbrus ## How was this patch tested? N/A Author: Eric Liang <ekl@databricks.com> Author: Eric Liang <ekhliang@gmail.com> Closes apache#15634 from ericl/rename-file-provider.
…to `MetadataLogFileIndex` ## What changes were proposed in this pull request? This is a follow-up to apache#15634. ## How was this patch tested? N/A Author: Liwei Lin <lwlin7@gmail.com> Closes apache#15712 from lw-lin/18103.
What changes were proposed in this pull request?
To reduce the number of components in SQL named *Catalog, rename *FileCatalog to *FileIndex. A FileIndex is responsible for returning the list of partitions / files to scan given a filtering expression.
cc @yhuai @marmbrus
How was this patch tested?
N/A