RANGER-5406: Support export policies in a segmented manner #741
base: ranger-2.3
Conversation
@mneethiraj @kumaab

Thank you @yunyezhang-work for the patch! Please raise a PR for the
        return ret;
    }

    private List<RangerPolicy> cutRangerPolicyList(List<RangerPolicy> policyList, SearchFilter filter) {
Suggested name: getRangerPoliciesInRange
        int startIndex = filter.getBeginIndex();
        int pageSize = filter.getOffsetIndex();
        int toIndex = Math.min(startIndex + pageSize, totalCount);
        LOG.info("==>totalCount: " + totalCount + " startIndex: " + startIndex + " pageSize: " + pageSize + " toIndex: " + toIndex);
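The slicing logic in the fragment above, under the reviewer's suggested name, could be sketched as a standalone method. This is a hedged sketch, not the actual patch: the generic element type stands in for `RangerPolicy`, and returning an empty list for out-of-range input is an assumption about the desired behavior.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class PolicyRangeExample {
    // Sketch of getRangerPoliciesInRange: return the policies in
    // [startIndex, startIndex + pageSize), clamped to the list size.
    public static <T> List<T> getPoliciesInRange(List<T> policyList, int startIndex, int pageSize) {
        int totalCount = policyList.size();
        if (startIndex < 0 || startIndex >= totalCount || pageSize <= 0) {
            // Assumed fallback for invalid ranges: an empty result
            return Collections.emptyList();
        }
        int toIndex = Math.min(startIndex + pageSize, totalCount);
        // subList returns a view; copy it so the result is independent of the source list
        return new ArrayList<>(policyList.subList(startIndex, toIndex));
    }

    public static void main(String[] args) {
        List<Integer> policyIds = List.of(1, 2, 3, 4, 5, 6, 7, 8);
        System.out.println(getPoliciesInRange(policyIds, 2, 3)); // prints [3, 4, 5]
    }
}
```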
Avoid string concatenation, use String.format()
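Applied to the log line in the diff above, the reviewer's `String.format` suggestion might look like the following sketch (the helper method and its name are illustrative, not part of the patch):

```java
public class LogFormatExample {
    // Same message text as the concatenated version, built with %d placeholders
    public static String formatRangeLog(int totalCount, int startIndex, int pageSize, int toIndex) {
        return String.format("==>totalCount: %d startIndex: %d pageSize: %d toIndex: %d",
                totalCount, startIndex, pageSize, toIndex);
    }

    public static void main(String[] args) {
        // In the patch this would be passed to LOG.info(...) instead of printed
        System.out.println(formatRangeLog(18, 0, 5, 5));
    }
}
```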
            LOG.info("Invalid or Unsupported sortType : " + sortType);
        }
    } else {
        LOG.info("Invalid or Unsupported sortBy property : " + sortBy);
Avoid string concat, check all references.
See: https://cwiki.apache.org/confluence/display/RANGER/Apache+Ranger+Java+Style+Guide
agents-common/src/main/java/org/apache/ranger/plugin/util/SearchFilter.java
    public static final String UPDATE_TIME  = "updateTime"; // sort
    public static final String START_INDEX  = "startIndex";
    public static final String BEGIN_INDEX  = "beginIndex";
    public static final String OFFSET_INDEX = "offsetIndex";
I think OFFSET is more meaningful than OFFSET_INDEX; an offset is not an index. What do you think?
    private int startIndex;
    private int maxRows     = Integer.MAX_VALUE;
    private int beginIndex  = -1;
    private int offsetIndex = -1;
Since you've added new fields to the SearchFilter class, don't forget to modify the copy constructor (public SearchFilter(SearchFilter other)) accordingly to ensure the new attributes are properly copied.
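A minimal sketch of the copy-constructor update the reviewer asks for. The field names `beginIndex` and `offsetIndex` come from the diff above; the rest of `SearchFilter` is elided and the class name here is a stand-in, not the real Ranger class.

```java
public class SearchFilterSketch {
    private int startIndex;
    private int maxRows     = Integer.MAX_VALUE;
    private int beginIndex  = -1; // newly added field
    private int offsetIndex = -1; // newly added field

    public SearchFilterSketch() {}

    // Copy constructor: the two new fields must be copied along with the old ones
    public SearchFilterSketch(SearchFilterSketch other) {
        this.startIndex  = other.startIndex;
        this.maxRows     = other.maxRows;
        this.beginIndex  = other.beginIndex;
        this.offsetIndex = other.offsetIndex;
    }

    public int getBeginIndex()  { return beginIndex; }
    public int getOffsetIndex() { return offsetIndex; }

    public void setBeginIndex(int beginIndex)   { this.beginIndex = beginIndex; }
    public void setOffsetIndex(int offsetIndex) { this.offsetIndex = offsetIndex; }
}
```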
    }

    public void setBeginIndex(int beginIndex) {
        this.beginIndex = beginIndex;
I think we should validate that beginIndex >= 0. What’s your opinion?
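The validation the reviewer proposes could look like this sketch. Throwing `IllegalArgumentException` on a negative value is an assumption about the desired failure mode; keeping `-1` as the "unset" sentinel matches the field's initializer in the diff.

```java
public class BeginIndexValidation {
    private int beginIndex = -1; // -1 means "not set", per the diff above

    // Reject negative values instead of silently accepting them
    public void setBeginIndex(int beginIndex) {
        if (beginIndex < 0) {
            throw new IllegalArgumentException("beginIndex must be >= 0, got " + beginIndex);
        }
        this.beginIndex = beginIndex;
    }

    public int getBeginIndex() {
        return beginIndex;
    }
}
```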
What changes were proposed in this pull request?
In big data production environments, customers create a massive number of policies, often hundreds of thousands or even millions. Exporting the entire policy set for disaster recovery produces an enormous data volume and extremely slow imports into the backup cluster. Our experimental data shows that importing 10,000 policies via the API is very memory-intensive and takes approximately 15 minutes; importing 100,000 policies takes 2.5 hours or longer.
With an even larger number of policies, memory consumption increases significantly, and insufficient memory can interrupt the import. We therefore propose modifying the API to support segmented export. This saves memory and ensures data reliability when importing into other clusters for disaster recovery.
How was this patch tested?
To manually test this feature, send an HTTP request to Ranger. Using shell commands as an example:

Without the segmentation parameters, calling the export API getPoliciesInJson exports all policies. In this test environment there are 18 policies for the service hdfs-xxx:

curl -u$USER:$PASSWORD -XGET "http://$RANGER_HOST:$RANGER_PORT/service/plugins/policies/exportJson?serviceName=$SERVICE&checkPoliciesExists=true" -v -o export.json

Adding the segmentation parameters (beginIndex and offsetIndex) exports only the policies in the specified range; for example, policies 1-5 of hdfs-xxx:

curl -u$USER:$PASSWORD -XGET "http://$RANGER_HOST:$RANGER_PORT/service/plugins/policies/exportJson?serviceName=$SERVICE&checkPoliciesExists=true&beginIndex=$BEGIN_INDEX&offsetIndex=$OFFSET_INDEX" -v -o export_${BEGIN_INDEX}_${OFFSET_INDEX}.json