-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
8275319 java.net.NetworkInterface throws java.lang.Error instead of SocketException #5956
8275319 java.net.NetworkInterface throws java.lang.Error instead of SocketException #5956
Conversation
👋 Welcome back djelinski! A progress list of the required criteria for merging this PR into |
@djelinski The following label will be automatically applied to this pull request:
When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command. |
Hello @djelinski,
I have created https://bugs.openjdk.java.net/browse/JDK-8275319 to track this. For this PR to trigger the PR review process, you will now have to update the PR title to match the bug id and bug title (and I think even the commit message should refer to the same). |
Webrevs
|
I suspect most of these cases need a closer look as there may be cases that can never happen, meaning Error or InternalError would be more appropriate, e.g. if MultiByteToWideChar fails with ERROR_INVALID_FLAGS or ERROR_INVALID_PARAMETER for example. |
I have to agree with Alan on this. Is there any concrete case where an Error was observed, or is this pure speculation? |
@jaikiran thanks for the JBS. Editing PR title was sufficient.
It happens occasionally, as evidenced by JDK-8217298, JDK-8046500 (fixed), JDK-8165665, JDK-8066931, JDK-8057900, JDK-8065559(closed), JDK-8040229, JDK-8065078(fixed), JDK-8068597(fixed), numerous reports on Google (search for "error IP Helper Library"), and an ancient note found in my company's codebase, possibly no longer relevant.
I started checking how such failures are handled elsewhere in JDK codebase, and the results are pretty inconsistent: ignore result, normal error handling, native assert, and Java error in NetworkInterface. Some consistency here would be nice, but then, I don't know which option I would pick. Google couldn't find any cases of "Error Cannot get multibyte", so I guess that doesn't happen too often.
There are only 2 places in Java codebase where Error is thrown, one is NetworkInterface, the other one is here. Still, I'd argue that even if an unhandled error in IP helper library indicates a bug in JDK, throwing a checked exception is a better option here, because:
|
So these Errors do actually happen - thanks for the confirmation, and double thanks for the archeology research. This is very useful to this discussion. I see that some of the issue you mentioned are suggesting some fixes - like - for instance JDK-8165665. If some conditions are not supposed to happen - then I still think that throwing |
There was churn in the Windows APIs and some of these issues may be historical. I think it would be useful if we can find any recent reports of issues and try to identify if they are issues in specific environments/configurations, JDK issues (not handling some cases), or something else. I'd prefer not rush in and change all these exceptions without going through them one by one. |
Makes sense.
What do you think? |
JDK-8217298 hints that one scenario where an exception is thrown is when there are no network adapters configured. I don't know how easy it would be to create that scenario and to see how current JDK releases behave. MultiByteToWideChar - yes, makes sense. GetIfTable -yes, I agree this needs to cleaned up. I don't see ERROR_BUFFER_OVERFLOW listed as a possible error. GetAdaptersAddresses. It would be nice if this NetworkInterface_winXP.c could be removed but it is still used for the -Djava.net.preferIPv4Stack=true case. A more useful exercise may be to see if we could move to one implementation that works with preferIPv4Stack set to true or false. |
Updated the PR with more specific exception types. I'm still doing some printf-based testing / debugging. Observations so far:
I expected the Windows error code to appear after my message, not before.
I'll update when I know more. FWIW, my test code:
|
The crash reason was a bit more mundane. We assigned the (int) result of |
In default mode the exception is swallowed here, and then SocketException ("No network interfaces configured") is thrown in Java code. Should I clean that up here or leave it for future PRs? |
case ERROR_NO_DATA: | ||
// not an error | ||
*adapters = NULL; | ||
return ERROR_SUCCESS; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mapping ERROR_NO_DATA to ERROR_SUCCESS is probably correct here. Could this explain bug reports that seem to be from configurations with no IP addresses plumbed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suppose it could. But in order to get ERROR_NO_DATA here one would need to disable loopback interface. I couldn't find how to do that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There has been one or two reports from configurations that appear to have no interface or no IP addresses. I don't think we've been able to create those configurations to duplicate. In any case, treat NO_DATA as SUCCESS seems correct here.
@@ -237,7 +229,7 @@ static int ipinflen = 2048; | |||
*/ | |||
int getAllInterfacesAndAddresses (JNIEnv *env, netif **netifPP) | |||
{ | |||
DWORD ret, flags; | |||
int ret, flags; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks right but I can't relate this to your comment about the crash.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let me try to explain this clang-style:
tableP
contains random garbage- lookupIPAddrTable returns error, does not touch
tableP
ret
is unsigned, so this is never true- uninitialized
tableP
is passed to enumAddresses_win_ipaddrtable enumAddresses_win_ipaddrtable
dereferencestableP
and crashes
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this, I hadn't spotted the broken code that tests if loopupIPAddrTable fails.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think you've got this to a good place. It would be helpful if you could trim down some of the really long lines to keep to consistent with the existing code if possible. It's clear that there is a lot of cleanup needed in this area but that can be done in future PRs as needed.
Hi Daniel, thanks for persisting with this. I had a look at your changes and I believe you have hit the right balance. I also like the fact plain |
Hi - I have run your current patch through our CI and the good news is that we got no failure.
I believe you are correct and there are still some issues in that case. In particular I also see -that if netaddrCount == -2 there:
we're going to try and allocate arrays of size -2 a few lines below. That doesn't sound right. |
Hmm... JDK-8225239 might have introduced a bug.
it should be returning ret not NULL
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The current patch looks good and it's good to have caught the DWORD->int issue. We should add information about this to the JBS issue.
I don't mind if you want to iterate further with Daniel's suggestion, okay if it's a separate PR too. In general I think there is quite a bit of cleanup to do in this code.
|
@djelinski This change now passes all automated pre-integration checks. ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details. After integration, the commit message for the final commit will be:
You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed. At the time when this comment was updated there had been 108 new commits pushed to the
As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details. As you do not have Committer status in this project an existing Committer must agree to sponsor your change. Possible candidates are the reviewers of this PR (@AlanBateman, @dfuch) but any other Committer may sponsor as well. ➡️ To flag this PR as ready for integration with the above commit message, type |
Thanks @dfuch and @AlanBateman for the review and all your help. @dfuch I'd rather follow up on the remaining issues in a separate PR. This one is already doing more than the PR title advertises. |
/integrate |
@djelinski |
@djelinski I have no objection in handling that in a followup PR. |
/sponsor |
Going to push as commit 043cde2.
Your commit was automatically rebased without conflicts. |
@dfuch @djelinski Pushed as commit 043cde2. 💡 You may see a message that your pull request was closed with unmerged commits. This can be safely ignored. |
@dfuch yessir. I'll continue the cleanup, should have something for review later this week. |
Per Java documentation, "Error [..] indicates serious problems that a reasonable application should not try to catch". Failure to enumerate network interfaces or addresses is not a serious enough situation; many applications can recover from this pretty easily.
All native methods (except
init()
) in NetworkInterface are declared withthrows SocketException
, so throwing SocketExceptions instead of Errors will match the declared interface.Unix version of NetworkInterface already throws
SocketException
s in similar situations, and does not throwError
under any circumstances.I searched the bug database and mail archives, but could not find any discussion on the topic; if there's a reason to keep throwing
Error
s, I couldn't find it.tier1
tests pass,java.net
tests pass. No new regression tests as I couldn't find any list of steps to reproduce the problem.I don't have write access to JBS. I could use some help with creating a ticket.
Progress
Issue
Reviewers
Reviewing
Using
git
Checkout this PR locally:
$ git fetch https://git.openjdk.java.net/jdk pull/5956/head:pull/5956
$ git checkout pull/5956
Update a local copy of the PR:
$ git checkout pull/5956
$ git pull https://git.openjdk.java.net/jdk pull/5956/head
Using Skara CLI tools
Checkout this PR locally:
$ git pr checkout 5956
View PR using the GUI difftool:
$ git pr show -t 5956
Using diff file
Download this PR as a diff file:
https://git.openjdk.java.net/jdk/pull/5956.diff