Add processing of connection strings and DSNs management #1

bpintea · 2018-03-18T22:58:17Z

The driver receives it's configuration from the Driver Manager (DM) through connection strings.
These connection strings can be exhaustive, providing full information on the server to connect to, how to interact with it, how to treat the data etc. or partial, requiring the driver to retrieve information about the data source from various possible sources (system information, files, user prompts).

Supporting DSNs means:

capacity to process (parse, validate) the connection string;
generate a connection string to be stored in file/system/etc DSNs;
identify missing information and prompt for it.

droberts195 · 2018-03-19T16:44:33Z

driver/connect.c

+#define CONNSTR_KW_MAX_FETCH_SIZE	"MaxFetchSize"
+#define CONNSTR_KW_MAX_BODY_SIZE	"MaxBodySize"
+#define CONNSTR_KW_TRACE_FILE		"TraceFile"
+#define CONNSTR_KW_TRACE_LEVEL		"TraceLevel"


I haven't had time for a proper review yet, but one thing that immediately stands out is that you're using tabs for indenting, which (as the way the GitHub diff tool has chosen to lay out the code above highlights) looks different depending on the viewer's tab settings.

Is there a good reason not to switch to spaces? Both the ML C++ code and the Elasticsearch Java code use 4 spaces per indent.

If you think it's a good idea then /usr/bin/expand could make the changes.

Yes, that's a grassroots issue . :-)

I guess this is generally a matter of preference, if consistency is kept.
Myself, I prefer tabs for the ("classical") reasons of:

avoiding the debate if tabs should be 2 (ex. libcurl), 4 or even 8 (part of ODBC API, most being 4 though; linux kernel) spaces wide, and/or how that plays with a line width of 80 characters, which I've also been keeping (vs 140, the limit in Elasticsearch, though interestingly, the proprietary license is also 80 chars wide, but like most licenses otherwise; but arguably, OO languages tend to "generate" longer lines).

most editors being able to expand the tabs to a preferred size (unlike the reverse transformation).

tabs being meant for indentation and tabulating.

Complementary to that, I document the editor settings at the bottom of every source code file.

Now that given, I'm not that fussy about it; my preference is with the standard I currently use, but I understand there are reasons for spaces instead of tabs (diffs can be less easy to read, especially in a browser, code is sometimes viewed with simple shells editors etc.) and I'd be fine with a switch-over should we decide that's worth it.

OK fair enough. It's your repo so you get to decide the style. I just wanted to make sure you were aware that all the major elastic repos are using spaces.

SQLTCHAR, belonging to ODBC API, expands to SQLWCHAR when UNICODE compiling, SQLCHAR otherwise. This was used in the hope for an easy concurrent development of a Unicode and ANSI driver, or at least facilitate later the ANSI version development. However, this was a naive assumption, has been used inconsistently and would have generated wrong code in some places, with no-Unicode compilation. The commit touches much code, but is just simple textual substitution: - s/SQLTCHAR/SQLWCHAR/ - s/\<tstr\>/wptr/ - s/MK_TSTR/MK_WSTR/ - S/MK_WSTR/MK_WPTR/ (This will also facilitate adding next a much needed wstr struct for defining non-zero ended wchar_t strings with lenght.)

added new wstr_st struct for SQLWCHAR non 0-terminated strings

add util.c/.h module

Rework and progress of the function to - consider the values of "DriverCompletion" parameter: - promt the user for more info (potentially repetitively until connection is established); - conider the system registry; - discriminate between DRIVER and DSN configuration; - correctly assembly the connection string that would eventually be writen into a file DSN. Missing still: - reading the system registry; - the graphical part / user prompting.

Read registry information to fetch driver configuration attributes, in case the connection string contains the DSN attribute with a valid value.

edsavage · 2018-03-26T10:51:35Z

driver/connect.c

 	return TRUE;
 }

-static BOOL as_long(size_t cnt, SQLWCHAR *val, long *out)
+static BOOL wstr2long(wstr_st *val, long *out)


I'm curious why you've re-implemented this function yourself. Was it for performance reasons?

A rather minor concern was performance as well; currently the function is used for reading the config only, so performance isn't an issue, but I hope to reuse it later for data conversions (i.e. data received as string from ES, but consuming app wants a long/int), which the driver is required to do.
The main reason though is wcstol()'s need for a null terminated string, which I could only provide either by modifying the original string (zero it - convert - restore it) or duplicating it (or copying it in a local buffer to avoid allocation).
I have - arguably - considered a "quick" implementation as cleaner.

edsavage

This looks fine to me Bogdan.

As a matter of style I would have preferred that the body of single line if/else statements were always enclosed in braces. I see that it has been done in a few places but not consistently. I'm not overly concerned about the matter though..

edsavage · 2018-03-26T13:09:18Z

driver/connect.c

 	cleanup_curl(dbc);
-	if (abuff) /* if buffer had been set, the error occured in _perform() */
+	if (abuff) {
+		free(abuff);


Assign buff to NULL here? After freeing

Sure, committed the change in f0371e0.
It's a local variable freed in the error case at the end of the function, so it shouldn't be reused, but NULL'ing it is good practice (in case the code block would be reused) and the compiler can easily optimize the statement out when needed.

keep the check against null safe, in case the code block gets shuffled

droberts195

LGTM

SQLTCHAR, belonging to ODBC API, expands to SQLWCHAR when UNICODE compiling, SQLCHAR otherwise. This was used in the hope for an easy concurrent development of a Unicode and ANSI driver, or at least facilitate later the ANSI version development. However, this was a naive assumption, has been used inconsistently and would have generated wrong code in some places, with no-Unicode compilation. The commit touches much code, but is just simple textual substitution: - s/SQLTCHAR/SQLWCHAR/ - s/\<tstr\>/wptr/ - s/MK_TSTR/MK_WSTR/ - S/MK_WSTR/MK_WPTR/ (This will also facilitate adding next a much needed wstr struct for defining non-zero ended wchar_t strings with lenght.)

added new wstr_st struct for SQLWCHAR non 0-terminated strings

Add processing of connection strings and DSNs management

add processing of received connection strings

6b37a8c

bpintea requested review from droberts195 and edsavage March 18, 2018 22:59

droberts195 reviewed Mar 19, 2018

View reviewed changes

bpintea added 6 commits March 19, 2018 22:53

refactor connection strings with wstr_st (#1)

9b6a147

added new wstr_st struct for SQLWCHAR non 0-terminated strings

split reusable code in util.c/.h module

4422ac4

add util.c/.h module

peal json_escape into util module

b81a6e1

read system information for driver config

a4be0e7

Read registry information to fetch driver configuration attributes, in case the connection string contains the DSN attribute with a valid value.

edsavage reviewed Mar 26, 2018

View reviewed changes

edsavage approved these changes Mar 26, 2018

View reviewed changes

set pointer to NULL after freeing it

f0371e0

keep the check against null safe, in case the code block gets shuffled

droberts195 approved these changes Mar 27, 2018

View reviewed changes

bpintea merged commit d38dd0f into elastic:master Apr 3, 2018

bpintea deleted the feature/connection_string branch April 3, 2018 11:06

bpintea mentioned this pull request Apr 3, 2018

TODOs #2

Closed

bpintea added a commit that referenced this pull request Jun 4, 2018

refactor connection strings with wstr_st (#1)

8171962

added new wstr_st struct for SQLWCHAR non 0-terminated strings

bpintea added a commit that referenced this pull request Jun 4, 2018

Merge pull request #1 from bpintea/feature/connection_string

fabcbde

Add processing of connection strings and DSNs management

bpintea added >feature Applicable to PRs adding new functionality v6.5.0 labels May 3, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add processing of connection strings and DSNs management #1

Add processing of connection strings and DSNs management #1

Uh oh!

bpintea commented Mar 18, 2018

Uh oh!

droberts195 Mar 19, 2018

Uh oh!

bpintea Mar 19, 2018

Uh oh!

droberts195 Mar 20, 2018

Uh oh!

edsavage Mar 26, 2018

Uh oh!

bpintea Mar 26, 2018

Uh oh!

edsavage left a comment

Uh oh!

edsavage Mar 26, 2018

Uh oh!

bpintea Mar 27, 2018

Uh oh!

droberts195 left a comment

Uh oh!

Uh oh!

Add processing of connection strings and DSNs management #1

Add processing of connection strings and DSNs management #1

Uh oh!

Conversation

bpintea commented Mar 18, 2018

Uh oh!

droberts195 Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

bpintea Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

droberts195 Mar 20, 2018

Choose a reason for hiding this comment

Uh oh!

edsavage Mar 26, 2018

Choose a reason for hiding this comment

Uh oh!

bpintea Mar 26, 2018

Choose a reason for hiding this comment

Uh oh!

edsavage left a comment

Choose a reason for hiding this comment

Uh oh!

edsavage Mar 26, 2018

Choose a reason for hiding this comment

Uh oh!

bpintea Mar 27, 2018

Choose a reason for hiding this comment

Uh oh!

droberts195 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!