-
Notifications
You must be signed in to change notification settings - Fork 17.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
database/sql: missing escape functions #18478
Comments
There isn't anything here that must live in If you are interested in this, I would create a new public go package with something like this:
At that point you could suggest to drivers that they upstream your Quoter interface. You would use the Quoter interface like:
In other words, it would always be a pre-processor step to the actual DB.Query methods. I do agree MySQL is especially bad here. I can't really see this going into the std library. Have you talked to any sql driver maintainers about this or do you have any issues you can link here? |
I would argue to the contrary. Escaping primitives are needed for working securely with databases and they are inherently dependent on the SQL-server behind the connection. A package using database.sql might not even know what server it is talking to and how to properly quote and escape for that server, if it just gets passed a reference. As is, database.sql can only be used for queries known ahead of time and under the constraint, that the only dynamic entities in the query are value and never columns (think ORDER BY <user input>) or tables |
database/SQL abstracts the database interface, not the query text. You can build up the query text today with go packages that exist today. |
quoting query text is part of the database interface. |
placeholders are also abstracted. Native postgres uses $<number> not ?, oracle :<name> |
Placeholders are not abstracted. database/sql itself doesn't care at all, it just passes the query to a driver. And most drivers I know of don't abstract it away, either, requiring you to use $1 for PostgreSQL and ? for SQLite. Similarly, quoting arguments would depend on the specific DBMS. |
So proper quoting is impossible, since neither database.sql nor database.sql.driver export quoting functions and when I am sitting behind those apis my best bet is type-switching the driver to find out what I am talking to and worst case implementing that my self? That does not look solid to me. |
Maybe the current situation is not ideal, but there have been no solid proposals yet that consider all popular databases. If the driver community wants to put together a proposal, we're all ears. |
Perl/DBI does this right. Golang needs to do the same. |
@caseyallenshobe, see https://golang.org/wiki/NoMeToo If you have a proposal, please propose. |
I didn't say "me too", actually. I referenced a good implementation. Check out DBI and quote_identifier / quote_literal functions. Just like database/sql, DBI leaves platform-specific implementation up to the drivers. |
That's more helpful. For what it's worth, I lived and breathed Perl and Perl's DBI for a decade+, and that's what inspired my writing the database/sql package. I'll leave this to @kardianos, who's taken over maintenance of the package. |
Quoter defines an interface to safely quote identifiers and values. Updates golang/go#18478
I'm generally in favor of doing this. However, I'm not sure if or where in the std repo it would live. For now I've implemented a first pass https://godoc.org/github.com/golang-sql/sqlexp . |
MySQL has configuration options for quoting, so the quoting is also dependent on the connection. As a user I would expect the function to be on the sql.DB object, since I probably already have a reference to it when I am constructing queries and sql.DB already has all the information to set up the quoting. Thats also where you find them in similar packages for other languages like Perl/DB, PHP/PDO, PHP/Mysqli. |
@nefthy Good point about MySQL especially. I've updated the calls to be network friendly. You can get the driver from Here's what I'd like to see:
Perhaps it would be good to combine the escaper functions with the database name function. We'd probably want to make it easy to expand it in the future as well, which would mean changing it into a struct with methods. |
This is especially problematic for batch inserts, since database/sql doesn't currently support batch inserts, and the only way to do them today is by constructing the raw query string. |
Likewise for database-side arrays and variable-amount IN clauses. |
@hareesh-blippar Could you give an example of what you mean by batch inserts, like COPY IN or Bulk Copy? @caseyallenshobe database side arrays will be handled in a different way, by adding support for them. |
@kardianos For example, |
Batch inserts would be inserting multiple rows with a single statement, I believe. And yes, Go 1.9 promises something better, as does sqlx today. My company is still stuck on Go 1.4.3 and upgrades slowly, so it will be quite a while for us yet... Forced needs for implementing something to properly quote to overcome some shortcoming aren't the only reason either...just a good example. :) |
@hareesh-blippar I don't understand why database/sql doesn't understand multi-row value insert today. You would need to add those as parameters (which would be great for a little 10 line function to do, but other then that, it should be easy enough... Are you expecting a different API to support this? @caseyallenshobe Well, your situation would give additional weight to develop this out of tree first, then anyone can use it regardless of go version. |
@kardianos is that possible with database/sql today when the value of n in my example is variable and in the order of thousands? |
I would think so. SQL Server can only insert 1000 rows with that syntax, so built a little helper around it to break it up into statements of 1000. So I don't see why not. |
One case not yet discussed: Generating SQL without executing. For a patch generator I am in dire need of the equivalent As it stands, I have two options:
Both are not very attractive, I guess I'll go with the latter option and keep fixing it till it stops breaking? |
Even though MySQL behavior is different On the other hand, QueryString can not chose how to escape backslash without BTW, if we add QuoteString, why no FormatDate, FormatDecimal, FormatFloat, FormatGeometry, Note that QuoteIdentifier is special, because we can not use placeholder for it on some RDB including MySQL. So QuoteIdentifier will be worth enough even if @dbuteyn You can copy escaping code from |
@methane Thanks, its actually the first place I went to but I couldn't find what I was looking for at a glance. A string replace was sufficient for my needs and I didn't want to waste any more time. Obviously what I did is in no way correct and won't work for others. |
I'm new to Go so my apologies if this is extraordinarily obvious to others, but I have managed to use Sprintf to create a string for the query containing a dynamic table name for Sqlite3.
This has allowed me to query my Sqlite3 database using a dynamic table name which comes from elsewhere in my program (never as user input). Hope this helps someone though I understand this discussion is more tailored around larger DB systems. Who knows, it might work with them too! |
Reading this, I initially thought only "identifier.quoting" is non-portable. Alas, no. For example MySQL by default interprets backlash escapes inside 'strings', which violates the standard 🤦, unless one enables Still, I wonder — do all DBs have a set of flags that make them parse literals according to standard? |
@cben Nope, this isn't a thing. Many databases don't have such a setting, and I wouldn't recommend such a setting if there was one. Many people write SQL that make really important assumptions about quoting. The interface I would recommend would be something like this: @David00 You're on the right track and for your purposes, that might be enough. But almost inevitably, if you are talking about escaping inputs, you'll want check for values within the string as well as placing quotes around it. Also note that SQLite escaping is different then other systems. I'm honestly not sure the best direction for the quoter interface. Should there be a registry? Probably not. Could there be a per database quoter? Maybe. The problem is easy enough to frame. I'm not sure what the best solution would look like for this. |
Maybe I'm being naive, but I would expect that quoting would have to be the responsibility of the driver have to be at the connection level to get it right. (MySQL being the obvious case in point.) It would then be up to the sql package to provide an interface to obtain the quoter from the connection. An error return value allows for compatibility with drivers that have not implemented a quoter. Something like: // Quoter provides a driver-appropriate utility for quoting values and identifiers.
//
// If the driver does not provide a Quoter, then a nil value is returned with a
// non-nil error.
func (*sql.Conn) Quoter() (Quoter, error) { ... } If I'm in a project and I feel good with the assumption that my driver of choice provides a Quoter, then I can just ignore the error case. If I want maximum abstraction, I write a few extra lines to handle that case. |
For reference, here is the PGX Sanitize function for identifiers: |
We now have very short steps for someone to take to a disaster:
This person isn't me, because I'm translating code using Perl's DBI and DBI did this right decades ago. We've have decades of trivial SQL vulnerabilities due to string construction of queries from untrusted input, and when placeholders are finally pretty common, people are still pooh-poohing other necessary measures with the exact same arguments used to say that placeholders weren't necessary? This is disgraceful. |
I'm honestly kinda surprised that after 6 1/2 years there doesn't seem to be a standard way of escaping a user-input string in sql using go. |
For: github.com/go-sql-driver/mysql I couldn't stand to keep mexing quotes with + concatenation with fmt.Sprintf so I tried to create a standard. I know this is far from perfect but I will share what I did and fixed for me: Codepackage repository
import (
"fmt"
"reflect"
"strings"
)
type Repository struct{}
type BuildQueryParams struct {
Placeholders []BuildQueryPlaceholder
}
type BuildQueryPlaceholder struct {
Key string
Params []interface{}
}
// NewRepository get a new instance
func NewRepository() *Repository {
return &Repository{}
}
func (r *Repository) BuildQuery(rawSql string, buildQueryParams BuildQueryParams) (string, []interface{}) {
finalQuery := strings.ReplaceAll(rawSql, "%q", "`")
finalParams := []any{}
for _, placeholder := range buildQueryParams.Placeholders {
pMask, params := r.getPlaceholders(placeholder.Params...)
finalParams = append(finalParams, params...)
finalQuery = strings.ReplaceAll(finalQuery, fmt.Sprintf(":%s", placeholder.Key), pMask)
}
return finalQuery, finalParams
}
func (r *Repository) SpreadSlice(slice any) []interface{} {
s := reflect.ValueOf(slice)
if s.Kind() != reflect.Slice {
panic("InterfaceSlice() given a non-slice type")
}
// Keep the distinction between nil and empty slice input
if s.IsNil() {
return nil
}
ret := make([]interface{}, s.Len())
for i := 0; i < s.Len(); i++ {
ret[i] = s.Index(i).Interface()
}
return ret
}
func (r *Repository) getPlaceholders(values ...interface{}) (placeholders string, parameters []interface{}) {
n := len(values)
p := make([]string, n)
parameters = make([]interface{}, n)
for i := 0; i < n; i++ {
p[i] = "?"
parameters[i] = values[i]
}
placeholders = strings.Join(p, ",")
return placeholders, parameters
} Usagesql, params := r.repository.BuildQuery(`
SELECT
ch.%qname%q,
ch.client_username
FROM %qcharacter%q ch
INNER JOIN %qclient%q cl ON ch.client_username = cl.username
WHERE ch.%qname%q IN (:p1)
AND cl.client_status_id = :p2
AND ch.character_status_id = 1;`, BuildQueryParams{
Placeholders: []BuildQueryPlaceholder{
{
Key: "p1",
Params: r.repository.BindSlice([]any{
"character1", "character2", "character3",
}),
},
{
Key: "p2",
Params: r.repository.BindSlice([]any{
1,
}),
},
},
})
fmt.Println("Final SQL", sql)
fmt.Println("Final Params", params) OutputFinal SQL
SELECT
ch.`name`,
ch.client_username
FROM `character` ch
INNER JOIN `client` cl ON ch.client_username = cl.username
WHERE ch.`name` IN (?,?,?)
AND cl.client_status_id = ?
AND ch.character_status_id = 1;
Final Params [character1 character2 character3 1] Query itrows, err := connection.QueryContext(executionCtx, sql, params...) The final sql will be replaced with "sql quotes" for each %q and the placeholders automatically added for each mapped key that starts with semi colon (:). All placerholder parameters will be combined in sequence to be used for the prepared statement bind. BindSlice function is used to generically pass any kind of slice to be appended to final parameters. That's is! |
Your solution does not escape input strings, its a workaround to being unable to use backticks in a multi-line literal. This issue was opened because string replace isn't a proper solution. It makes the application vulnerable to SQL injection. What we asking for is to add Yes, it's possible to roll your own solution by following the docs but the whole point of the It's only been 7 years, there is no rush. |
There is also the issue, that quoting rules are database or even connection dependent, so if the drivers do not provide quoting functions, there is no way to write database agnostic code. I fail to see why the maintainers chose not to require drives to provide such functions. Database vendors do provide quoting functions in their C libraries, for good reasons. |
May want to use a permalink instead as code often changes over time. Regular links will eventually point to nowhere: https://github.com/pingcap/tidb/blob/eafb78a9c6b8ff6f9c00188ca16fc63b41ae4e56/pkg/util/sqlexec/utils.go#L97 This code doesn't appear to provide escaping but rather formats parameters as inline SQL which doesn't really solve anything and is prone to SQL injection. It is also using MySQL-specific syntax per readme.md of the project. |
Nit: "Escape" sounds like it would do something like I think we can skip escaped level and have a "QuoteString" API which simultaneously escapes and adds the quotes: TLDR compare these two hypothetical styles: -Sprintf("select myField from \"%s\" where myValue = '%s'", conn.EscapeIdentifier(tableName) conn.EscapeString(value))
+Sprintf("select myField from %s where myValue = %s", conn.QuoteIdentifier(tableName), conn.QuoteString(value)) the former might not even be portable due to hard-coded |
What version of Go are you using (
go version
)?go version go1.7.4 linux/amd64
What operating system and processor architecture are you using (
go env
)?GOARCH="amd64"
GOBIN=""
GOEXE=""
GOHOSTARCH="amd64"
GOHOSTOS="linux"
GOOS="linux"
GOPATH="/home/nefthy/go-test/"
GORACE=""
GOROOT="/usr/lib/go"
GOTOOLDIR="/usr/lib/go/pkg/tool/linux_amd64"
CC="x86_64-pc-linux-gnu-gcc"
GOGCCFLAGS="-fPIC -m64 -pthread -fmessage-length=0 -fdebug-prefix-map=/home/nefthy/go-test/tmp/go-build451484149=/tmp/go-build -gno-record-gcc-switches"
CXX="x86_64-pc-linux-gnu-g++"
CGO_ENABLED="1"
What did you do?
There are situations when strings need to be escaped in queries that can not be done with placeholders. An example the following queries cannot be expressed with ? placeholders:
Using Sprintf is no option, since the identifiers need to be properly quoted. The quoting and escaping is inherently vendor specific and may even depend on configuration on a per database/connection basis (hello there MySql...).
What did you expect to see?
The driver must export Quoting which are passed along by the database/sql Api. As far as I can tell the folling functions are needed
What did you see instead?
No escaping/quoting functions
The text was updated successfully, but these errors were encountered: