fix: case-insensitivity in the `like()` method when in use with accented characters #9238

michalsn · 2024-10-25T20:43:54Z

Description
This PR fixes the case insensitivity option in the like() method when we deal with accented characters.

Closes #9236

Checklist:

Securely signed commits
Component(s) with PHPDoc blocks, only if necessary or adds value
Unit testing, with >80% coverage
User guide updated
Conforms to style guide

system/Database/BaseBuilder.php

tests/_support/Database/Seeds/CITestSeeder.php

datamweb

LGTM!
@michalsn Thanks.

ddevsr

Add more case test

datamweb · 2024-10-29T19:17:47Z

tests/system/Database/Live/LikeTest.php

@@ -79,6 +79,12 @@ public function testLikeCaseInsensitive(): void
    #[DataProvider('provideMultibyteCharacters')]
    public function testLikeCaseInsensitiveWithMultibyteCharacter(string $match, string $result): void
    {
+        if ($this->db->DBDriver === 'SQLSRV') {
+            $this->markTestSkipped(
+                'Currently Builder class does not fully support Unicode strings in SQLSRV.'


To be honest, I didn’t fully understand the details of this topic. While researching, I came across the following information:

nchar-and-nvarchar-transact-sql

"Character data types that are either fixed-size, nchar, or variable-size, nvarchar. In SQL Server 2012 (11.x) and later versions, when a Supplementary Character (SC) enabled collation is used, these data types store the full range of Unicode character data and use the UTF-16 character encoding. If a non-SC collation is specified, then these data types store only the subset of character data supported by the UCS-2 character encoding."

Based on this information, it seems that when an SC collation is used, there shouldn’t be any limitations on Unicode character support, as UTF-16 is capable of storing the full range of Unicode characters.

If possible, please provide additional documentation or clarification on these changes and specify how the SC collation should be applied in this PR.

I was going to post the details as a bug report, but I gave it another try after your comment.

You are right. When I was checking this, I set collation only for the value field. Apparently, we have to set it for the entire database to work properly. Because of this, I was only able to add Unicode string using N prefix.

Thank you!

Thanks for the explanation.

samsonasik · 2024-10-30T10:07:17Z

@michalsn rebase is needed

michalsn · 2024-10-30T10:56:12Z

@samsonasik Thanks, done.

…d characters

Co-authored-by: Pooya Parsa <[email protected]>

michalsn added the bug Verified issues on the current code behavior or pull requests that will fix them label Oct 25, 2024

datamweb reviewed Oct 25, 2024

View reviewed changes

system/Database/BaseBuilder.php Outdated Show resolved Hide resolved

tests/_support/Database/Seeds/CITestSeeder.php Show resolved Hide resolved

michalsn requested a review from datamweb October 26, 2024 07:21

datamweb approved these changes Oct 26, 2024

View reviewed changes

michalsn mentioned this pull request Oct 26, 2024

Bug: insensitive like query failing with multibyte characters #9236

Closed

ddevsr suggested changes Oct 28, 2024

View reviewed changes

datamweb reviewed Oct 29, 2024

View reviewed changes

michalsn force-pushed the fix/caseInsensitive branch from 3e3e722 to 7742124 Compare October 30, 2024 10:55

samsonasik approved these changes Oct 30, 2024

View reviewed changes

datamweb approved these changes Oct 30, 2024

View reviewed changes

ddevsr approved these changes Oct 31, 2024

View reviewed changes

michalsn and others added 7 commits November 3, 2024 19:40

fix: case-insensitivity in the like() method when in use with accente…

725292c

…d characters

Update system/Database/BaseBuilder.php

bd3059c

Co-authored-by: Pooya Parsa <[email protected]>

add more cases for tests

ecf96bf

even more tests

01a636d

fix types

4f1f8d0

skip unicode strings for SQLSRV

e5ebbbb

set collation for sqlsrv

3715280

michalsn force-pushed the fix/caseInsensitive branch from 7742124 to 3715280 Compare November 3, 2024 18:41

michalsn merged commit fc19b69 into codeigniter4:develop Nov 3, 2024
40 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: case-insensitivity in the `like()` method when in use with accented characters #9238

fix: case-insensitivity in the `like()` method when in use with accented characters #9238

michalsn commented Oct 25, 2024

datamweb left a comment

ddevsr left a comment

datamweb Oct 29, 2024

michalsn Oct 29, 2024

datamweb Oct 30, 2024

samsonasik commented Oct 30, 2024

michalsn commented Oct 30, 2024

fix: case-insensitivity in the like() method when in use with accented characters #9238

fix: case-insensitivity in the like() method when in use with accented characters #9238

Conversation

michalsn commented Oct 25, 2024

datamweb left a comment

Choose a reason for hiding this comment

ddevsr left a comment

Choose a reason for hiding this comment

datamweb Oct 29, 2024

Choose a reason for hiding this comment

michalsn Oct 29, 2024

Choose a reason for hiding this comment

datamweb Oct 30, 2024

Choose a reason for hiding this comment

samsonasik commented Oct 30, 2024

michalsn commented Oct 30, 2024

fix: case-insensitivity in the `like()` method when in use with accented characters #9238

fix: case-insensitivity in the `like()` method when in use with accented characters #9238