Add Rename Symbol Capability #915

mcecode · 2023-07-19T20:24:10Z

Hi, I'd like to give implementing #161 a try, though it might take some time since I'm new to language servers and renaming in Bash doesn't seem to be as straightforward as in other languages. I have onPrepareRename working already, so hopefully onRenameRequest won't be too difficult to implement.

For globally defined variables, I'm planning to base the rename on the includeAllWorkspaceSymbols config. If it's set to false, then the symbol would only be renamed within files linked by sourcing. If it's set to true, then the symbol would be renamed throughout the whole workspace.

For variables that are not defined, renaming would only happen within the file. I'm thinking this could be useful for renaming environment variables like if someone wants to change $HOME to $PWD for some reason.

For special variables, I'm thinking $1 to $9 could be renamed since it's common to assign these to variables with more descriptive names. It could be useful, for example, if someone created a quick script or function and now wants to clean it up: they could just rename $1 to $myVar and assign myVar="$1" at the top of the script or function. I don't think other special variables like $@ or $# should be renamed since they already have meaning in and of themselves.
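As a rough illustration of the cleanup described above (a sketch only; the function names greet_before and greet_after are made up for this example, and local is used here instead of a plain assignment), the before and after states could look like this:

```bash
# Before: a quick function that uses the positional parameter directly.
greet_before() {
  echo "Hello, $1!"
}

# After the proposed rename of $1 to $myVar: the parameter is assigned once
# at the top, and every former $1 now reads $myVar.
greet_after() {
  local myVar="$1"
  echo "Hello, $myVar!"
}

greet_before "World"
greet_after "World"
```

Both functions behave the same; only the name used inside the body changes.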

Feedback on the approach I'm thinking of taking and the work I've done so far would be much appreciated :)

Todos:

skovhus · 2023-07-19T20:55:19Z

Thanks for looking into this! And great that you are opening a draft PR, makes it easier to help you.

I do believe we have all the building blocks available to implement this feature. Also note that a lot of the required functionality is available through https://github.com/bash-lsp/bash-language-server/blob/main/server/src/util/declarations.ts (which is what the analyzer uses to get declarations). Declarations contain environment variables, variables, and functions.

> For special variables, I'm thinking $1 to $9 could be renamed since it's common to assign these to variables with more descriptive names. It could be useful, for example, if someone created a quick script or function and now wants to clean it up: they could just rename $1 to $myVar and assign myVar="$1" at the top of the script or function. I don't think other special variables like $@ or $# should be renamed since they already have meaning in and of themselves.

I suggest descoping that for the first version (unless this is easy to implement).

mcecode · 2023-07-19T21:10:31Z

Thanks for the quick response! I'm still getting familiar with the code base so thanks for the tip on using the functionality in declarations.ts.

I'll take your advice and descope my ideas about special variables for now since I'm not sure how complex it is to implement.

mcecode · 2023-08-16T17:57:38Z

Hi @skovhus, it took a bit of exploration and experimentation, but I think I'm more or less done with renaming not defined and scoped variables and functions, and I think renaming global variables and functions isn't too far off. Before I continue though, I'd like to get your feedback on what I've done and the approach I took so I know I'm going in the right direction.

Cases covered

Renaming not defined variables

1-not-defined.webm

Renaming function scoped local variables

2-scoped-function.webm

Renaming subshell scoped functions and variables

3-scoped-subshell.webm

Some scope awareness

4-scope-awareness-function.webm

5-scope-awareness-subshell.webm

Differentiation between functions and variables with the same name

6-differentiate-variables-and-functions.webm

Though I think function nesting, subshells, and such are rarely needed or used, I opted to support them so that users who do rely on them won't get too janky an experience.

Limitations

Variable names not typed as variable_name by tree-sitter-bash can't be renamed. Examples of this are variables inside arithmetic expansions and C-style for loops.

In an arithmetic expansion containing n+n, the whole n+n is typed as a word with a command_name parent. Even if the operands are separated by spaces, each piece is just seen as a separate word.

To support these cases, we'd need to parse inside these constructs ourselves, which would be quite complex to do.
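For reference, here is a small script with the kinds of occurrences this limitation is about (an illustrative sketch; the variable names n, double, i, and total are made up for this example). Bash treats each of them as a variable, so a complete rename would have to touch them too:

```bash
n=21

# n is read inside the arithmetic expansion without a leading "$".
double=$((n + n))

# i and total are assigned and read inside the C-style loop header and body.
total=0
for ((i = 1; i <= 3; i++)); do
  ((total += i))
done

echo "double=$double total=$total"
```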
Scope awareness breaks down for complex nesting and scopes.

In the example below, scope-wise, $var inside 3 should not be renamed when renaming var inside 1.

7-limitation-complex-nesting.webm

In this other example, $var inside calleeFunc is not renamed when renaming var inside callerFunc even if, scope-wise, they are the same variable.

8-limitation-complex-scopes.webm
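A minimal sketch of that second case, reusing the callerFunc and calleeFunc names from the description (the function bodies are assumed for illustration). Because Bash scopes local variables dynamically, the callee reads the caller's local at run time:

```bash
calleeFunc() {
  # No local declaration here: $var resolves to the caller's local at run time.
  echo "callee sees: $var"
}

callerFunc() {
  local var="from caller"
  calleeFunc
}

callerFunc
```

Nothing in calleeFunc's own text ties $var to callerFunc, which is why a purely lexical search for occurrences misses it.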

This limitation comes from the heuristics I used when finding symbol instances. Though I could try to cover these and other edge cases, I think the effort-to-benefit ratio wouldn't be worth it right now as not a lot of users would run into them.
Only the first variable_assignment is caught per declaration_command, so in the example below, only a="a" would be caught.
```
local a="a" b="b" c="c"
```
Truthfully, I simply forgot that multiple variable_assignments can appear in one declaration_command while implementing Analyzer#findOriginalDeclaration, which is why it doesn't handle them. I don't think multiple variable_assignments in one declaration_command are used often, so this can be left as is, but I can add support if you think it would be useful for users.
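To make the case concrete, here is a small sketch (the function name show_locals is made up for this example) in which a single declaration command declares three locals, all of which are legitimate rename targets:

```bash
show_locals() {
  # One declaration command, three variable assignments.
  local a="a" b="b" c="c"
  echo "$a $b $c"
}

show_locals
```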

Approach taken

The initial approach I took is as follows:

1. Get the word at the given position using Analyzer#wordAtPointFromTextPosition.
2. Find the word's declaration using Analyzer#findDeclarationsMatchingWord.
3. Find all of the word's occurrences using Analyzer#findOccurrences.

I quickly learned that though this approach worked well enough for renaming not defined variables, it broke down when dealing with things like local declarations and scopes. So, I ended up slowly adding methods to Analyzer as I revised my approach, which became:

1. Get the symbol, either a function or a variable, at the given position using Analyzer#symbolAtPointFromTextPosition.
2. Find the symbol's original definition within the scope it's in using Analyzer#findOriginalDeclaration.
3. Find the definition's scope using Analyzer#findParentScope.
4. Find all of the symbol's occurrences inside that scope using Analyzer#findOccurrencesWithin.

I added methods instead of changing what's already there since I didn't want to interfere with the current functionality and diagnostics provided by the language server and risk breaking the current experience for users. However, I do think what I added could be incorporated later on by other handlers. For example, I think Analyzer#findOccurrencesWithin's implementation could replace Analyzer#findOccurrences' which would give onDocumentHighlight and onReferences more accurate results.

That said, I do think I'm adding quite a bit of code, and I do realize that the onus of reviewing all of this would fall onto you. So, I wanted to know if you're alright with how I'm going about things and with all of the cases I'm trying to cover. Do you think I did too much? Should I tone down what I'm trying to support? Did I miss something and maybe I'm just reimplementing things that are already available in the code base? How could I make this easier for you to review later on?

skovhus

Amazing work here. 👏 👏 👏 I quickly looked over the code and came up with some suggestions. But I'm really happy about the level of communication here and the videos.

I would love it if you invested in writing some high-level unit tests covering all the cases we handle and do not handle.

server/src/server.ts

server/src/analyser.ts

skovhus · 2023-08-16T20:05:06Z

server/src/analyser.ts

+  /**
+   * A more scope-aware version of findOccurrences.
+   */
+  public findOccurrencesWithin({


I didn't look too carefully here, but we might be able to reuse some functionality from utils/declarations.

skovhus · 2023-08-16T20:06:19Z

server/src/analyser.ts

+      parent = this.findParentScopeNode(uri, parent.startPosition, parent.endPosition)
+    }
+
+    // TODO: Handle global definitions


Do you mean declarations across files? getGlobalDeclarations should help with this.

mcecode · 2023-08-17T17:52:08Z

Thanks for the review, I appreciate it. I'll update the implementation based on your suggestions.

Don't worry about the tests, I've kept track of the cases covered and not and plan to write them in test form when the implementation is a bit more locked in. I've been adding and removing methods here and there during my exploration and didn't want to add and remove tests at the same time.

Since you seem to be alright with the general direction I'm going, I'll just ping you again when I'm done with the implementation + tests. Hopefully it won't take me so long this time 😅 Thanks again!

skovhus · 2023-09-25T09:26:56Z

It seems like this is pretty close! 👏

mcecode · 2023-09-25T10:38:29Z

Hi! Yeah, it's pretty close now. Taking me a while since I've only been able to chip at it little by little and I had to think through a bit on how workspace-wide renames should work, but the implementation's pretty much done. I just have to fix the failing tests due to snapshots not expecting onPrepareRename and onRenameRequest handlers, add the tests for renaming, and probably refactor a bit since some of the methods I've added have become quite long and unwieldy.

I'll probably make an updated write-up of the cases covered, limitations, and approach taken when I'm done as some things have changed since the last one I commented. Heads up though, I tried to use as much of what's already in the code base, but I wasn't able to use what's in util/declarations.ts because it returns declarations based on the latest definition. The implementation I've come up with uses the original declaration/definition to gauge the scope of where to search for words that match. I'd be glad to know if I missed something and I can use it somewhere.

skovhus · 2023-09-25T10:41:07Z

> The implementation I've come up with uses the original declaration/definition to gauge the scope of where to search for words that match. I'd be glad to know if I missed something and I can use it somewhere.

No worries.

Handle multiple variable assignments and declarations per declaration command when searching inside functions

Handle multiple variable assignments and declarations per declaration command when searching globally

Handle multiple variable assignments and declarations in one declaration command

mcecode · 2023-10-19T10:11:50Z

Hi @skovhus, I'm finally done. There are a lot of lines added, but most of those are from tests and snapshots. I'm not sure if it's overkill, but I wanted to document as much behavior as I reasonably could while taking into account some edge cases, and that's what I ended up with.

Now, onto the updates. I'll be skipping visuals this time since I think the tests and example cases in renaming.sh should cover that.

Cases covered

Renaming undeclared symbols
Renaming locally scoped symbols (local variables in functions, and variables and functions declared in subshells), with some scope awareness
Renaming globally scoped symbols, both file-wide and workspace-wide, taking into account the includeAllWorkspaceSymbols flag
Differentiating between variables and functions with the same name
Handling requests with non-renamable symbols (onPrepareRename) and invalid symbol names (onRenameRequest)

Limitations

Not all variables are typed by tree-sitter-bash as variable_name, so those variables won't be renamed
Scope awareness breaks down for complex scopes and nesting
Only takes into account subshells created with ( and ), so, for example, subshell scopes created with the pipe (|) operator aren't recognized
Doesn't take into account sourcing location and scope, so, for example, sourcing inside a subshell will affect symbols outside of that subshell
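As an illustration of the pipe limitation (a sketch with made-up values, assuming Bash's default behavior with the lastpipe option off), both constructs below create a subshell scope, but only the parenthesized one is recognized as such:

```bash
var="outer"

# Subshell created with ( and ): the assignment does not leak out.
(var="paren"; echo "in parens: $var")

# Each side of a pipe also runs in its own subshell by default, so this
# read only changes var inside the braces on the right-hand side.
echo "piped" | { read -r var; echo "in pipe: $var"; }

echo "after: $var"
```

In both cases var in the outer shell keeps its original value, so scope-wise the inner var in the pipeline is a different symbol from the outer one.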

You can find tests that expound and give more examples of these in the "Edge or not covered cases" under the "onRenameRequest" describe block and some parts of the "onPrepareRename" describe block. Honestly though, there are probably more limitations that I'm just not aware of since there are so many cases that can be created with Bash.

Approach taken

There are some changes from the original, but the overall steps are the same.

1. Get the symbol, either a function or a variable, at the given position using Analyzer#symbolAtPointFromTextPosition.
2. Find the symbol's original declaration and scope based on its original definition using Analyzer#findOriginalDeclaration. The general heuristic used here is that the original declaration should be in the same scope as, or a higher scope than, the symbol found in step 1, while also being higher up in the file.
3. Based on the original declaration and scope found in step 2, find all of the symbol's occurrences inside the scope, the file, or the whole workspace using Analyzer#findOccurrencesWithin, taking into account things like scoping and the includeAllWorkspaceSymbols flag.

Thank you so much for your patience on this. Feel free to tell me if there's anything else I need to do or if I missed anything and I'll try to address it as soon as I can.

Shane-XB-Qian · 2023-10-19T12:02:12Z

I did a rough test and it works fine, thanks.
But I found a case: in `while read a b; do echo $b; done`, b cannot be renamed.
Is that expected? It doesn't seem to be one of the "Edge or not covered cases".

mcecode · 2023-10-20T03:20:05Z

Hi @Shane-XB-Qian, thanks for testing it on your setup. Glad it works well.

Unfortunately, yes, that is expected: b is typed by tree-sitter-bash as word rather than variable_name, so it falls under the first limitation that I listed. I've added a while read loop test case since that's a common use case and updated the first test's name in "Edge or not covered cases" to make the limitation being tested more explicit.
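For context, a runnable version of that loop (a sketch; the split_fields wrapper and the sample input are made up for this example, and -r is added to read) shows that a and b are introduced by read itself, without any assignment syntax:

```bash
split_fields() {
  # read assigns the first field to a and the rest of the line to b, even
  # though neither name appears with a leading "$" or an "=" here.
  while read -r a b; do
    echo "a=$a b=$b"
  done
}

printf '%s\n' "one two" "three four five" | split_fields
```

Only the $a and $b inside the loop body use the usual variable syntax; the names after read are plain arguments to a command.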

skovhus · 2023-12-27T12:27:22Z

Sorry for the late review. Amazing work here @mcecode – let me know if you have other ideas for improving the LSP. I would love some help maintaining this project...

mcecode force-pushed the rename-symbol branch from 4bb8fc7 to ceaf963 Compare August 3, 2023 18:56

skovhus reviewed Aug 16, 2023


mcecode added 21 commits September 26, 2023 07:48

Add rename handlers

470f8cf

Implement onPrepareRename

47ad106

Improve and refactor onPrepareRename checks

a18090c

Implement onRenameRequest checks

e79d598

Implement renaming not defined variables

bd69d87

Implement local variable rename naively for single-level functions

b2e2256

Skip renaming shadowed local variables in nested functions

894a724

Make findParentFunction find functions defined with function keyword

bf9a422

Extract nodeAtPoints from findOccurrencesWithin

63e5360

Allow renaming local variables within nested functions

e4b73e1

Rename only variables inside functions

5829d95

Handle renaming variables and functions inside subshells

bbe6a45

Skip renaming shadowed variables and functions in nested subshells

316b609

Only rename functions inside subshells

d68ad68

Implement findOriginalDeclaration initially and use

ae97012

Recurse through nodes serially in findOriginalDeclaration

caa704a

Handle function defintions in findOriginalDeclaration

6f11cac

Improve findOriginalDeclaration heuristics

5e114ee

Replace serialForEach with breadthForEach

3fbea1e

Initial reimplementation of findOccurrencesWithin

91dd91d

Improve findOccurrencesWithin scope awareness

d365b90

mcecode added 5 commits September 26, 2023 07:48

Improve findOriginalDeclaration heuristics

187971a

Handle multiple variable assignments and declarations per declaration command when searching inside functions

Improve findOriginalDeclaration heuristics

ac11e2c

Handle multiple variable assignments and declarations per declaration command when searching globally

Change global rename logic

49b97ab

Improve renaming when includeAllWorkspaceSymbols is true

fd9a01c

Improve findOccurrencesWithin heuristics

a96d50c

Handle multiple variable assignments and declarations in one declaration command

mcecode force-pushed the rename-symbol branch from 0de7536 to a96d50c Compare September 26, 2023 07:48

mcecode added 9 commits September 26, 2023 17:19

Fix failing test and add expects for rename handlers registration

5ba6b4c

Add onPrepareRename tests and remove unneeded check

4f7bf1e

Revert removing check and update onPrepareRename tests

bbab0c4

Add initial onRenameRequest test cases

676b571

Add more onRenameRequest tests and cases

638c19a

Add more onRenameRequest tests

ad379eb

Fix bugs, finish rename symbol tests, and update snapshots

5c13080

Refactor and fix comments

0977049

Update docs

9a1b79c

mcecode marked this pull request as ready for review October 19, 2023 10:13

mcecode changed the title ~~[WIP] Add Rename Symbol Capability~~ Add Rename Symbol Capability Oct 19, 2023

Add while read test and update test name

612f4ab

skovhus mentioned this pull request Dec 27, 2023

rename & refactor #1070

Closed

skovhus approved these changes Dec 27, 2023


skovhus added 2 commits December 27, 2023 13:24

Merge branch 'main' into rename-symbol

565d911

Prepare for releasing 5.1.0

29b1676

skovhus enabled auto-merge December 27, 2023 12:26

skovhus merged commit 4d7ff81 into bash-lsp:main Dec 27, 2023

mcecode deleted the rename-symbol branch December 31, 2023 09:57

mcecode mentioned this pull request Apr 5, 2024

'textDocument/rename' does not include variable references in (( )) (arithmetic) #1134

Closed
