Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

major changes to bash_startup.in #10

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

geoff-nixon
Copy link

Well, I started on fish, but then I realized I really needed to figure out what was supposed to be happening, and started looking at bash for the "reference material". But I needed to do a lot of cleanup here so I could read it. By the time I was finished there was so much cruft, it ended up basically a rewrite.

Let me know if I dropped anything that was important. I don't think so, but... 😟

Anyway lots of these changes are for ksh/POSIX compatibility and will translate over fully. Those will be a little tricky because you have to do all the work within PS1 and PS2, but it usually works out.

Anyway, this should be much faster too, eliminated lots of redundancies. It seems to be working for me. Can you test?

…. I needed to clean it up so I could actually understand what it was doing; but there was so much cruft, it ended up like this. I think it should be faster; hopefully it works as expected?
# Note: this module requires 2 bash features which you must not otherwise be
# using: the "DEBUG" trap, and the "PROMPT_COMMAND" variable. iterm2_preexec_install
# will override these and if you override one or the other this _will_ break.
osc='\033]'; cso='\007' # Beginning, ending escape sequences.
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's not use more than one command per line. I find it harder to read.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I haven't seen \007 called "CSO" before. Where does the acronym come from?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's not use more than one command per line. I find it harder to read.

Fair enough.

I haven't seen \007 called "CSO" before. Where does the acronym come from?

I just wanted a short variable name and couldn't think of one. It's just OSC backwards. Was kinda thinking like case esac, if fi...

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

BEL is the "official" name for \007, may as well use that.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah. right. Probably use esc instead of osc then too?

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, it's actually an OSC (Operating System Command). ESC + ] = OSC, per ECMA 48 and various other standards. I use the xterm docs as the most convenient almanac for this stuff: http://invisible-island.net/xterm/ctlseqs/ctlseqs.html

@gnachman
Copy link
Owner

Thanks for taking on this rather gnarly cleanup. I wrote lots of nitpicky comments because this code is kind of scary (not my baliwick!) and it's really hard to test all the crazy things you find in the wild, so you have to be a little paranoid.

@geoff-nixon
Copy link
Author

I wrote lots of nitpicky comments because this code is kind of scary (not my baliwick!) and it's really hard to test all the crazy things you find in the wild, so you have to be a little paranoid.

Yeah, unfortunately I only looked at the original somewhat late in the process, so it was hard to tell your comments from the original in places. I'll bring what we need back in/do some more documentation.

@geoff-nixon
Copy link
Author

This is taking a bit longer that I thought... might be a day or two till I can work through it all.

@gnachman
Copy link
Owner

No worries, thanks for taking the time to fix this :)

# has been tested with the default shells on MacOS X 10.4 "Tiger", Ubuntu 5.10
# "Breezy Badger", Ubuntu 6.06 "Dapper Drake", and Ubuntu 6.10 "Edgy Eft".
# tmux / screen are not supported; I think this is the theoretical tmux hack?
[ $TERM = screen ] && osc='\033Ptmux;\033\033]' && cso='\007\033\\'
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI, It was suggested to me that this would be a better test for TERM:

["${TERM:-}" != screen*]
This would seem to handle an empty $TERM better, but I'm not sure what benefit the * confers.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI: That only works on bash and zsh, not pure sh. And it needs the double brackets (i.e. [[...]]).

A posix sh version would be:

#!/bin/sh

is_tmux_or_screen=f
if [ -n "$TMUX" ]; then
  is_tmux_or_screen=t
fi

case "$TERM" in
  screen*)
    is_tmux_or_screen=t
    ;;
esac

if [ "$is_tmux_or_screen" = f ]; then
  ...
fi

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gnachman

This would seem to handle an empty $TERM better, but I'm not sure what benefit the * confers.
Well, I have seen variants like screen-256color, so I get that part. But does having an empty $TERM actually happen? I suppose you can unset it, but I'd think bad things would start to happen pretty quickly.

@docwhat
You're right in that 1) its not portable (and that it would need [[ ]]), and testing $TMUX is a good idea, probably more reliable. But there's a lot of extraneous code there.

Also, while its extremely common, if [ condition ]; then is redundant, and if then's are one of the slowest structures in shell scripts. There are rare occasions when you need it but generally, (like in case), all you need is something like:

[ -z "$TMUX" ] && case "$TERM" in
  screen*) is_tmux_or_screen=t ;; # or, in our use case here, do nothing
     ''|*) is_tmux_or_screen=f ;; # or, in our case, put our commands here
esac || is_tmux_or_screen=t

which is much faster, because we don't execute additional tests if they're not needed.

@geoff-nixon
Copy link
Author

Sorry, I haven't forgot about this! Last week was kinda crazy for me, and this week may be as well. However, a bit of good news:

  • I've worked out a lowest-common-denominator POSIX-compliant way of doing basic shell integration, using only PS1 and some functions.

Sourcing the following works with bash-as-sh (version 2+), dash, mksh, ksh (93), pkdsh, yash.

_it2a(){ status=$?; printf '\001'; return $status ;}
_it2b(){ status=$?; printf '\002'; return $status ;}
_it2c(){ status=$?; printf '\015'; return $status ;} 
PS1='$(_it2a)$(_it2c)$(itput -cd $?; itput -a)$(_it2b)'$PS1'$(_it2a)$(itput -b)$(_it2b)'
readonly PS1

itput is just an abstraction of some some of the escape sequences wrapped up in a script to keep me form going nuts; there's no reason this need to live in a separate script per se:

#!/bin/sh
for last; do :; done

iterm2_hostname=$(hostname -f)
iterm2_print_user_vars() { :; }
iterm2_print_state_data() {
  printf "\033]1337;RemoteHost=$USER@$iterm2_hostname\007"
  printf "\033]1337;CurrentDir=$PWD\007"
  iterm2_print_user_vars
}

iterm2_prompt_start(){        printf "\033]133;A\007"    ; }
iterm2_prompt_end(){          printf "\033]133;B\007"    ; }
iterm2_before_cmd_executes(){ printf "\033]133;C\007"    ; }
iterm2_after_cmd_executes(){  printf "\033]133;D;$*\007" 
  iterm2_print_state_data
}

iterm2_set_user_var() {
  printf "\033]1337;SetUserVar=%s=%s\007" "$1" $(printf "%s" "$2" | base64)
  }

OPTIND=1
while getopts "abcd" itput_opt; do
  case "$itput_opt" in
    a) iterm2_prompt_start ;;
    b) iterm2_prompt_end   ;;
    c) iterm2_before_cmd_executes ;;
    d) iterm2_after_cmd_executes "$last" ;;
  esac
done
shift $((OPTIND - 1))

Setting PS1 to readonly is ugly, but if there's no other hook to use, its the only way to keep it from being overwritten. Also, probably should add a SIGINT trap since a 'ctrl-c' on an incomplete line causes the return value to change.

I've also looked into the various features of the most common shells, and there's a means to do a more robust integration in ksh93 (it has the pseudo-signal DEBUG trap) and mksh has a special operator ${| } that can be used to do the same thing.

The ^A/^B/CR trick is from the mksh bible manual, page 12; it accomplishes the same thing as the bash \[ \] construct.

@geoff-nixon
Copy link
Author

Also, I believe GNU screen always sets the variable STY, so its probably possible to avoid having to check $TERM at all.

@gnachman
Copy link
Owner

gnachman commented Jul 7, 2015

Regarding the readonly PS1, can the user still change it? With the current code changes to PS1 get picked up, and this was added in response to a lot of user feedback, so it can't be given up.

An environment var list STY is less likely to get propagated by ssh than $TERM; plus you'd need another check for tmux in addition to screen.

This looks pretty elegant and I like where it's going so far.

@geoff-nixon
Copy link
Author

Leaving aside tmux and friends for the moment:

Regarding the readonly PS1, can the user still change it? With the current code changes to PS1 get picked up, and this was added in response to a lot of user feedback, so it can't be given up.

The following (just the one file now) works identically in bash, bash-as-sh, ksh[93], and zsh, and abides any changes to PS1. In the rest of the family - pdksh, dash, mksh - which really aren't meant to be used as interactive shells anyway - we can make PS1 read-only, or just let shell integration die if the user changes the prompt.

[ -z "$ZSH_VERSION" ] || setopt PROMPT_SUBST

itput(){
 for last; do :; done

  iterm2_print_user_vars() { :; }

  : ${iterm2_hostname:=${HOSTNAME:-$(hostname -f)}}

  iterm2_print_state_data() {
    printf "\033]1337;RemoteHost=$USER@$iterm2_hostname\007"
    printf "\033]1337;CurrentDir=$PWD\007"
    iterm2_print_user_vars
  }

  iterm2_prompt_start(){        printf '\033]133;A\007'    ;}
  iterm2_prompt_end(){          printf '\033]133;B\007'    ;}
  iterm2_before_cmd_executes(){ printf "\033]133;C\007"    ;}
  iterm2_after_cmd_executes(){  printf "\033]133;D;$*\007" 
                                iterm2_print_state_data ;}

  iterm2_set_user_var() {
    printf "\033]1337;SetUserVar=%s=%s\007" "$1" $(printf "%s" "$2" | base64) ;}

  OPTIND=1
  while getopts "abcd" itput_opt; do
    case "$itput_opt" in
      a) iterm2_prompt_start ;;
      b) iterm2_prompt_end   ;;
      c) iterm2_before_cmd_executes ;;
      d) iterm2_after_cmd_executes "$last" ;;
    esac
  done
  shift $((OPTIND - 1))
}

_it2a(){ retstatus=$?; printf '\001'; return $retstatus ;}
_it2b(){ retstatus=$?; printf '\002'; return $retstatus ;}
_it2c(){ retstatus=$?; printf '\015'; return $retstatus ;} 

it2_rotate_ps1(){
  [ x"$PS1" != x"$OLD_PS1" ] && PS1='$(_it2a)$(_it2c)$(itput -cd $?; itput -a)$(_it2b)'$PS1'$(_it2a)$(itput -b)$(_it2b)' && OLD_PS1=$PS1
}

it2_rotate_ps1

trap 'tput -T $TERM el1' INT # Squash bad return values on ctrl-C.
trap it2_rotate_ps1 DEBUG >/dev/null 2>&1 || readonly PS1 # Or, lose integration.

Basically, the only bit that's needed is the DEBUG psedo-trap to survive changes to PS1.
This is still crufty - but compared to the "bash pre-exec" code path, it already makes 1/4 the number of executions per prompt.

So if you think this is a direction you'd like take this (i.e., generic code for bash, zsh, and basically anything else that's not fish or tcsh), I'll try to clean it up and push a new commit to this PR?

@gnachman
Copy link
Owner

Sorry for the delay in getting back to you. Real life intervened for a bit.

How much does it complicate the script to bring zsh in? I don't mind keeping it separate since there are a lot of zsh users in the world, and it could be useful to do zsh-specific customizations some day.

There are a few things I don't understand. Comments inline:

[ -z "$ZSH_VERSION" ] || setopt PROMPT_SUBST 

Setting PROMPT_SUBST seems like trouble. It'll change how existing PS1's are interpreted. If we had a separate script for zsh could this be avoided? One of the most important goals in my life is to reduce the number of support queries I get.

itput(){
 for last; do :; done

  iterm2_print_user_vars() { :; }

  : ${iterm2_hostname:=${HOSTNAME:-$(hostname -f)}}

Just out of curiosity what does the leading colon do here?

  iterm2_print_state_data() {
    printf "\033]1337;RemoteHost=$USER@$iterm2_hostname\007"
    printf "\033]1337;CurrentDir=$PWD\007"

It's not safe to put user-supplied data in the format string of printf, since it could have %whatevers in it that confuse printf.

    iterm2_print_user_vars
  }

  iterm2_prompt_start(){        printf '\033]133;A\007'    ;}
  iterm2_prompt_end(){          printf '\033]133;B\007'    ;}
  iterm2_before_cmd_executes(){ printf "\033]133;C\007"    ;}
  iterm2_after_cmd_executes(){  printf "\033]133;D;$*\007" 
                                iterm2_print_state_data ;}

  iterm2_set_user_var() {
    printf "\033]1337;SetUserVar=%s=%s\007" "$1" $(printf "%s" "$2" | base64) ;}

  OPTIND=1
  while getopts "abcd" itput_opt; do
    case "$itput_opt" in
      a) iterm2_prompt_start ;;
      b) iterm2_prompt_end   ;;
      c) iterm2_before_cmd_executes ;;
      d) iterm2_after_cmd_executes "$last" ;;
    esac
  done
  shift $((OPTIND - 1))
}

_it2a(){ retstatus=$?; printf '\001'; return $retstatus ;}
_it2b(){ retstatus=$?; printf '\002'; return $retstatus ;}
_it2c(){ retstatus=$?; printf '\015'; return $retstatus ;} 

I don't understand the purpose of putting control characters in the prompt. What's this for?

it2_rotate_ps1(){
  [ x"$PS1" != x"$OLD_PS1" ] && PS1='$(_it2a)$(_it2c)$(itput -cd $?; itput -a)$(_it2b)'$PS1'$(_it2a)$(itput -b)$(_it2b)' && OLD_PS1=$PS1

I think this is wrong. The order this does is:

  1. iterm2_before_cmd_executes
  2. iterm2_after_cmd_executes
  3. iterm2_prompt_start
  4. $PS1
  5. iterm2_prompt_end
  6. User enters command
  7. Command produces output

The order it needs to be in is:

  1. iterm2_after_cmd_executes
  2. iterm2_prompt_start
  3. $PS1
  4. iterm2_prompt_end
  5. User enters command
  6. iterm2_before_cmd_executes
  7. Command produces output
}

it2_rotate_ps1

trap 'tput -T $TERM el1' INT # Squash bad return values on ctrl-C.

Why do we need this?

trap it2_rotate_ps1 DEBUG >/dev/null 2>&1 || readonly PS1 # Or, lose integration.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants