; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0018169 (gene) of Chayote v1 genome

Gene IDSed0018169
OrganismSechium edule (Chayote v1)
Descriptionrho-N domain-containing protein 1, chloroplastic-like
Genome locationLG01:71206717..71210664
RNA-Seq ExpressionSed0018169
SyntenySed0018169
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151986.1 rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis sativus]1.9e-14873.08Show/hide
Query:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M QAIH LP+N TGFGLSDSRC+PCSGVSGR +  S  SLCA H+    VKFRPLNCT +G SF CKASSGGHRRNPDF KQNR GFSRSRNRQNEE +S
Subjt:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS
        LD++DES+LL SKNGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKKVEA+GQTKGSETVDSLLKLLRKHS E+GKRSS     SS
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS

Query:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH
        +KD SFNHVKEN PYDE +G+S FGLS +LREKAQ        RPVS FQR+SPVPRVKYQPIY GE IV+ST+ +NSKGVK NGT+TGSQLK KVWT+ 
Subjt:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH

Query:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKH--EDHEDLNSLKLAELRAIAKSNGM
        ESERE WEELQSQ E EQEP PDQ+FE+EPEAE+Y+LEHE DE+ P               F+D V+D E+FAKH  ++HEDLNSLKLAELRAIAKS  +
Subjt:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKH--EDHEDLNSLKLAELRAIAKSNGM

Query:  KGFSKMKKSELVQLLS
        +GFSKMKKSELVQLLS
Subjt:  KGFSKMKKSELVQLLS

XP_008447421.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X7 [Cucumis melo]2.9e-14973.19Show/hide
Query:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M QAIH LP N TGFGLSDSRCLPCSGVSGR +  S RSLCA H     VKFRPLNCT +G SF CKASS GHRRNPDF KQNRQG+SRSRNRQNEE +S
Subjt:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS
        L+++DES+LLSS+NGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKK+EA+GQTKGSETVDSLLKLLRKH+ E+GKRSS   +  S
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS

Query:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH
        +KD SFNHVKEN PYDE +G+SIFGLS +LREKAQ+P G S  RP S FQR+SPVPRVKYQPIY GE IVDST+ +NSKG+KLNGTETGSQLKAKVWT+ 
Subjt:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH

Query:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKG
        ESERE WEELQSQ +TEQEP  DQ+FE+EPEAE+Y+LEHE DE+ P               F+D ++D E+F+KH +HE+LNSLKLAELRAIAKS  ++G
Subjt:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKG

Query:  FSKMKKSELVQLLS
        FSKMKKSELVQLLS
Subjt:  FSKMKKSELVQLLS

XP_023553735.1 rho-N domain-containing protein 1, chloroplastic-like [Cucurbita pepo subsp. pepo]4.5e-15072.97Show/hide
Query:  LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDES
        LPNN TGFGLSDSRCLPCSGV GR + VSSRSLC  H+  A VKFRP+NCTL+ ASF C+ASSGGHR+  DFSKQNR G+SRSRNRQNE+ DSL+ +DES
Subjt:  LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDES

Query:  NLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFN
        +LLSSKNGP L +SS +KSQ TATPG REKEIVELFRK+QAQLR RAA KEEKK+EA+GQTKGSETVDSLL LLR+HS E+GKR          KD SFN
Subjt:  NLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFN

Query:  HVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAW
        HVKEN PYDE K +SIFGLS+ LREKAQ+PTG S  RPVS FQRKSPVP VKYQPI  GE I++S D VNSKG+KLNGTETGSQLKAKVWT+ ESERE W
Subjt:  HVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAW

Query:  EELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP----------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELVQ
        EELQSQGETEQEP PDQ+FE+EPE ESYELEHEPDE+ P           FD+ V+   KF+K++DHEDLNSLKLAELRAIAKS  +KGFSKMKKSELV+
Subjt:  EELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP----------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELVQ

Query:  LLSATKV
        LLS  +V
Subjt:  LLSATKV

XP_038887988.1 rho-N domain-containing protein 1, chloroplastic-like isoform X1 [Benincasa hispida]4.3e-16177.91Show/hide
Query:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M QAIH LPNN+T FGLSDSRCLPCSGVSGR + VSSRSLCA H+  ARVKFRPLNCT +GASF CKASSGGHRRNPDFSKQNR GFSRSRNRQNEE +S
Subjt:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSS--ASDSS
        LD++DES+LLSSKNGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKK+EA+GQTKGSETVDSLLKLLRKHS E+GKRSS   S   
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSS--ASDSS

Query:  SSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWT
        SSSKDF+FNHVKEN  YDE KGTSIFGLSA+LREKAQ+PTG S  RPVS FQRKSPVPRVKYQPI+ GE IVDSTD VNSKGVKLNGTET SQLKAKVWT
Subjt:  SSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWT

Query:  QHES-EREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDE--------------IGPAFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNG
        + ES ER  WEELQSQGET+QEP  DQ++E+EPEAESYELEH+PDE              +   FDD V+D EKFAKH++HEDLNSLK+AELRAIAKS  
Subjt:  QHES-EREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDE--------------IGPAFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNG

Query:  MKGFSKMKKSELVQLLSATKV
        +KGFSKMKKSELVQLLS   V
Subjt:  MKGFSKMKKSELVQLLSATKV

XP_038887989.1 rho-N domain-containing protein 1, chloroplastic-like isoform X2 [Benincasa hispida]4.9e-15776.96Show/hide
Query:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M QAIH LPNN+T     DSRCLPCSGVSGR + VSSRSLCA H+  ARVKFRPLNCT +GASF CKASSGGHRRNPDFSKQNR GFSRSRNRQNEE +S
Subjt:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSS--ASDSS
        LD++DES+LLSSKNGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKK+EA+GQTKGSETVDSLLKLLRKHS E+GKRSS   S   
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSS--ASDSS

Query:  SSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWT
        SSSKDF+FNHVKEN  YDE KGTSIFGLSA+LREKAQ+PTG S  RPVS FQRKSPVPRVKYQPI+ GE IVDSTD VNSKGVKLNGTET SQLKAKVWT
Subjt:  SSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWT

Query:  QHES-EREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDE--------------IGPAFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNG
        + ES ER  WEELQSQGET+QEP  DQ++E+EPEAESYELEH+PDE              +   FDD V+D EKFAKH++HEDLNSLK+AELRAIAKS  
Subjt:  QHES-EREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDE--------------IGPAFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNG

Query:  MKGFSKMKKSELVQLLSATKV
        +KGFSKMKKSELVQLLS   V
Subjt:  MKGFSKMKKSELVQLLSATKV

TrEMBL top hitse value%identityAlignment
A0A0A0L7X8 Rho_N domain-containing protein9.1e-14973.08Show/hide
Query:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M QAIH LP+N TGFGLSDSRC+PCSGVSGR +  S  SLCA H+    VKFRPLNCT +G SF CKASSGGHRRNPDF KQNR GFSRSRNRQNEE +S
Subjt:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS
        LD++DES+LL SKNGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKKVEA+GQTKGSETVDSLLKLLRKHS E+GKRSS     SS
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS

Query:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH
        +KD SFNHVKEN PYDE +G+S FGLS +LREKAQ        RPVS FQR+SPVPRVKYQPIY GE IV+ST+ +NSKGVK NGT+TGSQLK KVWT+ 
Subjt:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH

Query:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKH--EDHEDLNSLKLAELRAIAKSNGM
        ESERE WEELQSQ E EQEP PDQ+FE+EPEAE+Y+LEHE DE+ P               F+D V+D E+FAKH  ++HEDLNSLKLAELRAIAKS  +
Subjt:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKH--EDHEDLNSLKLAELRAIAKSNGM

Query:  KGFSKMKKSELVQLLS
        +GFSKMKKSELVQLLS
Subjt:  KGFSKMKKSELVQLLS

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X71.4e-14973.19Show/hide
Query:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M QAIH LP N TGFGLSDSRCLPCSGVSGR +  S RSLCA H     VKFRPLNCT +G SF CKASS GHRRNPDF KQNRQG+SRSRNRQNEE +S
Subjt:  MFQAIH-LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS
        L+++DES+LLSS+NGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKK+EA+GQTKGSETVDSLLKLLRKH+ E+GKRSS   +  S
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSS

Query:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH
        +KD SFNHVKEN PYDE +G+SIFGLS +LREKAQ+P G S  RP S FQR+SPVPRVKYQPIY GE IVDST+ +NSKG+KLNGTETGSQLKAKVWT+ 
Subjt:  SKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQH

Query:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKG
        ESERE WEELQSQ +TEQEP  DQ+FE+EPEAE+Y+LEHE DE+ P               F+D ++D E+F+KH +HE+LNSLKLAELRAIAKS  ++G
Subjt:  ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKG

Query:  FSKMKKSELVQLLS
        FSKMKKSELVQLLS
Subjt:  FSKMKKSELVQLLS

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X41.1e-14670.79Show/hide
Query:  MFQAIH-LPNN--------------VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQG
        M QAIH LP N              V GFGLSDSRCLPCSGVSGR +  S RSLCA H     VKFRPLNCT +G SF CKASS GHRRNPDF KQNRQG
Subjt:  MFQAIH-LPNN--------------VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQG

Query:  FSRSRNRQNEEGDSLDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSD
        +SRSRNRQNEE +SL+++DES+LLSS+NGPLL +SST KSQATATPG REKEIVELFRKVQAQLR RAA KEEKK+EA+GQTKGSETVDSLLKLLRKH+ 
Subjt:  FSRSRNRQNEEGDSLDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSD

Query:  ERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGT
        E+GKRSS   +  S+KD SFNHVKEN PYDE +G+SIFGLS +LREKAQ+P G S  RP S FQR+SPVPRVKYQPIY GE IVDST+ +NSKG+KLNGT
Subjt:  ERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGT

Query:  ETGSQLKAKVWTQHESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKHEDHEDLNSLKL
        ETGSQLKAKVWT+ ESERE WEELQSQ +TEQEP  DQ+FE+EPEAE+Y+LEHE DE+ P               F+D ++D E+F+KH +HE+LNSLKL
Subjt:  ETGSQLKAKVWTQHESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEPDEIGP--------------AFDDIVEDCEKFAKHEDHEDLNSLKL

Query:  AELRAIAKSNGMKGFSKMKKSELVQLLS
        AELRAIAKS  ++GFSKMKKSELVQLLS
Subjt:  AELRAIAKSNGMKGFSKMKKSELVQLLS

A0A6J1HJC1 rho-N domain-containing protein 1, chloroplastic-like2.2e-14772.79Show/hide
Query:  LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDES
        LPNN TGFGLSDSRCLPCSGV GR + VSSRSLC  H+  A VKFRP+NCTL+ ASF C+ASSGGHR+  DFSKQNR G+SRSRNRQNE+ DSL+ +DES
Subjt:  LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDES

Query:  NLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFN
        +LLSSKNGP L +SS +KSQ TATPG REKEIVELFRK+QAQLR RAA KEEKK+EA+GQTKGSETVDSLL LLR+HS E+GKR          KD S N
Subjt:  NLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFN

Query:  HVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAW
        HVKEN PYDE K +SIFGLS+ LREKAQ+P G S  RPVS FQRKSPVP VKYQPI  GE IV+S D VNSKG+KLNGTETGSQLKAKVWT+ ESERE W
Subjt:  HVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAW

Query:  EELQSQGETEQEPGPDQDFEIEPEAE-SYELEHEPDEIGP----------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELV
        EELQSQGETEQEP PDQ+FE+EPE E SYELEHEPDE+ P           FD+ V+  EKF+K++DHEDLNSLKLAELRAIAKS  +KGFSKMKKSELV
Subjt:  EELQSQGETEQEPGPDQDFEIEPEAE-SYELEHEPDEIGP----------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELV

Query:  QLLSATKV
        +LLS  +V
Subjt:  QLLSATKV

A0A6J1HST4 rho-N domain-containing protein 1, chloroplastic-like5.5e-14672.44Show/hide
Query:  LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDES
        LPNN TGFGLSDSRCLPCSGV GR + VSSRSLC  H+  A VKFRP+NCTL+ ASF C+ASSGGHR+  DFSKQNR G+SRSRNRQNE+ DSL+ +DES
Subjt:  LPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDES

Query:  NLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFN
        +LLSSKNGP L +SS +KSQ TATPG REKEIVELFRK+QAQLR RAA KEEKK+EA+GQTKGSETVDSLL LLR+HS E+GKR          KD SFN
Subjt:  NLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFN

Query:  HVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAW
        HVKEN PYDE K +SIFGLS+ LREKAQ+PTG S  RPVS FQRKSPVP VKYQP+   E IV+S D VNSKG+KLNGTETGSQLKAKVWT+ ESERE W
Subjt:  HVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAW

Query:  EELQSQGETEQ--EPGPDQDFEIEPEAE-SYELEHEPDEIGP----------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSE
        EELQSQGETEQ  EP PDQ+FE+EPE E SYELEHEPDE+ P           FD+ V+  EKF+K +DHEDLNSLKLAELRAIAKS  +KGFSKMKKSE
Subjt:  EELQSQGETEQ--EPGPDQDFEIEPEAE-SYELEHEPDEIGP----------AFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSE

Query:  LVQLLSATKV
        LV+LLS  +V
Subjt:  LVQLLSATKV

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-737.2e-3436.94Show/hide
Query:  RPLNCTLVGASFMCKASSGGHR-RNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDE--SNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQ
        RP+  +LV     C A+   HR R+ D ++  + G +R +++  +E D  +++DE  ++++SSKNGP + L+S ++ QAT+ PG REKEIVELF++VQAQ
Subjt:  RPLNCTLVGASFMCKASSGGHR-RNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDE--SNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQ

Query:  LRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHS-DERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSK
        LR R   KEEKK E         +VDSLL LLRKHS D+R K       S   K+ S +  K ++    ++ +SIF +    +E+ + P   +  RP S 
Subjt:  LRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHS-DERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSK

Query:  FQRKSPVPRVKYQPIYR--GERIVDS-TDRV-NSKGVKLNGTETGSQLKAKVWTQH---ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEP
        F+R+SPVP VK+QP+     ER++++  D V  +K    N   T        +  +   E E  + ++L    + E    PD     EP  E Y+   EP
Subjt:  FQRKSPVPRVKYQPIYR--GERIVDS-TDRV-NSKGVKLNGTETGSQLKAKVWTQH---ESEREAWEELQSQGETEQEPGPDQDFEIEPEAESYELEHEP

Query:  DEIGPAFDDIVEDCEKFAKHE-DHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELVQLLS
            P+   I E  +   K      DL++LK+ ELR +AKS G+KG+SKMKK++LV+LLS
Subjt:  DEIGPAFDDIVEDCEKFAKHE-DHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELVQLLS

Q94K75 Rho-N domain-containing protein 1, chloroplastic4.9e-5943.33Show/hide
Query:  MFQAIHLPNN-VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M    HL ++ V G+ LSDSRC   S VS RT  +   S C  HK   R+K  P       +SF+C+ASSGG+RRNPDFS+ N+ G+ R  NRQ+   + 
Subjt:  MFQAIHLPNN-VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQTKGSETVDSLLKLLRKHSDERGKRSSASDS
         D ++ S++LSS+NGPL  LSS+ K QAT++PG REKEIVELFRKVQAQLR RAA  KEEKK+E  +KGQ K SETVDSLLKLLRKHS E+ KR  +  S
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQTKGSETVDSLLKLLRKHSDERGKRSSASDS

Query:  SSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVW
        S            +    D++  T     S +     +D    S  RP S F+RKSPVPR +  P Y  E   D +          + + T +Q K  V 
Subjt:  SSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVW

Query:  TQHESEREAWEELQSQGETEQEPGP-----DQDFEIEPEAESYELEHEPDEI------------------GPAFDDIVEDCEKFAKHEDHEDLNSLKLAE
           E E E   E + + E E EPGP     + D E++PE+ S+  E E D++                    + DD  ED ++ A+ E  +DL+ LKL E
Subjt:  TQHESEREAWEELQSQGETEQEPGP-----DQDFEIEPEAESYELEHEPDEI------------------GPAFDDIVEDCEKFAKHEDHEDLNSLKLAE

Query:  LRAIAKSNGMKGFSKMKKSELVQLLSA
        LR IAKS G+KG SKMKK+ELV+LL +
Subjt:  LRAIAKSNGMKGFSKMKKSELVQLLSA

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor3.5e-6043.33Show/hide
Query:  MFQAIHLPNN-VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M    HL ++ V G+ LSDSRC   S VS RT  +   S C  HK   R+K  P       +SF+C+ASSGG+RRNPDFS+ N+ G+ R  NRQ+   + 
Subjt:  MFQAIHLPNN-VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQTKGSETVDSLLKLLRKHSDERGKRSSASDS
         D ++ S++LSS+NGPL  LSS+ K QAT++PG REKEIVELFRKVQAQLR RAA  KEEKK+E  +KGQ K SETVDSLLKLLRKHS E+ KR  +  S
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQTKGSETVDSLLKLLRKHSDERGKRSSASDS

Query:  SSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVW
        S            +    D++  T     S +     +D    S  RP S F+RKSPVPR +  P Y  E   D +          + + T +Q K  V 
Subjt:  SSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVW

Query:  TQHESEREAWEELQSQGETEQEPGP-----DQDFEIEPEAESYELEHEPDEI------------------GPAFDDIVEDCEKFAKHEDHEDLNSLKLAE
           E E E   E + + E E EPGP     + D E++PE+ S+  E E D++                    + DD  ED ++ A+ E  +DL+ LKL E
Subjt:  TQHESEREAWEELQSQGETEQEPGP-----DQDFEIEPEAESYELEHEPDEI------------------GPAFDDIVEDCEKFAKHEDHEDLNSLKLAE

Query:  LRAIAKSNGMKGFSKMKKSELVQLLSA
        LR IAKS G+KG SKMKK+ELV+LL +
Subjt:  LRAIAKSNGMKGFSKMKKSELVQLLSA

AT1G06190.2 Rho termination factor5.4e-4547.83Show/hide
Query:  MFQAIHLPNN-VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS
        M    HL ++ V G+ LSDSRC   S VS RT  +   S C  HK   R+K  P       +SF+C+ASSGG+RRNPDFS+ N+ G+ R  NRQ+   + 
Subjt:  MFQAIHLPNN-VTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDS

Query:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQTKGSETVDSLLKLLRKHSDERGKRSSASDS
         D ++ S++LSS+NGPL  LSS+ K QAT++PG REKEIVELFRKVQAQLR RAA  KEEKK+E  +KGQ K SETVDSLLKLLRKHS E+ KR  +  S
Subjt:  LDSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQTKGSETVDSLLKLLRKHSDERGKRSSASDS

Query:  SSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDST
        S            +    D++  T     S +     +D    S  RP S F+RKSPVPR +  P Y  E   D +
Subjt:  SSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDST

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism1.7e-2238.24Show/hide
Query:  HRR-NPDFSKQNRQGFSRSRNRQNEEGDSL-DSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQ
        HRR NPDFS+ N+ GF R RNR+NE+ D L D   E ++LSSKN                     EKEIVELF+KVQ QLR RAA  KEEKK E  +KGQ
Subjt:  HRR-NPDFSKQNRQGFSRSRNRQNEEGDSL-DSLDESNLLSSKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAAT-KEEKKVE--AKGQ

Query:  -TKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRG
          K SETVDSLLKLLRKHS E+ K+  ++ +S          ++ +    E +  S    S+    + +D       RP S F+R SPVPR K Q  Y  
Subjt:  -TKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGTSIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRG

Query:  ERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAWEELQSQGETEQEPGPDQDFEI---EPEAESYELEHEPD-EIGPAFDDIVEDCEKFAKHE
        E I D                      +  WTQ +      ++++S+ E E EP P+   E    EPEAE YE E EP+  I  +  ++  +     + E
Subjt:  ERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAWEELQSQGETEQEPGPDQDFEI---EPEAESYELEHEPD-EIGPAFDDIVEDCEKFAKHE

Query:  DHEDLN
        D ED N
Subjt:  DHEDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCAAGCCATACATCTTCCCAACAATGTTACAGGGTTTGGACTGTCAGATAGCAGATGCCTACCATGCTCTGGAGTTTCAGGACGAACATCCATGGTCTCTTCCCG
CTCTTTATGTGCCGGACATAAAACCAAAGCGCGGGTCAAATTTCGACCCTTAAACTGTACTTTGGTGGGGGCTTCTTTTATGTGCAAAGCCAGCTCAGGAGGTCATAGGA
GAAACCCAGATTTCTCAAAGCAAAATAGGCAGGGCTTCTCAAGAAGTAGAAATAGGCAAAATGAGGAGGGAGATAGCCTTGATAGTCTCGACGAATCCAACTTATTATCG
TCGAAAAATGGACCATTACTTCCCCTCTCTAGCACTGCAAAATCCCAGGCCACTGCTACCCCAGGCCATAGGGAGAAGGAAATTGTTGAACTTTTCAGGAAGGTTCAAGC
TCAGCTCCGGGTGCGAGCTGCAACCAAGGAAGAAAAGAAGGTCGAAGCGAAAGGACAAACAAAAGGGAGCGAAACAGTGGATTCTCTTCTTAAACTATTACGAAAGCATT
CAGACGAGCGTGGGAAGAGAAGCAGTGCTAGTGACAGCAGCAGCAGCAGCAAGGATTTCAGTTTTAATCATGTCAAAGAAAATAGTCCATATGATGAAGAGAAAGGCACA
AGCATTTTTGGCCTAAGTGCCAGCTTGAGAGAGAAGGCCCAAGATCCAACAGGACCTTCTTCTGGTAGACCCGTATCAAAATTTCAACGCAAATCCCCCGTGCCTCGGGT
GAAATACCAGCCAATTTACCGAGGGGAACGTATCGTTGACTCCACCGACAGGGTGAATTCAAAGGGAGTGAAACTGAATGGAACCGAGACAGGTTCTCAACTAAAGGCAA
AGGTATGGACTCAACACGAGTCGGAACGAGAGGCCTGGGAAGAGCTGCAATCACAAGGAGAGACGGAGCAGGAGCCAGGGCCAGACCAAGATTTTGAGATAGAGCCAGAG
GCTGAATCATATGAGCTAGAGCATGAGCCTGATGAGATAGGACCTGCGTTCGATGACATTGTTGAAGACTGTGAGAAATTTGCAAAGCACGAGGATCATGAGGACTTGAA
CTCATTGAAGCTCGCTGAGCTGAGGGCAATTGCCAAATCCAACGGTATGAAAGGCTTCTCAAAGATGAAGAAAAGCGAGCTCGTGCAGTTGCTAAGCGCGACTAAGGTAT
GA
mRNA sequenceShow/hide mRNA sequence
GGCAAAGAAGATCGAACCGCAAAAGCGAAAAATCATCGAAGAAATGCGAATCCAAAACCCTTCTTCAGCGCACTAAAACCTTTTTCTTGAAGCTGTATAATCGAGCAATG
TTTCAAGCCATACATCTTCCCAACAATGTTACAGGGTTTGGACTGTCAGATAGCAGATGCCTACCATGCTCTGGAGTTTCAGGACGAACATCCATGGTCTCTTCCCGCTC
TTTATGTGCCGGACATAAAACCAAAGCGCGGGTCAAATTTCGACCCTTAAACTGTACTTTGGTGGGGGCTTCTTTTATGTGCAAAGCCAGCTCAGGAGGTCATAGGAGAA
ACCCAGATTTCTCAAAGCAAAATAGGCAGGGCTTCTCAAGAAGTAGAAATAGGCAAAATGAGGAGGGAGATAGCCTTGATAGTCTCGACGAATCCAACTTATTATCGTCG
AAAAATGGACCATTACTTCCCCTCTCTAGCACTGCAAAATCCCAGGCCACTGCTACCCCAGGCCATAGGGAGAAGGAAATTGTTGAACTTTTCAGGAAGGTTCAAGCTCA
GCTCCGGGTGCGAGCTGCAACCAAGGAAGAAAAGAAGGTCGAAGCGAAAGGACAAACAAAAGGGAGCGAAACAGTGGATTCTCTTCTTAAACTATTACGAAAGCATTCAG
ACGAGCGTGGGAAGAGAAGCAGTGCTAGTGACAGCAGCAGCAGCAGCAAGGATTTCAGTTTTAATCATGTCAAAGAAAATAGTCCATATGATGAAGAGAAAGGCACAAGC
ATTTTTGGCCTAAGTGCCAGCTTGAGAGAGAAGGCCCAAGATCCAACAGGACCTTCTTCTGGTAGACCCGTATCAAAATTTCAACGCAAATCCCCCGTGCCTCGGGTGAA
ATACCAGCCAATTTACCGAGGGGAACGTATCGTTGACTCCACCGACAGGGTGAATTCAAAGGGAGTGAAACTGAATGGAACCGAGACAGGTTCTCAACTAAAGGCAAAGG
TATGGACTCAACACGAGTCGGAACGAGAGGCCTGGGAAGAGCTGCAATCACAAGGAGAGACGGAGCAGGAGCCAGGGCCAGACCAAGATTTTGAGATAGAGCCAGAGGCT
GAATCATATGAGCTAGAGCATGAGCCTGATGAGATAGGACCTGCGTTCGATGACATTGTTGAAGACTGTGAGAAATTTGCAAAGCACGAGGATCATGAGGACTTGAACTC
ATTGAAGCTCGCTGAGCTGAGGGCAATTGCCAAATCCAACGGTATGAAAGGCTTCTCAAAGATGAAGAAAAGCGAGCTCGTGCAGTTGCTAAGCGCGACTAAGGTATGAT
AAATACTAGTTGGACATGAACTTCTGGGATATAGGATGATGTTTAGGTGCAATTTTTTCTATTGCTTGAATAGAATCTCAATCTGTATGACCAGTTTTGACTTTGAATTA
GCTTTTCAAACCTTTTCTCAGATTGTATATCTGGCCAACCTATTTAGATATGGATCATTTTTAGCATTAATTTCTGATCATTCATATATAGATCTCATTTTGATTGTCAA
TA
Protein sequenceShow/hide protein sequence
MFQAIHLPNNVTGFGLSDSRCLPCSGVSGRTSMVSSRSLCAGHKTKARVKFRPLNCTLVGASFMCKASSGGHRRNPDFSKQNRQGFSRSRNRQNEEGDSLDSLDESNLLS
SKNGPLLPLSSTAKSQATATPGHREKEIVELFRKVQAQLRVRAATKEEKKVEAKGQTKGSETVDSLLKLLRKHSDERGKRSSASDSSSSSKDFSFNHVKENSPYDEEKGT
SIFGLSASLREKAQDPTGPSSGRPVSKFQRKSPVPRVKYQPIYRGERIVDSTDRVNSKGVKLNGTETGSQLKAKVWTQHESEREAWEELQSQGETEQEPGPDQDFEIEPE
AESYELEHEPDEIGPAFDDIVEDCEKFAKHEDHEDLNSLKLAELRAIAKSNGMKGFSKMKKSELVQLLSATKV