; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022448 (gene) of Snake gourd v1 genome

Gene IDTan0022448
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionrho-N domain-containing protein 1, chloroplastic-like
Genome locationLG07:73537728..73541201
RNA-Seq ExpressionTan0022448
SyntenyTan0022448
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal
IPR036269 - Rho termination factor, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151987.1 rho-N domain-containing protein 1, chloroplastic isoform X2 [Cucumis sativus]4.9e-16881.01Show/hide
Query:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD
        MSQAIHLLP+N TDSRC+PCSGVS R A+ S  SLCAEHR N  VKFR LNCTSLG +FTCKASSGGHRRNPDF KQNR+GFSRSRNRQNEER+SL+N+D
Subjt:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD

Query:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDF
        ESDLL SKNGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRS  SG G SS+KD 
Subjt:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDF

Query:  SVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESER
        S NHVKENGPYDEG+G+S FGLS +LREKAQ        RPVSNFQR+SPVPRVKYQPIYPGES V+ST+GMNSKGVK NG +TGSQLK KVWTRQESER
Subjt:  SVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESER

Query:  EAWEELQSQ--GGQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKH--EDHEDLNSLKLTELKAMAKSHGMKGF
        E WEELQSQ    QEPEPDQEFELEPE E+YD LEHE DEME ELVNLLGVSSDVDDTF+D+VKD E+FAKH  ++HEDLNSLKL EL+A+AKS  ++GF
Subjt:  EAWEELQSQ--GGQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKH--EDHEDLNSLKLTELKAMAKSHGMKGF

Query:  SKMKKSELVQLLSEAR
        SKMKKSELVQLLS  +
Subjt:  SKMKKSELVQLLSEAR

XP_008447421.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X7 [Cucumis melo]1.4e-16780.62Show/hide
Query:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS
        MSQAIHLLP N T     DSRCLPCSGVS R A+ S  SLCAEH  N  VKFR LNCTSLG +FTCKASS GHRRNPDF KQNR G+SRSRNRQNEER+S
Subjt:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS

Query:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSS
        LEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG+G    
Subjt:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSS

Query:  SSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTR
        S+KD S NHVKENGPYDEG+G+SIFGLS +LREKAQEP G SF RP SNFQR+SPVPRVKYQPIYPGES VDST+GMNSKG+KLNG ETGSQLKAKVWTR
Subjt:  SSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTR

Query:  QESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGM
        QESERE WEELQSQ    QEPE DQEFE+EPE E+YD LEHE DEME ELVNLLGVSSD+DDTF+D++KD E+F+KH +HE+LNSLKL EL+A+AKS  +
Subjt:  QESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGM

Query:  KGFSKMKKSELVQLLSEA
        +GFSKMKKSELVQLLS +
Subjt:  KGFSKMKKSELVQLLSEA

XP_008447423.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X9 [Cucumis melo]2.0e-16981.6Show/hide
Query:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD
        MSQAIHLLP N TDSRCLPCSGVS R A+ S  SLCAEH  N  VKFR LNCTSLG +FTCKASS GHRRNPDF KQNR G+SRSRNRQNEER+SLEN+D
Subjt:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD

Query:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDF
        ESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG+G    S+KD 
Subjt:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDF

Query:  SVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESER
        S NHVKENGPYDEG+G+SIFGLS +LREKAQEP G SF RP SNFQR+SPVPRVKYQPIYPGES VDST+GMNSKG+KLNG ETGSQLKAKVWTRQESER
Subjt:  SVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESER

Query:  EAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSK
        E WEELQSQ    QEPE DQEFE+EPE E+YD LEHE DEME ELVNLLGVSSD+DDTF+D++KD E+F+KH +HE+LNSLKL EL+A+AKS  ++GFSK
Subjt:  EAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSK

Query:  MKKSELVQLLSEA
        MKKSELVQLLS +
Subjt:  MKKSELVQLLSEA

XP_038887988.1 rho-N domain-containing protein 1, chloroplastic-like isoform X1 [Benincasa hispida]9.4e-18085.55Show/hide
Query:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGA-FTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS
        MSQAIHLLPNN+T     DSRCLPCSGVS R A+VSS SLCAEHR +A+VKFR LNCTSLGA FTCKASSGGHRRNPDFSKQNR+GFSRSRNRQNEER+S
Subjt:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGA-FTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS

Query:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGS-GS
        L+N+DESDLLSSKNGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG GS G 
Subjt:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGS-GS

Query:  SSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWT
        SSSKDF+ NHVKENG YDEGKGTSIFGLSA+LREKAQEPTG SFSRPVSNFQRKSPVPRVKYQPI+PGES VDSTDG+NSKGVKLNG ET SQLKAKVWT
Subjt:  SSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWT

Query:  RQES-EREAWEELQSQG--GQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSH
        RQES ER  WEELQSQG   QEPE DQE+ELEPE ESY ELEH+PDE ESELVNLLGVSSD+DDTFDD+VKD EKFAKH++HEDLNSLK+ EL+A+AKS 
Subjt:  RQES-EREAWEELQSQG--GQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSH

Query:  GMKGFSKMKKSELVQLLSEARV
         +KGFSKMKKSELVQLLS+  V
Subjt:  GMKGFSKMKKSELVQLLSEARV

XP_038887989.1 rho-N domain-containing protein 1, chloroplastic-like isoform X2 [Benincasa hispida]1.3e-18186.57Show/hide
Query:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGA-FTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD
        MSQAIHLLPNN+TDSRCLPCSGVS R A+VSS SLCAEHR +A+VKFR LNCTSLGA FTCKASSGGHRRNPDFSKQNR+GFSRSRNRQNEER+SL+N+D
Subjt:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGA-FTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD

Query:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGS-GSSSSKD
        ESDLLSSKNGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG GS G SSSKD
Subjt:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGS-GSSSSKD

Query:  FSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQES-
        F+ NHVKENG YDEGKGTSIFGLSA+LREKAQEPTG SFSRPVSNFQRKSPVPRVKYQPI+PGES VDSTDG+NSKGVKLNG ET SQLKAKVWTRQES 
Subjt:  FSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQES-

Query:  EREAWEELQSQG--GQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGF
        ER  WEELQSQG   QEPE DQE+ELEPE ESY ELEH+PDE ESELVNLLGVSSD+DDTFDD+VKD EKFAKH++HEDLNSLK+ EL+A+AKS  +KGF
Subjt:  EREAWEELQSQG--GQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGF

Query:  SKMKKSELVQLLSEARV
        SKMKKSELVQLLS+  V
Subjt:  SKMKKSELVQLLSEARV

TrEMBL top hitse value%identityAlignment
A0A0A0L7X8 Rho_N domain-containing protein1.7e-16680.05Show/hide
Query:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS
        MSQAIHLLP+N T     DSRC+PCSGVS R A+ S  SLCAEHR N  VKFR LNCTSLG +FTCKASSGGHRRNPDF KQNR+GFSRSRNRQNEER+S
Subjt:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS

Query:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSS
        L+N+DESDLL SKNGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRS  SG G S
Subjt:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSS

Query:  SSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTR
        S+KD S NHVKENGPYDEG+G+S FGLS +LREKAQ        RPVSNFQR+SPVPRVKYQPIYPGES V+ST+GMNSKGVK NG +TGSQLK KVWTR
Subjt:  SSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTR

Query:  QESEREAWEELQSQ--GGQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKH--EDHEDLNSLKLTELKAMAKSH
        QESERE WEELQSQ    QEPEPDQEFELEPE E+YD LEHE DEME ELVNLLGVSSDVDDTF+D+VKD E+FAKH  ++HEDLNSLKL EL+A+AKS 
Subjt:  QESEREAWEELQSQ--GGQEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKH--EDHEDLNSLKLTELKAMAKSH

Query:  GMKGFSKMKKSELVQLLSEAR
         ++GFSKMKKSELVQLLS  +
Subjt:  GMKGFSKMKKSELVQLLSEAR

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X76.8e-16880.62Show/hide
Query:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS
        MSQAIHLLP N T     DSRCLPCSGVS R A+ S  SLCAEH  N  VKFR LNCTSLG +FTCKASS GHRRNPDF KQNR G+SRSRNRQNEER+S
Subjt:  MSQAIHLLPNNVT-----DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDS

Query:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSS
        LEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG+G    
Subjt:  LENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSS

Query:  SSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTR
        S+KD S NHVKENGPYDEG+G+SIFGLS +LREKAQEP G SF RP SNFQR+SPVPRVKYQPIYPGES VDST+GMNSKG+KLNG ETGSQLKAKVWTR
Subjt:  SSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTR

Query:  QESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGM
        QESERE WEELQSQ    QEPE DQEFE+EPE E+YD LEHE DEME ELVNLLGVSSD+DDTF+D++KD E+F+KH +HE+LNSLKL EL+A+AKS  +
Subjt:  QESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGM

Query:  KGFSKMKKSELVQLLSEA
        +GFSKMKKSELVQLLS +
Subjt:  KGFSKMKKSELVQLLSEA

A0A1S3BI13 rho-N domain-containing protein 1, chloroplastic isoform X99.6e-17081.6Show/hide
Query:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD
        MSQAIHLLP N TDSRCLPCSGVS R A+ S  SLCAEH  N  VKFR LNCTSLG +FTCKASS GHRRNPDF KQNR G+SRSRNRQNEER+SLEN+D
Subjt:  MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLD

Query:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDF
        ESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG+G    S+KD 
Subjt:  ESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDF

Query:  SVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESER
        S NHVKENGPYDEG+G+SIFGLS +LREKAQEP G SF RP SNFQR+SPVPRVKYQPIYPGES VDST+GMNSKG+KLNG ETGSQLKAKVWTRQESER
Subjt:  SVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESER

Query:  EAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSK
        E WEELQSQ    QEPE DQEFE+EPE E+YD LEHE DEME ELVNLLGVSSD+DDTF+D++KD E+F+KH +HE+LNSLKL EL+A+AKS  ++GFSK
Subjt:  EAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSK

Query:  MKKSELVQLLSEA
        MKKSELVQLLS +
Subjt:  MKKSELVQLLSEA

A0A1S3BIA8 rho-N domain-containing protein 1, chloroplastic isoform X67.6e-16778.92Show/hide
Query:  MSQAIHLLPNN--------------VTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSR
        MSQAIHLLP N              V DSRCLPCSGVS R A+ S  SLCAEH  N  VKFR LNCTSLG +FTCKASS GHRRNPDF KQNR G+SRSR
Subjt:  MSQAIHLLPNN--------------VTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYGFSRSR

Query:  NRQNEERDSLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR
        NRQNEER+SLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKR
Subjt:  NRQNEERDSLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR

Query:  SSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGS
        SSG+G    S+KD S NHVKENGPYDEG+G+SIFGLS +LREKAQEP G SF RP SNFQR+SPVPRVKYQPIYPGES VDST+GMNSKG+KLNG ETGS
Subjt:  SSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGS

Query:  QLKAKVWTRQESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTEL
        QLKAKVWTRQESERE WEELQSQ    QEPE DQEFE+EPE E+YD LEHE DEME ELVNLLGVSSD+DDTF+D++KD E+F+KH +HE+LNSLKL EL
Subjt:  QLKAKVWTRQESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTEL

Query:  KAMAKSHGMKGFSKMKKSELVQLLSEA
        +A+AKS  ++GFSKMKKSELVQLLS +
Subjt:  KAMAKSHGMKGFSKMKKSELVQLLSEA

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X42.9e-16678.01Show/hide
Query:  MSQAIHLLPNNVT-------------------DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYG
        MSQAIHLLP N T                   DSRCLPCSGVS R A+ S  SLCAEH  N  VKFR LNCTSLG +FTCKASS GHRRNPDF KQNR G
Subjt:  MSQAIHLLPNNVT-------------------DSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLG-AFTCKASSGGHRRNPDFSKQNRYG

Query:  FSRSRNRQNEERDSLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSV
        +SRSRNRQNEER+SLEN+DESDLLSS+NGPLLS+SSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRQNEERDSLENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNG
        EQGKRSSG+G    S+KD S NHVKENGPYDEG+G+SIFGLS +LREKAQEP G SF RP SNFQR+SPVPRVKYQPIYPGES VDST+GMNSKG+KLNG
Subjt:  EQGKRSSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNG

Query:  AETGSQLKAKVWTRQESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSL
         ETGSQLKAKVWTRQESERE WEELQSQ    QEPE DQEFE+EPE E+YD LEHE DEME ELVNLLGVSSD+DDTF+D++KD E+F+KH +HE+LNSL
Subjt:  AETGSQLKAKVWTRQESEREAWEELQSQGG--QEPEPDQEFELEPEPESYDELEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSL

Query:  KLTELKAMAKSHGMKGFSKMKKSELVQLLSEA
        KL EL+A+AKS  ++GFSKMKKSELVQLLS +
Subjt:  KLTELKAMAKSHGMKGFSKMKKSELVQLLSEA

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-731.1e-4038.32Show/hide
Query:  AFTCKASSGGHR-RNPDFSKQNRYGFSRSRNRQNEERDSLENLDE--SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEE
        +  C A+   HR R+ D ++  + G +R +++  +E+D  EN+DE  +D++SSKNGP +SL+S  + QAT+ PG REKEIVELF++VQAQLR R   KEE
Subjt:  AFTCKASSGGHR-RNPDFSKQNRYGFSRSRNRQNEERDSLENLDE--SDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEE

Query:  KKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPR
        KK E Q + +G   +VDSLL LLRKHSV+Q ++       S   K+ SV+  K +      + +SIF +    +E+ ++P  ++F RP SNF+R+SPVP 
Subjt:  KKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPR

Query:  VKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESEREAWEELQSQGGQEPEPDQEFELEPEPESYDELEH----EPDEMESELVN----
        VK+QP+    +NVD+   +N+    +  A+   + KA           A +E  S    EP       +EPE  S D+L+H    EPD  +++  +    
Subjt:  VKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESEREAWEELQSQGGQEPEPDQEFELEPEPESYDELEH----EPDEMESELVN----

Query:  --LLGVSS--DVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSKMKKSELVQLLS
           L + S   +D++ D  +K            DL++LK+TEL+ +AKS G+KG+SKMKK++LV+LLS
Subjt:  --LLGVSS--DVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSKMKKSELVQLLS

Q94K75 Rho-N domain-containing protein 1, chloroplastic1.3e-5942.69Show/hide
Query:  MSQAIHLLPNNV-----TDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSL
        MS   HL  + V     +DSRC   S VS RT  +   S C +H+       RL +  +  +F C+ASSGG+RRNPDFS+ N++G+ R  NRQ+  R+  
Subjt:  MSQAIHLLPNNV-----TDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSL

Query:  ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR------S
        + ++ SD+LSS+NGPL +LSS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR      S
Subjt:  ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR------S

Query:  SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQ
         G   G +  K     ++  +G  D                       SSF+RP S+F+RKSPVPR +  P Y  E+  D +            + T +Q
Subjt:  SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQ

Query:  LKAKVWTRQESEREAWEELQSQGGQEPEP-------DQEFELEPEPESYDELEHEPD---EMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLN
         K  V    E E E   E + +   E EP       + + EL+PE  S+ + E + D   ++ S+   +L V SD D++ DD  +D ++ A+ E  +DL+
Subjt:  LKAKVWTRQESEREAWEELQSQGGQEPEP-------DQEFELEPEPESYDELEHEPD---EMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLN

Query:  SLKLTELKAMAKSHGMKGFSKMKKSELVQLL
         LKL EL+ +AKS G+KG SKMKK+ELV+LL
Subjt:  SLKLTELKAMAKSHGMKGFSKMKKSELVQLL

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor9.4e-6142.69Show/hide
Query:  MSQAIHLLPNNV-----TDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSL
        MS   HL  + V     +DSRC   S VS RT  +   S C +H+       RL +  +  +F C+ASSGG+RRNPDFS+ N++G+ R  NRQ+  R+  
Subjt:  MSQAIHLLPNNV-----TDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSL

Query:  ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR------S
        + ++ SD+LSS+NGPL +LSS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR      S
Subjt:  ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR------S

Query:  SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQ
         G   G +  K     ++  +G  D                       SSF+RP S+F+RKSPVPR +  P Y  E+  D +            + T +Q
Subjt:  SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQ

Query:  LKAKVWTRQESEREAWEELQSQGGQEPEP-------DQEFELEPEPESYDELEHEPD---EMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLN
         K  V    E E E   E + +   E EP       + + EL+PE  S+ + E + D   ++ S+   +L V SD D++ DD  +D ++ A+ E  +DL+
Subjt:  LKAKVWTRQESEREAWEELQSQGGQEPEP-------DQEFELEPEPESYDELEHEPD---EMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLN

Query:  SLKLTELKAMAKSHGMKGFSKMKKSELVQLL
         LKL EL+ +AKS G+KG SKMKK+ELV+LL
Subjt:  SLKLTELKAMAKSHGMKGFSKMKKSELVQLL

AT1G06190.2 Rho termination factor5.4e-4846.81Show/hide
Query:  MSQAIHLLPNNV-----TDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSL
        MS   HL  + V     +DSRC   S VS RT  +   S C +H+       RL +  +  +F C+ASSGG+RRNPDFS+ N++G+ R  NRQ+  R+  
Subjt:  MSQAIHLLPNNV-----TDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSL

Query:  ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR------S
        + ++ SD+LSS+NGPL +LSS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR      S
Subjt:  ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR------S

Query:  SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDST
         G   G +  K     ++  +G  D                       SSF+RP S+F+RKSPVPR +  P Y  E+  D +
Subjt:  SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDST

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism1.6e-2337.5Show/hide
Query:  HRR-NPDFSKQNRYGFSRSRNRQNEERDSL-ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK
        HRR NPDFS+ N++GF R RNR+NE++D L +   E D+LSSKN                     EKEIVELF+KVQ QLR R AA KEEKK E   + +
Subjt:  HRR-NPDFSKQNRYGFSRSRNRQNEERDSL-ENLDESDLLSSKNGPLLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK

Query:  G---SETVDSLLKLLRKHSVEQGKRS-SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIY
        G   SETVDSLLKLLRKHS EQ K+  S   S     +D   +  + +              S+    + ++   + F+RP S+F+R SPVPR K Q  Y
Subjt:  G---SETVDSLLKLLRKHSVEQGKRS-SGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFGLSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIY

Query:  PGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESEREAWEELQSQGGQEPEPDQEFELEPEP-----ESYDELE-----HEPDEMESELVNLLGVS
          E+  D              + T +Q K +V +R E E E   E  ++   EPEP+ E+E E EP     ES  EL+      E DE E +   ++   
Subjt:  PGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESEREAWEELQSQGGQEPEPDQEFELEPEP-----ESYDELE-----HEPDEMESELVNLLGVS

Query:  SDVDDTFDDNVK
        SD D++ + + +
Subjt:  SDVDDTFDDNVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGCCATACATCTTCTTCCTAACAACGTTACAGATAGCAGATGCCTACCATGCTCTGGAGTTTCAGTACGGACAGCCACTGTCTCTTCTCTCTCTTTATGTGC
TGAACATAGAACCAATGCACAGGTCAAATTCCGACTCCTAAACTGTACTTCGTTGGGGGCTTTTACGTGTAAAGCCAGCTCGGGAGGTCATAGGAGAAACCCAGATTTCT
CAAAGCAAAATAGGTATGGCTTCTCAAGAAGTAGAAATAGACAAAATGAGGAGAGAGATAGCCTTGAAAATCTTGATGAATCTGATTTATTATCGTCTAAGAATGGACCA
TTACTTTCCCTCTCTAGCACCCCAAAATCCCAGGCCACTGCTACCCCAGGCCCTAGGGAGAAGGAAATTGTTGAACTTTTCAGGAAGGTTCAAGCTCAACTTCGGGAGCG
AGCTGCAATGAAAGAAGAAAAGAAAATGGAAGCACAAGGACAAACGAAAGGGAGCGAGACGGTGGATTCTCTTCTTAAGCTATTGAGAAAGCATTCAGTTGAGCAAGGGA
AGAGAAGCAGTGGTAGTGGTAGTGGCAGCAGCAGCAGCAAGGACTTCAGTGTTAACCATGTCAAAGAGAATGGTCCCTATGATGAAGGAAAAGGCACAAGCATTTTTGGC
CTAAGTGCCAGCTTGAGAGAGAAGGCCCAAGAACCAACAGGATCTTCTTTCAGTAGACCTGTATCAAATTTTCAACGTAAATCCCCCGTGCCTCGGGTGAAATACCAACC
AATTTACCCTGGGGAAAGTAATGTAGACTCCACTGATGGCATGAATTCAAAGGGAGTGAAACTTAATGGAGCCGAGACAGGTTCTCAACTGAAGGCAAAGGTATGGACTC
GACAAGAGTCAGAACGAGAGGCCTGGGAAGAGCTGCAATCACAAGGAGGGCAGGAGCCAGAGCCAGACCAAGAGTTTGAGTTGGAGCCAGAGCCTGAATCATATGATGAG
CTCGAGCACGAACCTGATGAGATGGAGTCTGAACTCGTTAATTTATTAGGTGTGTCTTCAGACGTTGATGACACATTTGATGACAACGTTAAAGACATTGAAAAATTTGC
AAAGCATGAGGATCATGAGGACTTGAACTCATTGAAGCTTACTGAACTGAAGGCAATGGCCAAATCTCATGGTATGAAAGGCTTCTCGAAGATGAAGAAGAGCGAGCTCG
TGCAGTTGCTAAGTGAGGCTAGAGTATAA
mRNA sequenceShow/hide mRNA sequence
GTGAAATTCGCAATTATAGGCCAGCAGCGCAAAGAAGAAGGAAGCGGAAAAGCGAAAAATCCTCATAGAAATGCAAATCCAAAACCCTTCTTTACCTCACTAAAACCTTT
TCTCCAAGCTGTCTATACCAGCAATGTCTCAAGCCATACATCTTCTTCCTAACAACGTTACAGATAGCAGATGCCTACCATGCTCTGGAGTTTCAGTACGGACAGCCACT
GTCTCTTCTCTCTCTTTATGTGCTGAACATAGAACCAATGCACAGGTCAAATTCCGACTCCTAAACTGTACTTCGTTGGGGGCTTTTACGTGTAAAGCCAGCTCGGGAGG
TCATAGGAGAAACCCAGATTTCTCAAAGCAAAATAGGTATGGCTTCTCAAGAAGTAGAAATAGACAAAATGAGGAGAGAGATAGCCTTGAAAATCTTGATGAATCTGATT
TATTATCGTCTAAGAATGGACCATTACTTTCCCTCTCTAGCACCCCAAAATCCCAGGCCACTGCTACCCCAGGCCCTAGGGAGAAGGAAATTGTTGAACTTTTCAGGAAG
GTTCAAGCTCAACTTCGGGAGCGAGCTGCAATGAAAGAAGAAAAGAAAATGGAAGCACAAGGACAAACGAAAGGGAGCGAGACGGTGGATTCTCTTCTTAAGCTATTGAG
AAAGCATTCAGTTGAGCAAGGGAAGAGAAGCAGTGGTAGTGGTAGTGGCAGCAGCAGCAGCAAGGACTTCAGTGTTAACCATGTCAAAGAGAATGGTCCCTATGATGAAG
GAAAAGGCACAAGCATTTTTGGCCTAAGTGCCAGCTTGAGAGAGAAGGCCCAAGAACCAACAGGATCTTCTTTCAGTAGACCTGTATCAAATTTTCAACGTAAATCCCCC
GTGCCTCGGGTGAAATACCAACCAATTTACCCTGGGGAAAGTAATGTAGACTCCACTGATGGCATGAATTCAAAGGGAGTGAAACTTAATGGAGCCGAGACAGGTTCTCA
ACTGAAGGCAAAGGTATGGACTCGACAAGAGTCAGAACGAGAGGCCTGGGAAGAGCTGCAATCACAAGGAGGGCAGGAGCCAGAGCCAGACCAAGAGTTTGAGTTGGAGC
CAGAGCCTGAATCATATGATGAGCTCGAGCACGAACCTGATGAGATGGAGTCTGAACTCGTTAATTTATTAGGTGTGTCTTCAGACGTTGATGACACATTTGATGACAAC
GTTAAAGACATTGAAAAATTTGCAAAGCATGAGGATCATGAGGACTTGAACTCATTGAAGCTTACTGAACTGAAGGCAATGGCCAAATCTCATGGTATGAAAGGCTTCTC
GAAGATGAAGAAGAGCGAGCTCGTGCAGTTGCTAAGTGAGGCTAGAGTATAAGAAAAGAAACACATGTTGGACAGGAACTTCTGGGTTATAGAATGTTTAGGTGCTACTT
TTTTGCTTCTCTTTTCTTTTGTTGAATATAATCAATCTGTATGACCAGTTTTGAGTTGGAATTAGTTTTTCAAACCTTTCCCAGATTGTATATCGGGCTAGCCTTTTGAC
AGGTTAAGAGTACATGTCAGCAACAACAATCTCTCTTTTAGCTTTCTCCTATTCAGATACTTGTTCACTTTTAGATATGAATAATTTTTTAAATCTCTAA
Protein sequenceShow/hide protein sequence
MSQAIHLLPNNVTDSRCLPCSGVSVRTATVSSLSLCAEHRTNAQVKFRLLNCTSLGAFTCKASSGGHRRNPDFSKQNRYGFSRSRNRQNEERDSLENLDESDLLSSKNGP
LLSLSSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGSGSGSSSSKDFSVNHVKENGPYDEGKGTSIFG
LSASLREKAQEPTGSSFSRPVSNFQRKSPVPRVKYQPIYPGESNVDSTDGMNSKGVKLNGAETGSQLKAKVWTRQESEREAWEELQSQGGQEPEPDQEFELEPEPESYDE
LEHEPDEMESELVNLLGVSSDVDDTFDDNVKDIEKFAKHEDHEDLNSLKLTELKAMAKSHGMKGFSKMKKSELVQLLSEARV