; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0023082 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0023082
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionrho-N domain-containing protein 1, chloroplastic-like
Genome locationchr04:20418196..20422586
RNA-Seq ExpressionPI0023082
SyntenyPI0023082
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151986.1 rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis sativus]3.7e-20091.13Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MSQAIHLLP+NPTGFGLSDSRC+PCSGVSGRAASFS  +LCAEH INVPVKFRPLNCTSL  SFTCKASSGGHRRNPDF KQNRHGFSRSRNRQNEERES
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        L+NVDESDLL SKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES
        KDISFNHVKENGPYDEGRGSS FG SPNLREK Q       RPVSNFQRRSPVPRVKYQPIYPGESIV+ST+ MNSKGVK NGT+TGSQLK KVWTRQES
Subjt:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES

Query:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHG--EHEDLNSLKLAELRAIAKSRSLKG
         REHWEELQSQRE EQEPEPDQEFELEPEAE+Y LEHE DEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHG  EHEDLNSLKLAELRAIAKSRSL+G
Subjt:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHG--EHEDLNSLKLAELRAIAKSRSLKG

Query:  FSKMKKSELVQLLSNGQ
        FSKMKKSELVQLLSNGQ
Subjt:  FSKMKKSELVQLLSNGQ

XP_008447419.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X6 [Cucumis melo]3.5e-19889.34Show/hide
Query:  MSQAIHLLPNNPTG---------FGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSR
        MSQAIHLLP NPTG         F + DSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF KQNR G+SRSR
Subjt:  MSQAIHLLPNNPTG---------FGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSR

Query:  NRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR
        NRQNEERESLENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKR
Subjt:  NRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR

Query:  SSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLK
        SSG GG SNKDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+KLNGTETGSQLK
Subjt:  SSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLK

Query:  AKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIA
        AKVWTRQES REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LNSLKLAELRAIA
Subjt:  AKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIA

Query:  KSRSLKGFSKMKKSELVQLLSN
        KSRSL+GFSKMKKSELVQLLSN
Subjt:  KSRSLKGFSKMKKSELVQLLSN

XP_008447421.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X7 [Cucumis melo]2.3e-20292.01Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MSQAIHLLP NPTGFGLSDSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF KQNR G+SRSRNRQNEERES
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        LENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GG SN
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES
        KDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+KLNGTETGSQLKAKVWTRQES
Subjt:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES

Query:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFS
         REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LNSLKLAELRAIAKSRSL+GFS
Subjt:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFS

Query:  KMKKSELVQLLSN
        KMKKSELVQLLSN
Subjt:  KMKKSELVQLLSN

XP_016900423.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis melo]9.1e-19987.76Show/hide
Query:  MSQAIHLLPNNPT--------------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFS
        MSQAIHLLP NPT                    GFGLSDSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF 
Subjt:  MSQAIHLLPNNPT--------------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFS

Query:  KQNRHGFSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKL
        KQNR G+SRSRNRQNEERESLENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKL
Subjt:  KQNRHGFSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKL

Query:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVK
        LRKH+VEQGKRSSG GG SNKDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+K
Subjt:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVK

Query:  LNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN
        LNGTETGSQLKAKVWTRQES REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LN
Subjt:  LNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN

Query:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN
        SLKLAELRAIAKSRSL+GFSKMKKSELVQLLSN
Subjt:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN

XP_016900425.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X4 [Cucumis melo]1.8e-19988.99Show/hide
Query:  MSQAIHLLPNNPT--------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHG
        MSQAIHLLP NPT              GFGLSDSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF KQNR G
Subjt:  MSQAIHLLPNNPT--------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHG

Query:  FSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSV
        +SRSRNRQNEERESLENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTET
        EQGKRSSG GG SNKDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+KLNGTET
Subjt:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTET

Query:  GSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAE
        GSQLKAKVWTRQES REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LNSLKLAE
Subjt:  GSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAE

Query:  LRAIAKSRSLKGFSKMKKSELVQLLSN
        LRAIAKSRSL+GFSKMKKSELVQLLSN
Subjt:  LRAIAKSRSLKGFSKMKKSELVQLLSN

TrEMBL top hitse value%identityAlignment
A0A0A0L7X8 Rho_N domain-containing protein1.8e-20091.13Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MSQAIHLLP+NPTGFGLSDSRC+PCSGVSGRAASFS  +LCAEH INVPVKFRPLNCTSL  SFTCKASSGGHRRNPDF KQNRHGFSRSRNRQNEERES
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        L+NVDESDLL SKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES
        KDISFNHVKENGPYDEGRGSS FG SPNLREK Q       RPVSNFQRRSPVPRVKYQPIYPGESIV+ST+ MNSKGVK NGT+TGSQLK KVWTRQES
Subjt:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES

Query:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHG--EHEDLNSLKLAELRAIAKSRSLKG
         REHWEELQSQRE EQEPEPDQEFELEPEAE+Y LEHE DEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHG  EHEDLNSLKLAELRAIAKSRSL+G
Subjt:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHG--EHEDLNSLKLAELRAIAKSRSLKG

Query:  FSKMKKSELVQLLSNGQ
        FSKMKKSELVQLLSNGQ
Subjt:  FSKMKKSELVQLLSNGQ

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X71.1e-20292.01Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MSQAIHLLP NPTGFGLSDSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF KQNR G+SRSRNRQNEERES
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN
        LENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSSG GG SN
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSN

Query:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES
        KDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+KLNGTETGSQLKAKVWTRQES
Subjt:  KDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQES

Query:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFS
         REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LNSLKLAELRAIAKSRSL+GFS
Subjt:  AREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFS

Query:  KMKKSELVQLLSN
        KMKKSELVQLLSN
Subjt:  KMKKSELVQLLSN

A0A1S3BIA8 rho-N domain-containing protein 1, chloroplastic isoform X61.7e-19889.34Show/hide
Query:  MSQAIHLLPNNPTG---------FGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSR
        MSQAIHLLP NPTG         F + DSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF KQNR G+SRSR
Subjt:  MSQAIHLLPNNPTG---------FGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSR

Query:  NRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR
        NRQNEERESLENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKR
Subjt:  NRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR

Query:  SSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLK
        SSG GG SNKDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+KLNGTETGSQLK
Subjt:  SSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLK

Query:  AKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIA
        AKVWTRQES REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LNSLKLAELRAIA
Subjt:  AKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIA

Query:  KSRSLKGFSKMKKSELVQLLSN
        KSRSL+GFSKMKKSELVQLLSN
Subjt:  KSRSLKGFSKMKKSELVQLLSN

A0A1S4DWS1 rho-N domain-containing protein 1, chloroplastic isoform X14.4e-19987.76Show/hide
Query:  MSQAIHLLPNNPT--------------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFS
        MSQAIHLLP NPT                    GFGLSDSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF 
Subjt:  MSQAIHLLPNNPT--------------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFS

Query:  KQNRHGFSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKL
        KQNR G+SRSRNRQNEERESLENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKL
Subjt:  KQNRHGFSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKL

Query:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVK
        LRKH+VEQGKRSSG GG SNKDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+K
Subjt:  LRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVK

Query:  LNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN
        LNGTETGSQLKAKVWTRQES REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LN
Subjt:  LNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN

Query:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN
        SLKLAELRAIAKSRSL+GFSKMKKSELVQLLSN
Subjt:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X48.9e-20088.99Show/hide
Query:  MSQAIHLLPNNPT--------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHG
        MSQAIHLLP NPT              GFGLSDSRCLPCSGVSGRAASFS R+LCAEHGINVPVKFRPLNCTSL  SFTCKASS GHRRNPDF KQNR G
Subjt:  MSQAIHLLPNNPT--------------GFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHG

Query:  FSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSV
        +SRSRNRQNEERESLENVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAM+EEKKMEAQGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRQNEERESLENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTET
        EQGKRSSG GG SNKDISFNHVKENGPYDEGRGSSIFG SPNLREK QEP GSF RP SNFQRRSPVPRVKYQPIYPGESIVDST+ MNSKG+KLNGTET
Subjt:  EQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTET

Query:  GSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAE
        GSQLKAKVWTRQES REHWEELQSQR+TEQEPE DQEFE+EPEAE+Y LEHEADEMEPELVNLLGVSSD+DDTFEDD+KDNEEF+KHGEHE+LNSLKLAE
Subjt:  GSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAE

Query:  LRAIAKSRSLKGFSKMKKSELVQLLSN
        LRAIAKSRSL+GFSKMKKSELVQLLSN
Subjt:  LRAIAKSRSLKGFSKMKKSELVQLLSN

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-732.2e-3835.28Show/hide
Query:  RTLCAEHGIN-----------VPVKFRPLNCTSLVASFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRQNEERESLENVDE--SDLLSSKNGPLLSISST
        RTL A HG N           +P+  RP+       S  C A+   HR R+ D ++  + G +R +++  +E++  EN+DE  +D++SSKNGP +S++S 
Subjt:  RTLCAEHGIN-----------VPVKFRPLNCTSLVASFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRQNEERESLENVDE--SDLLSSKNGPLLSISST

Query:  PKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSI
         + QAT+ PG REKEIVELF++VQAQLR R   +EEKK E Q + +G   +VDSLL LLRKHSV+Q ++S        K+ S +  K +      + SSI
Subjt:  PKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSI

Query:  FGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPI--YPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQESAREHWEELQ-SQRETEQEPE
        F  +    E+ +    +F RP SNF+RRSPVP VK+QP+     E ++++ +D   +       +  +     V T + ++    E L     +   + E
Subjt:  FGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPI--YPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQESAREHWEELQ-SQRETEQEPE

Query:  PDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN
        PD     EP         E DE   ++ ++  +    D T +  +             DL++LK+ ELR +AKSR +KG+SKMKK++LV+LLSN
Subjt:  PDQEFELEPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN

Q94K75 Rho-N domain-containing protein 1, chloroplastic8.4e-6243.19Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MS   HL  +   G+ LSDSRC   S VS R  +    + C +H      K   L      +SF C+ASSGG+RRNPDFS+ N+HG+ R  NRQ+  RE 
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR-----S
         + ++ SD+LSS+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA +EEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR     S
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR-----S

Query:  SGG---GGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQ
        S G   G + +K     ++  +G  D                       SF+RP S+F+R+SPVPR +  P Y  E+  D +            + T +Q
Subjt:  SGG---GGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQ

Query:  LKAKVWTRQESAREHWEELQSQRETEQEPEP-----DQEFELEPEAESYGLEHEADEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN
         K  V    E   E   E + + E E EP P     + + EL+PE+ S+  E E D++  ++++    +L V SD D++ +D  +D++E A+    +DL+
Subjt:  LKAKVWTRQESAREHWEELQSQRETEQEPEP-----DQEFELEPEAESYGLEHEADEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN

Query:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN
         LKL ELR IAKSR LKG SKMKK+ELV+LL +
Subjt:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor5.9e-6343.19Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MS   HL  +   G+ LSDSRC   S VS R  +    + C +H      K   L      +SF C+ASSGG+RRNPDFS+ N+HG+ R  NRQ+  RE 
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR-----S
         + ++ SD+LSS+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA +EEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR     S
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR-----S

Query:  SGG---GGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQ
        S G   G + +K     ++  +G  D                       SF+RP S+F+R+SPVPR +  P Y  E+  D +            + T +Q
Subjt:  SGG---GGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQ

Query:  LKAKVWTRQESAREHWEELQSQRETEQEPEP-----DQEFELEPEAESYGLEHEADEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN
         K  V    E   E   E + + E E EP P     + + EL+PE+ S+  E E D++  ++++    +L V SD D++ +D  +D++E A+    +DL+
Subjt:  LKAKVWTRQESAREHWEELQSQRETEQEPEP-----DQEFELEPEAESYGLEHEADEMEPELVN----LLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLN

Query:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN
         LKL ELR IAKSR LKG SKMKK+ELV+LL +
Subjt:  SLKLAELRAIAKSRSLKGFSKMKKSELVQLLSN

AT1G06190.2 Rho termination factor7.1e-4846.81Show/hide
Query:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES
        MS   HL  +   G+ LSDSRC   S VS R  +    + C +H      K   L      +SF C+ASSGG+RRNPDFS+ N+HG+ R  NRQ+  RE 
Subjt:  MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERES

Query:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR-----S
         + ++ SD+LSS+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA +EEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR     S
Subjt:  LENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKR-----S

Query:  SGG---GGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDST
        S G   G + +K     ++  +G  D                       SF+RP S+F+R+SPVPR +  P Y  E+  D +
Subjt:  SGG---GGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDST

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism8.4e-2538.82Show/hide
Query:  HRR-NPDFSKQNRHGFSRSRNRQNEERESL-ENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKMEAQGQTK
        HRR NPDFS+ N+HGF R RNR+NE+++ L +   E D+LSSKN                     EKEIVELF+KVQ QLR R AA +EEKK E   + +
Subjt:  HRR-NPDFSKQNRHGFSRSRNRQNEERESL-ENVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMREEKKMEAQGQTK

Query:  G---SETVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGES
        G   SETVDSLLKLLRKHS EQ K+      S  +    +   E   +   R  S        R K    T  F+RP S+F+R SPVPR K Q  Y  E+
Subjt:  G---SETVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGSSIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGES

Query:  IVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFEL-EPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVK
        I D                      +  WT+++      ++++S+ E E EPEP+   E  EPE E+   E+E  E EPEL  L  VS    ++F  +  
Subjt:  IVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFEL-EPEAESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVK

Query:  DNEE
        ++EE
Subjt:  DNEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGCCATTCATCTTCTTCCTAACAACCCTACAGGCTTTGGACTGTCAGATAGCAGATGCCTACCGTGCTCTGGAGTTTCAGGACGAGCAGCCTCGTTCTCTTC
TCGCACTTTATGTGCTGAACATGGAATCAATGTACCAGTCAAATTTAGACCTCTAAACTGTACTTCGTTGGTGGCATCTTTTACGTGCAAAGCCAGCTCAGGAGGTCACA
GGAGAAACCCAGATTTCTCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCAAAATGAGGAGAGAGAGAGCCTTGAAAATGTTGACGAATCTGATTTATTA
TCGTCTAAGAATGGACCATTACTTTCCATCTCTAGCACCCCAAAATCCCAGGCCACTGCTACCCCAGGCCCAAGGGAAAAGGAAATTGTTGAACTTTTCAGAAAGGTTCA
AGCTCAGCTTCGGGAGCGAGCTGCAATGAGAGAAGAGAAGAAAATGGAAGCTCAAGGACAAACAAAAGGGAGTGAGACAGTAGATTCTCTTCTTAAGCTATTGCGAAAGC
ATTCAGTGGAACAAGGGAAGAGAAGCAGTGGTGGGGGTGGCAGCAGCAACAAGGACATCAGTTTTAACCATGTCAAAGAGAATGGTCCTTATGATGAAGGAAGAGGCTCA
AGCATTTTTGGCCCAAGTCCCAATTTGAGAGAGAAGACCCAAGAACCTACAGGTTCTTTCAGTAGACCCGTATCAAATTTTCAACGTAGATCCCCCGTGCCTCGGGTGAA
GTACCAGCCAATTTACCCTGGGGAAAGTATTGTCGACTCGACCGATGATATGAATTCAAAAGGTGTGAAACTTAATGGAACCGAAACAGGTTCTCAACTGAAAGCAAAGG
TATGGACTCGACAAGAGTCGGCACGAGAGCACTGGGAAGAGCTGCAATCACAAAGAGAGACAGAACAGGAGCCAGAGCCGGACCAAGAGTTTGAATTGGAACCAGAGGCT
GAATCATATGGTCTAGAGCATGAAGCTGATGAAATGGAGCCTGAACTTGTTAATTTATTAGGCGTGTCGTCAGATGTCGATGACACGTTTGAGGATGATGTTAAAGACAA
TGAGGAATTTGCAAAGCATGGTGAACATGAGGACTTGAACTCATTGAAACTTGCTGAACTGAGGGCGATTGCCAAATCTCGTAGTTTGAAAGGGTTTTCGAAGATGAAGA
AGAGTGAGCTCGTGCAGTTGTTAAGCAACGGTCAGTGA
mRNA sequenceShow/hide mRNA sequence
CCTAAATCGTATTTTCCGGAGAAGAATGTCGTATCCAATAGTACTATTAGCATTAGTAGCTCAAAGAGATGGCCACCGGCTCCGGTAGAGGCATTTTGGGCTTAGGTCAG
GAAGATACGGTGAGAGAAGCCCAGTCCCCATGGCCCAACTACATCCTCTTCGCCCGAGGAAGCTGCGGCATTTCTCAAGCCTAACCAATCTCTCCCTTTATTGGTGATAA
TTCACAATTATAGGCCAGCAGGACAAGGAAGAACGAACCGGAAAAGCGAAAATCCTCAAAGAAATGCAAATCAAAAACCCTTCTTTTCCTCACTAAAACCTTTCCTTTCC
TTCCAGCTGTCTTAACCAGCAATGTCTCAAGCCATTCATCTTCTTCCTAACAACCCTACAGGCTTTGGACTGTCAGATAGCAGATGCCTACCGTGCTCTGGAGTTTCAGG
ACGAGCAGCCTCGTTCTCTTCTCGCACTTTATGTGCTGAACATGGAATCAATGTACCAGTCAAATTTAGACCTCTAAACTGTACTTCGTTGGTGGCATCTTTTACGTGCA
AAGCCAGCTCAGGAGGTCACAGGAGAAACCCAGATTTCTCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCAAAATGAGGAGAGAGAGAGCCTTGAAAAT
GTTGACGAATCTGATTTATTATCGTCTAAGAATGGACCATTACTTTCCATCTCTAGCACCCCAAAATCCCAGGCCACTGCTACCCCAGGCCCAAGGGAAAAGGAAATTGT
TGAACTTTTCAGAAAGGTTCAAGCTCAGCTTCGGGAGCGAGCTGCAATGAGAGAAGAGAAGAAAATGGAAGCTCAAGGACAAACAAAAGGGAGTGAGACAGTAGATTCTC
TTCTTAAGCTATTGCGAAAGCATTCAGTGGAACAAGGGAAGAGAAGCAGTGGTGGGGGTGGCAGCAGCAACAAGGACATCAGTTTTAACCATGTCAAAGAGAATGGTCCT
TATGATGAAGGAAGAGGCTCAAGCATTTTTGGCCCAAGTCCCAATTTGAGAGAGAAGACCCAAGAACCTACAGGTTCTTTCAGTAGACCCGTATCAAATTTTCAACGTAG
ATCCCCCGTGCCTCGGGTGAAGTACCAGCCAATTTACCCTGGGGAAAGTATTGTCGACTCGACCGATGATATGAATTCAAAAGGTGTGAAACTTAATGGAACCGAAACAG
GTTCTCAACTGAAAGCAAAGGTATGGACTCGACAAGAGTCGGCACGAGAGCACTGGGAAGAGCTGCAATCACAAAGAGAGACAGAACAGGAGCCAGAGCCGGACCAAGAG
TTTGAATTGGAACCAGAGGCTGAATCATATGGTCTAGAGCATGAAGCTGATGAAATGGAGCCTGAACTTGTTAATTTATTAGGCGTGTCGTCAGATGTCGATGACACGTT
TGAGGATGATGTTAAAGACAATGAGGAATTTGCAAAGCATGGTGAACATGAGGACTTGAACTCATTGAAACTTGCTGAACTGAGGGCGATTGCCAAATCTCGTAGTTTGA
AAGGGTTTTCGAAGATGAAGAAGAGTGAGCTCGTGCAGTTGTTAAGCAACGGTCAGTGAGGATGTCATGCTGGAAAGGAACCTCTGGATTTAGGATATTTAGGTGTTTTT
TTTACTTATTTTTTATTTTATTTATTTTTCTTGTGCTTATCAATCTGTATGACCAGTTTTGAGTTAGAAATTGGAATTAGCTTTACAAACATTTTCCCAGATTGTATATC
AGGCTAATTTTCACAGGTTCAGAGTACATGTCAGCAACAATCTCGTCTTTTAGTTTTTTCTCTTCTATTCAAATTCTTGTTCACTTATA
Protein sequenceShow/hide protein sequence
MSQAIHLLPNNPTGFGLSDSRCLPCSGVSGRAASFSSRTLCAEHGINVPVKFRPLNCTSLVASFTCKASSGGHRRNPDFSKQNRHGFSRSRNRQNEERESLENVDESDLL
SSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMREEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGSSNKDISFNHVKENGPYDEGRGS
SIFGPSPNLREKTQEPTGSFSRPVSNFQRRSPVPRVKYQPIYPGESIVDSTDDMNSKGVKLNGTETGSQLKAKVWTRQESAREHWEELQSQRETEQEPEPDQEFELEPEA
ESYGLEHEADEMEPELVNLLGVSSDVDDTFEDDVKDNEEFAKHGEHEDLNSLKLAELRAIAKSRSLKGFSKMKKSELVQLLSNGQ