; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G001090 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G001090
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionrho-N domain-containing protein 1, chloroplastic-like
Genome locationCG_Chr07:1167045..1172565
RNA-Seq ExpressionClCG07G001090
SyntenyClCG07G001090
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151986.1 rho-N domain-containing protein 1, chloroplastic isoform X1 [Cucumis sativus]8.6e-18885.61Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MSQAIHLLP+N TGFGLSDS C+PCSGVSGRAAS S  SLCAEH+IN PVKFRPLNCTSLG SFTCKASSGGHRRNPDF KQNRHGFSRSRNR NEERES
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS
        LDNVDESDLL SKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG S+
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS

Query:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES
        K+ SFNHVKENGPYDEG+G+S FGLS N+R+KAQ       RPVSNFQR+SPVPRVKYQPIYPGES+VNST+G+N+KGVK NGT+ GSQLK KVWTRQES
Subjt:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES

Query:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKH--DEHVDLNSLKLAELRAIAKSRSLKG
        EREHWEELQSQ E EQEPEPDQE+ELEPEAE+Y+L+HE DEMEPELVNLLGVSSD+DD  ED+VKDNE+FAKH   EH DLNSLKLAELRAIAKSRSL+G
Subjt:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKH--DEHVDLNSLKLAELRAIAKSRSLKG

Query:  FSKMKKSELVRLLSDAQ
        FSKMKKSELV+LLS+ Q
Subjt:  FSKMKKSELVRLLSDAQ

XP_008447421.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X7 [Cucumis melo]3.0e-18885.75Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MSQAIHLLP N TGFGLSDS CLPCSGVSGRAAS S RSLCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERES
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS
        L+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSS G GGS+
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS

Query:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES
        K+ SFNHVKENGPYDEG+G+SIFGLS N+R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQES
Subjt:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES

Query:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFS
        EREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFS
Subjt:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFS

Query:  KMKKSELVRLLSDA
        KMKKSELV+LLS++
Subjt:  KMKKSELVRLLSDA

XP_016900425.1 PREDICTED: rho-N domain-containing protein 1, chloroplastic isoform X4 [Cucumis melo]2.3e-18582.94Show/hide
Query:  MSQAIHLLPNNLT--------------GFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHG
        MSQAIHLLP N T              GFGLSDS CLPCSGVSGRAAS S RSLCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF KQNR G
Subjt:  MSQAIHLLPNNLT--------------GFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHG

Query:  FSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSV
        +SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEA
        EQGKRSS G GGS+K+ SFNHVKENGPYDEG+G+SIFGLS N+R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+KLNGTE 
Subjt:  EQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEA

Query:  GSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAE
        GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LNSLKLAE
Subjt:  GSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAE

Query:  LRAIAKSRSLKGFSKMKKSELVRLLSDA
        LRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  LRAIAKSRSLKGFSKMKKSELVRLLSDA

XP_038887988.1 rho-N domain-containing protein 1, chloroplastic-like isoform X1 [Benincasa hispida]7.0e-19889.76Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MSQAIHLLPNNLT FGLSDS CLPCSGVSGRAASVSSRSLCAEH+I+A VKFRPLNCTSLG SFTCKASSGGHRRNPDFSKQNRHGFSRSRNR NEERES
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG---GGG
        LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG   GGG
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG---GGG

Query:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR
         SSK+F+FNHVKENG YDEGKGTSIFGLSAN+R+KAQEPTGSF+RPVSNFQRKSPVPRVKYQPI+PGES+V+STDGVN+KGVKLNGTE  SQLKAKVWTR
Subjt:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR

Query:  QES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSL
        QES ER HWEELQSQGET+QEPE DQEYELEPEAESYEL+H+PDE E ELVNLLGVSSD+DD  +D+VKDNEKFAKHDEH DLNSLK+AELRAIAKSRSL
Subjt:  QES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSL

Query:  KGFSKMKKSELVRLLSDAQL
        KGFSKMKKSELV+LLSD  +
Subjt:  KGFSKMKKSELVRLLSDAQL

XP_038887989.1 rho-N domain-containing protein 1, chloroplastic-like isoform X2 [Benincasa hispida]8.0e-19488.81Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MSQAIHLLPNNLT     DS CLPCSGVSGRAASVSSRSLCAEH+I+A VKFRPLNCTSLG SFTCKASSGGHRRNPDFSKQNRHGFSRSRNR NEERES
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG---GGG
        LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG   GGG
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSG---GGG

Query:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR
         SSK+F+FNHVKENG YDEGKGTSIFGLSAN+R+KAQEPTGSF+RPVSNFQRKSPVPRVKYQPI+PGES+V+STDGVN+KGVKLNGTE  SQLKAKVWTR
Subjt:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR

Query:  QES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSL
        QES ER HWEELQSQGET+QEPE DQEYELEPEAESYEL+H+PDE E ELVNLLGVSSD+DD  +D+VKDNEKFAKHDEH DLNSLK+AELRAIAKSRSL
Subjt:  QES-EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSL

Query:  KGFSKMKKSELVRLLSDAQL
        KGFSKMKKSELV+LLSD  +
Subjt:  KGFSKMKKSELVRLLSDAQL

TrEMBL top hitse value%identityAlignment
A0A0A0L7X8 Rho_N domain-containing protein4.2e-18885.61Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MSQAIHLLP+N TGFGLSDS C+PCSGVSGRAAS S  SLCAEH+IN PVKFRPLNCTSLG SFTCKASSGGHRRNPDF KQNRHGFSRSRNR NEERES
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS
        LDNVDESDLL SKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKK+EAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG S+
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS

Query:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES
        K+ SFNHVKENGPYDEG+G+S FGLS N+R+KAQ       RPVSNFQR+SPVPRVKYQPIYPGES+VNST+G+N+KGVK NGT+ GSQLK KVWTRQES
Subjt:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES

Query:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKH--DEHVDLNSLKLAELRAIAKSRSLKG
        EREHWEELQSQ E EQEPEPDQE+ELEPEAE+Y+L+HE DEMEPELVNLLGVSSD+DD  ED+VKDNE+FAKH   EH DLNSLKLAELRAIAKSRSL+G
Subjt:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKH--DEHVDLNSLKLAELRAIAKSRSLKG

Query:  FSKMKKSELVRLLSDAQ
        FSKMKKSELV+LLS+ Q
Subjt:  FSKMKKSELVRLLSDAQ

A0A1S3BHF2 rho-N domain-containing protein 1, chloroplastic isoform X71.4e-18885.75Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MSQAIHLLP N TGFGLSDS CLPCSGVSGRAAS S RSLCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF KQNR G+SRSRNR NEERES
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS
        L+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKRSS G GGS+
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSS

Query:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES
        K+ SFNHVKENGPYDEG+G+SIFGLS N+R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+KLNGTE GSQLKAKVWTRQES
Subjt:  KEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQES

Query:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFS
        EREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LNSLKLAELRAIAKSRSL+GFS
Subjt:  EREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFS

Query:  KMKKSELVRLLSDA
        KMKKSELV+LLS++
Subjt:  KMKKSELVRLLSDA

A0A1S3BIA8 rho-N domain-containing protein 1, chloroplastic isoform X62.1e-18483.22Show/hide
Query:  MSQAIHLLPNNLTG---------FGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSR
        MSQAIHLLP N TG         F + DS CLPCSGVSGRAAS S RSLCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF KQNR G+SRSR
Subjt:  MSQAIHLLPNNLTG---------FGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSR

Query:  NRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR
        NR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+VEQGKR
Subjt:  NRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKR

Query:  SSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLK
        SS G GGS+K+ SFNHVKENGPYDEG+G+SIFGLS N+R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+KLNGTE GSQLK
Subjt:  SSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLK

Query:  AKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIA
        AKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LNSLKLAELRAIA
Subjt:  AKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIA

Query:  KSRSLKGFSKMKKSELVRLLSDA
        KSRSL+GFSKMKKSELV+LLS++
Subjt:  KSRSLKGFSKMKKSELVRLLSDA

A0A1S4DWS1 rho-N domain-containing protein 1, chloroplastic isoform X15.6e-18581.8Show/hide
Query:  MSQAIHLLPNNLT--------------------GFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFS
        MSQAIHLLP N T                    GFGLSDS CLPCSGVSGRAAS S RSLCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF 
Subjt:  MSQAIHLLPNNLT--------------------GFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFS

Query:  KQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKL
        KQNR G+SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKL
Subjt:  KQNRHGFSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKL

Query:  LRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVK
        LRKH+VEQGKRSS G GGS+K+ SFNHVKENGPYDEG+G+SIFGLS N+R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+K
Subjt:  LRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVK

Query:  LNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLN
        LNGTE GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LN
Subjt:  LNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLN

Query:  SLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA
        SLKLAELRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  SLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDA

A0A1S4DXI1 rho-N domain-containing protein 1, chloroplastic isoform X41.1e-18582.94Show/hide
Query:  MSQAIHLLPNNLT--------------GFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHG
        MSQAIHLLP N T              GFGLSDS CLPCSGVSGRAAS S RSLCAEH IN PVKFRPLNCTSLG+SFTCKASS GHRRNPDF KQNR G
Subjt:  MSQAIHLLPNNLT--------------GFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHG

Query:  FSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSV
        +SRSRNR NEERESL+NVDESDLLSS+NGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKH+V
Subjt:  FSRSRNRHNEERESLDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSV

Query:  EQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEA
        EQGKRSS G GGS+K+ SFNHVKENGPYDEG+G+SIFGLS N+R+KAQEP GSF RP SNFQR+SPVPRVKYQPIYPGES+V+ST+G+N+KG+KLNGTE 
Subjt:  EQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEA

Query:  GSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAE
        GSQLKAKVWTRQESEREHWEELQSQ +TEQEPE DQE+E+EPEAE+Y+L+HE DEMEPELVNLLGVSSDIDD  ED++KDNE+F+KH EH +LNSLKLAE
Subjt:  GSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAE

Query:  LRAIAKSRSLKGFSKMKKSELVRLLSDA
        LRAIAKSRSL+GFSKMKKSELV+LLS++
Subjt:  LRAIAKSRSLKGFSKMKKSELVRLLSDA

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-731.7e-3737.6Show/hide
Query:  LSFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRHNEERESLDNVDE--SDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKE
        LS  C A+   HR R+ D ++  + G +R +++  +E++  +N+DE  +D++SSKNGP +S++S  + QAT+ PG REKEIVELF++VQAQLR R   KE
Subjt:  LSFTCKASSGGHR-RNPDFSKQNRHGFSRSRNRHNEERESLDNVDE--SDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKE

Query:  EKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVK
        EKK E Q + +G   +VDSLL LLRKHSV+Q ++S     G  KE S +  K +      + +SIF  +    ++ +    +F RP SNF+R+SPVP VK
Subjt:  EKKMEAQGQTKGSE-TVDSLLKLLRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVK

Query:  YQPI--YPGESVVNS-TDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSS
        +QP+     E V+N+  D V          EA   L+ K  T  E +     E  S  E E     D ++  + E ++ + D    E +   + +  V  
Subjt:  YQPI--YPGESVVNS-TDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEAESYELDHEPDEMEPELVNLLGVSS

Query:  DIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSD
         ID+  +  +K +          DL++LK+ ELR +AKSR +KG+SKMKK++LV LLS+
Subjt:  DIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSD

Q94K75 Rho-N domain-containing protein 1, chloroplastic1.8e-6344.44Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MS   HL  + + G+ LSDS C   S VS R  ++   S C +HK N  +K  P        SF C+ASSGG+RRNPDFS+ N+HG+ R  NR +  RE 
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG
         D ++ SD+LSS+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR      
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG

Query:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR
            +FS     +    D+   T     S N  + A     SF RP S+F+RKSPVPR +  P Y  E+  + +   +            +Q K  V   
Subjt:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR

Query:  QESEREHWEELQSQGETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELR
         E E E   E + + E E EP P     + + EL+PE+ S+  + E D++  ++++    +L V SD D+ ++D  +D+++ A+ +   DL+ LKL ELR
Subjt:  QESEREHWEELQSQGETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELR

Query:  AIAKSRSLKGFSKMKKSELVRLL
         IAKSR LKG SKMKK+ELV LL
Subjt:  AIAKSRSLKGFSKMKKSELVRLL

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor1.3e-6444.44Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MS   HL  + + G+ LSDS C   S VS R  ++   S C +HK N  +K  P        SF C+ASSGG+RRNPDFS+ N+HG+ R  NR +  RE 
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG
         D ++ SD+LSS+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR      
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG

Query:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR
            +FS     +    D+   T     S N  + A     SF RP S+F+RKSPVPR +  P Y  E+  + +   +            +Q K  V   
Subjt:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTR

Query:  QESEREHWEELQSQGETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELR
         E E E   E + + E E EP P     + + EL+PE+ S+  + E D++  ++++    +L V SD D+ ++D  +D+++ A+ +   DL+ LKL ELR
Subjt:  QESEREHWEELQSQGETEQEPEP-----DQEYELEPEAESYELDHEPDEMEPELVN----LLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELR

Query:  AIAKSRSLKGFSKMKKSELVRLL
         IAKSR LKG SKMKK+ELV LL
Subjt:  AIAKSRSLKGFSKMKKSELVRLL

AT1G06190.2 Rho termination factor1.4e-5049.27Show/hide
Query:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES
        MS   HL  + + G+ LSDS C   S VS R  ++   S C +HK N  +K  P        SF C+ASSGG+RRNPDFS+ N+HG+ R  NR +  RE 
Subjt:  MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERES

Query:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG
         D ++ SD+LSS+NGPL ++SS+PK QAT++PGPREKEIVELFRKVQAQLR R AA KEEKK+E  ++GQ K SETVDSLLKLLRKHS EQ KR      
Subjt:  LDNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKME--AQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGG

Query:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNST
            +FS     +    D+   T     S N  + A     SF RP S+F+RKSPVPR +  P Y  E+  + +
Subjt:  GSSKEFSFNHVKENGPYDEGKGTSIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNST

AT2G31150.1 ATP binding;ATPases, coupled to transmembrane movement of ions, phosphorylative mechanism3.4e-2540.33Show/hide
Query:  HRR-NPDFSKQNRHGFSRSRNRHNEERESL-DNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK
        HRR NPDFS+ N+HGF R RNR NE+++ L D   E D+LSSKN                     EKEIVELF+KVQ QLR R AA KEEKK E   + +
Subjt:  HRR-NPDFSKQNRHGFSRSRNRHNEERESL-DNVDESDLLSSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRER-AAMKEEKKMEAQGQTK

Query:  G---SETVDSLLKLLRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSA-NMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGE
        G   SETVDSLLKLLRKHS EQ K+            +FN  K+    D+         S  + R+K    T  F RP S+F+R SPVPR K Q  Y  E
Subjt:  G---SETVDSLLKLLRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGTSIFGLSA-NMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGE

Query:  SVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQ--EY-ELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED
        ++ +               EA S   +  WT+++      ++++S+ E E EPEP+   EY E EPEAE YE + EP+    E V+ L + S   ++ ED
Subjt:  SVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQ--EY-ELEPEAESYELDHEPDEMEPELVNLLGVSSDIDDKIED

Query:  NVKDN
            N
Subjt:  NVKDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGCCATTCATCTTCTTCCTAACAACCTTACAGGCTTTGGACTGTCAGATAGCATATGCCTACCATGCTCTGGAGTTTCAGGACGAGCAGCCTCAGTCTCATC
TCGCTCTTTATGTGCTGAACATAAAATCAATGCACCGGTCAAATTCAGACCCCTAAACTGTACTTCGTTGGGGTTGTCTTTTACATGCAAAGCCAGCTCAGGAGGTCATA
GGAGAAACCCAGATTTTTCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCACAATGAGGAGAGAGAGAGCCTTGACAATGTTGATGAATCTGATTTATTA
TCATCTAAGAATGGACCCTTACTTTCCATCTCTAGCACACCAAAATCCCAGGCCACAGCTACCCCAGGCCCGAGGGAGAAGGAAATTGTTGAACTTTTCAGGAAGGTTCA
AGCTCAGCTTCGGGAGCGAGCTGCAATGAAAGAAGAGAAGAAAATGGAAGCTCAAGGACAAACGAAAGGGAGCGAGACAGTGGATTCTCTTCTTAAGCTATTGAGAAAGC
ATTCGGTGGAGCAAGGGAAGAGAAGCAGTGGTGGTGGCGGCGGCAGCAGCAAGGAGTTCAGTTTTAACCATGTCAAAGAGAATGGTCCTTATGATGAAGGAAAAGGCACA
AGCATTTTTGGCCTAAGTGCCAATATGAGAGACAAGGCCCAAGAACCAACAGGTTCTTTCAATAGACCCGTATCAAATTTTCAACGTAAATCCCCCGTGCCTCGGGTGAA
GTACCAACCAATTTACCCCGGGGAAAGTGTTGTCAACTCGACTGATGGTGTGAATGCAAAGGGAGTGAAACTTAATGGAACTGAGGCAGGTTCTCAACTGAAGGCAAAGG
TATGGACTCGGCAGGAGTCAGAACGAGAGCACTGGGAAGAGCTGCAATCACAAGGAGAGACTGAGCAGGAGCCAGAGCCAGATCAAGAGTATGAGTTGGAGCCAGAGGCT
GAATCATATGAGCTAGACCATGAGCCTGATGAAATGGAGCCTGAACTTGTAAATTTATTAGGCGTGTCATCAGACATCGATGACAAGATTGAAGACAATGTTAAAGACAA
TGAGAAATTTGCAAAGCATGATGAACATGTGGATTTGAACTCATTGAAGCTTGCTGAACTGAGGGCGATTGCCAAATCTCGCAGTTTGAAAGGGTTCTCGAAGATGAAGA
AGAGTGAACTCGTGCGGTTGTTAAGCGACGCTCAGCTAAAGAAGTATTTAAAGAAATCGACCCAGCACCACGACGCTCTTCAATATCTCACCCTCGGGGTTGAAGTGCAG
TGCCGCAGCACTGAGGTAGCGCTACGGCGCTGTAGCGTCTGTACGAGCATCCATCTTCTTAGCGCCATGACGCTACAAGACAATGCCGGAGCACTCCATGCTTTTAACGA
AAGCTTGCTCTCGGGTTTGAGTGTCATCGCATCATTTCTTCAAGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAAGCCATTCATCTTCTTCCTAACAACCTTACAGGCTTTGGACTGTCAGATAGCATATGCCTACCATGCTCTGGAGTTTCAGGACGAGCAGCCTCAGTCTCATC
TCGCTCTTTATGTGCTGAACATAAAATCAATGCACCGGTCAAATTCAGACCCCTAAACTGTACTTCGTTGGGGTTGTCTTTTACATGCAAAGCCAGCTCAGGAGGTCATA
GGAGAAACCCAGATTTTTCAAAGCAAAATAGGCATGGCTTCTCAAGAAGCAGAAATAGGCACAATGAGGAGAGAGAGAGCCTTGACAATGTTGATGAATCTGATTTATTA
TCATCTAAGAATGGACCCTTACTTTCCATCTCTAGCACACCAAAATCCCAGGCCACAGCTACCCCAGGCCCGAGGGAGAAGGAAATTGTTGAACTTTTCAGGAAGGTTCA
AGCTCAGCTTCGGGAGCGAGCTGCAATGAAAGAAGAGAAGAAAATGGAAGCTCAAGGACAAACGAAAGGGAGCGAGACAGTGGATTCTCTTCTTAAGCTATTGAGAAAGC
ATTCGGTGGAGCAAGGGAAGAGAAGCAGTGGTGGTGGCGGCGGCAGCAGCAAGGAGTTCAGTTTTAACCATGTCAAAGAGAATGGTCCTTATGATGAAGGAAAAGGCACA
AGCATTTTTGGCCTAAGTGCCAATATGAGAGACAAGGCCCAAGAACCAACAGGTTCTTTCAATAGACCCGTATCAAATTTTCAACGTAAATCCCCCGTGCCTCGGGTGAA
GTACCAACCAATTTACCCCGGGGAAAGTGTTGTCAACTCGACTGATGGTGTGAATGCAAAGGGAGTGAAACTTAATGGAACTGAGGCAGGTTCTCAACTGAAGGCAAAGG
TATGGACTCGGCAGGAGTCAGAACGAGAGCACTGGGAAGAGCTGCAATCACAAGGAGAGACTGAGCAGGAGCCAGAGCCAGATCAAGAGTATGAGTTGGAGCCAGAGGCT
GAATCATATGAGCTAGACCATGAGCCTGATGAAATGGAGCCTGAACTTGTAAATTTATTAGGCGTGTCATCAGACATCGATGACAAGATTGAAGACAATGTTAAAGACAA
TGAGAAATTTGCAAAGCATGATGAACATGTGGATTTGAACTCATTGAAGCTTGCTGAACTGAGGGCGATTGCCAAATCTCGCAGTTTGAAAGGGTTCTCGAAGATGAAGA
AGAGTGAACTCGTGCGGTTGTTAAGCGACGCTCAGCTAAAGAAGTATTTAAAGAAATCGACCCAGCACCACGACGCTCTTCAATATCTCACCCTCGGGGTTGAAGTGCAG
TGCCGCAGCACTGAGGTAGCGCTACGGCGCTGTAGCGTCTGTACGAGCATCCATCTTCTTAGCGCCATGACGCTACAAGACAATGCCGGAGCACTCCATGCTTTTAACGA
AAGCTTGCTCTCGGGTTTGAGTGTCATCGCATCATTTCTTCAAGGGTAG
Protein sequenceShow/hide protein sequence
MSQAIHLLPNNLTGFGLSDSICLPCSGVSGRAASVSSRSLCAEHKINAPVKFRPLNCTSLGLSFTCKASSGGHRRNPDFSKQNRHGFSRSRNRHNEERESLDNVDESDLL
SSKNGPLLSISSTPKSQATATPGPREKEIVELFRKVQAQLRERAAMKEEKKMEAQGQTKGSETVDSLLKLLRKHSVEQGKRSSGGGGGSSKEFSFNHVKENGPYDEGKGT
SIFGLSANMRDKAQEPTGSFNRPVSNFQRKSPVPRVKYQPIYPGESVVNSTDGVNAKGVKLNGTEAGSQLKAKVWTRQESEREHWEELQSQGETEQEPEPDQEYELEPEA
ESYELDHEPDEMEPELVNLLGVSSDIDDKIEDNVKDNEKFAKHDEHVDLNSLKLAELRAIAKSRSLKGFSKMKKSELVRLLSDAQLKKYLKKSTQHHDALQYLTLGVEVQ
CRSTEVALRRCSVCTSIHLLSAMTLQDNAGALHAFNESLLSGLSVIASFLQG