CuGenDBv2

HG10021287 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10021287
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Rho_N domain-containing protein
Genome location: Chr05:7375420..7376665
RNA-Seq Expression: HG10021287
Synteny: HG10021287
Gene Ontology terms: GO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domains:
  IPR011112 - Rho termination factor, N-terminal
  IPR036269 - Rho termination factor, N-terminal domain superfamily
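The genome location string can be split into its parts to get the genomic span of the gene. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr05:7375420..7376665' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr05:7375420..7376665")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span is 1246 bp, which can be compared with the length of the CDS listed under Sequences.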


Homology
GenBank top hits (hit, e-value, % identity, alignment)
KGN65614.1 hypothetical protein Csa_019894 [Cucumis sativus] | e-value: 3.1e-88 | % identity: 78.12
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIAL
        MEA+VF PRILIRFPNL+SL RRPTFASKD ADVYPSK+IQ SV+ + PDGNAGNRPPRR S PGK RK+E SSRKTE  K EE +KK + N+QEE+IAL
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIAL

Query:  FRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRG
        FRKIQTSIAKESA+S D+ES KDE     SILETLRESRKQ+KGKTSKKAGAKVLRSKG SEEKEM DPS PPAADF+LVRP SKFVKRSPIP       
Subjt:  FRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRG

Query:  SHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
          L+V+ASQAIAESRELKF S ENMKLTELKALAKSRGIKGYSKLKKNELME+L S
Subjt:  SHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

XP_022139843.1 uncharacterized protein LOC111010657 isoform X2 [Momordica charantia] | e-value: 3.7e-78 | % identity: 71.76
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIALF
        MEA+VFQ R L RFPNLVS GRRP FA K+ A V  SK IQ+SVT+N   GNAG RPPRR S PG+ RKNEP+  + EA   EDLK PK NNQEEIIALF
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIALF

Query:  RKIQTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPP--RGRGS
        RKIQTSIAK+SAT+ DE   +++ GA+SILE+LRESRKQVKG+TSKKAG KVLR KG SEE EM   S PAA+F+LVRP SKFVKRSPIPSPP   G  S
Subjt:  RKIQTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPP--RGRGS

Query:  HLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
         LR E SQAIAESRE+KF S+ENMKLTELKA+AKSRGIKGYSKLKKNEL+E+L S
Subjt:  HLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

XP_038893910.1 SAP-like protein BP-73 isoform X1 [Benincasa hispida] | e-value: 7.5e-103 | % identity: 78.93
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASK---------------------DNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEA
        MEA++FQPRILIRFP LVSLGRRPTFASK                     D ADVYPSK IQLSV+NN PDG AGNRPPRRISAPGK RKNEPSSRKTEA
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASK---------------------DNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEA

Query:  H-KEEDLKKPKFNNQEEIIALFRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVK----GKTSKKAGAKVLRSKGTSEEKEMQDPSPPAAD
        H  EEDLKK K+NNQEEIIALFRKI+TSIAKESA+S D+ESCKDE  GA+SILETLRESRKQVK    GK+SKKAGAK LRS+GTSEEKE+ DPSPPAAD
Subjt:  H-KEEDLKKPKFNNQEEIIALFRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVK----GKTSKKAGAKVLRSKGTSEEKEMQDPSPPAAD

Query:  FQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        FQLVRP SKFVKRSPIPSPPRG GSH RV+A+QAIAESRELKF SI+NMKLTELKALAKSRGIKGYSKLKKNEL+E+LGS
Subjt:  FQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

XP_038893911.1 SAP-like protein BP-73 isoform X2 [Benincasa hispida] | e-value: 1.4e-104 | % identity: 80.07
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASK---------------------DNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEA
        MEA++FQPRILIRFP LVSLGRRPTFASK                     D ADVYPSK IQLSV+NN PDG AGNRPPRRISAPGK RKNEPSSRKTEA
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASK---------------------DNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEA

Query:  H-KEEDLKKPKFNNQEEIIALFRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLV
        H  EEDLKK K+NNQEEIIALFRKI+TSIAKESA+S D+ESCKDE  GA+SILETLRESRKQVKGK+SKKAGAK LRS+GTSEEKE+ DPSPPAADFQLV
Subjt:  H-KEEDLKKPKFNNQEEIIALFRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLV

Query:  RPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        RP SKFVKRSPIPSPPRG GSH RV+A+QAIAESRELKF SI+NMKLTELKALAKSRGIKGYSKLKKNEL+E+LGS
Subjt:  RPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

XP_038893912.1 SAP-like protein BP-73 isoform X3 [Benincasa hispida] | e-value: 1.5e-106 | % identity: 85.33
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAH-KEEDLKKPKFNNQEEIIAL
        MEA++FQPRILIRFP LVSLGRRPTFASKD ADVYPSK IQLSV+NN PDG AGNRPPRRISAPGK RKNEPSSRKTEAH  EEDLKK K+NNQEEIIAL
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAH-KEEDLKKPKFNNQEEIIAL

Query:  FRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVK----GKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPPR
        FRKI+TSIAKESA+S D+ESCKDE  GA+SILETLRESRKQVK    GK+SKKAGAK LRS+GTSEEKE+ DPSPPAADFQLVRP SKFVKRSPIPSPPR
Subjt:  FRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVK----GKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPPR

Query:  GRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        G GSH RV+A+QAIAESRELKF SI+NMKLTELKALAKSRGIKGYSKLKKNEL+E+LGS
Subjt:  GRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A0A0LX66 Rho_N domain-containing protein | e-value: 1.5e-88 | % identity: 78.12
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIAL
        MEA+VF PRILIRFPNL+SL RRPTFASKD ADVYPSK+IQ SV+ + PDGNAGNRPPRR S PGK RK+E SSRKTE  K EE +KK + N+QEE+IAL
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIAL

Query:  FRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRG
        FRKIQTSIAKESA+S D+ES KDE     SILETLRESRKQ+KGKTSKKAGAKVLRSKG SEEKEM DPS PPAADF+LVRP SKFVKRSPIP       
Subjt:  FRKIQTSIAKESATS-DDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRG

Query:  SHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
          L+V+ASQAIAESRELKF S ENMKLTELKALAKSRGIKGYSKLKKNELME+L S
Subjt:  SHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

A0A1S3BFQ9 SAP-like protein BP-73 | e-value: 1.3e-76 | % identity: 71.31
Query:  VFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIALFRKI
        +F+ +  +RF +L++L         D ADVYPSK+IQLSV+NN PDGNA NRPPRR S PGK RK+E SSRK EA K EE++KK K N+QEEIIALFRKI
Subjt:  VFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIALFRKI

Query:  QTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRV
        Q SIAKESA+S DE    ++ GA SILETLRE RKQ+KGKTSKKAGAKV RSKGTSEEKEM DPS PPAADF+LVRP SKFVKRSPIP          +V
Subjt:  QTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRV

Query:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        +ASQAIAESRELKF SIENMKL ELKALAKSRGIKGYSKLKKNELME+L S
Subjt:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

A0A5A7UMB7 SAP-like protein BP-73 | e-value: 1.3e-76 | % identity: 71.31
Query:  VFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIALFRKI
        +F+ +  +RF +L++L         D ADVYPSK+IQLSV+NN PDGNA NRPPRR S PGK RK+E SSRK EA K EE++KK K N+QEEIIALFRKI
Subjt:  VFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHK-EEDLKKPKFNNQEEIIALFRKI

Query:  QTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRV
        Q SIAKESA+S DE    ++ GA SILETLRE RKQ+KGKTSKKAGAKV RSKGTSEEKEM DPS PPAADF+LVRP SKFVKRSPIP          +V
Subjt:  QTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPS-PPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRV

Query:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        +ASQAIAESRELKF SIENMKL ELKALAKSRGIKGYSKLKKNELME+L S
Subjt:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X2 | e-value: 1.8e-78 | % identity: 71.76
Query:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIALF
        MEA+VFQ R L RFPNLVS GRRP FA K+ A V  SK IQ+SVT+N   GNAG RPPRR S PG+ RKNEP+  + EA   EDLK PK NNQEEIIALF
Subjt:  MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIALF

Query:  RKIQTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPP--RGRGS
        RKIQTSIAK+SAT+ DE   +++ GA+SILE+LRESRKQVKG+TSKKAG KVLR KG SEE EM   S PAA+F+LVRP SKFVKRSPIPSPP   G  S
Subjt:  RKIQTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPP--RGRGS

Query:  HLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
         LR E SQAIAESRE+KF S+ENMKLTELKA+AKSRGIKGYSKLKKNEL+E+L S
Subjt:  HLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

A0A6J1K6B8 uncharacterized protein LOC111490473 | e-value: 3.2e-67 | % identity: 62.6
Query:  MEAIVFQPRILIRFPNLVSL-GRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIAL
        MEA+VFQ R LIRFPNLVS   RRP F  K+ AD Y S SIQL+V++N  DGNAG++P RR SAPG+ RKN PS RKT+ HK ED+KKPK NNQEEIIAL
Subjt:  MEAIVFQPRILIRFPNLVSL-GRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIAL

Query:  FRKIQTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSH
        FRKIQTSIA+E+A+S DE    ++ G +SILE L ESRKQVKGKT K AG K LR K TSE          AA+F+LVRP S FVKRSPIP+P  G GSH
Subjt:  FRKIQTSIAKESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSH

Query:  LRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        L                 ++ENMKL ELKA+AKSRGIKGYSKLKKNEL+E+L S
Subjt:  LRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

SwissProt top hits (hit, e-value, % identity, alignment)
Q94K75 Rho-N domain-containing protein 1, chloroplastic | e-value: 2.8e-04 | % identity: 47.06
Query:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        +A +   E+ E     +  +KL EL+ +AKSRG+KG SK+KK EL+E+LGS
Subjt:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
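The % identity column can be recomputed from the middle line of an alignment. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and takes identity as identical positions divided by alignment length; `percent_identity` is a hypothetical helper name. The query length is used as the alignment length because trailing spaces on a midline may be lost in a text copy.

```python
def percent_identity(midline, alignment_length):
    """Return % identity, counting only letters in the midline as identical positions."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Query and midline copied from the Q94K75 alignment above.
query = "EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS"
midline = "+A +   E+ E     +  +KL EL+ +AKSRG+KG SK+KK EL+E+LGS"

identity = percent_identity(midline, len(query))
print(f"{identity:.2f}")
```

For this alignment, 24 of the 51 positions are identical, which reproduces the listed 47.06.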

Arabidopsis top hits (hit, e-value, % identity, alignment)
AT1G06190.1 Rho termination factor | e-value: 2.0e-05 | % identity: 47.06
Query:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
        +A +   E+ E     +  +KL EL+ +AKSRG+KG SK+KK EL+E+LGS
Subjt:  EASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

AT4G18740.1 Rho termination factor | e-value: 7.2e-19 | % identity: 39.58
Query:  EPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAK------ESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEM
        +PS R+T          P  +NQEEII+L ++IQ+SI+K      E   + DES K E+    +IL+ L +SRK+ +G TS K                 
Subjt:  EPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAK------ESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEM

Query:  QDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEAS-QAIAE--SRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
            PP    +L RP S FVKR+P+ S   G    L V  S +A+ +   +E K + IE MKL ELK +AK+RGIKGYSKL+K+EL+E++ S
Subjt:  QDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEAS-QAIAE--SRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

AT4G18740.2 Rho termination factor | e-value: 1.3e-12 | % identity: 34.39
Query:  EPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAK------ESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEM
        +PS R+T          P  +NQEEII+L ++IQ+SI+K      E   + DES K E+    +IL+ L +SRK+ +G TS K                 
Subjt:  EPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAK------ESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEM

Query:  QDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
            PP    +L RP S FVKR+P+ S   G                              ELK +AK+RGIKGYSKL+K+EL+E++ S
Subjt:  QDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS

AT4G18740.3 Rho termination factor | e-value: 1.3e-12 | % identity: 34.39
Query:  EPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAK------ESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEM
        +PS R+T          P  +NQEEII+L ++IQ+SI+K      E   + DES K E+    +IL+ L +SRK+ +G TS K                 
Subjt:  EPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAK------ESATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEM

Query:  QDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
            PP    +L RP S FVKR+P+ S   G                              ELK +AK+RGIKGYSKL+K+EL+E++ S
Subjt:  QDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIENMKLTELKALAKSRGIKGYSKLKKNELMEVLGS


Sequences
CDS sequence
ATGGAAGCAATAGTTTTCCAGCCTCGGATTCTAATCCGCTTTCCCAATTTGGTTTCTTTGGGAAGGAGACCCACGTTCGCTTCGAAAGACAATGCAGATGTTTATCCCAG
TAAGAGCATTCAACTATCTGTTACAAACAATTGGCCAGATGGAAATGCAGGGAATCGGCCTCCTCGTAGAATCTCTGCGCCAGGAAAAAGGAGGAAGAATGAACCTTCTT
CGAGGAAAACAGAAGCCCACAAGGAGGAAGACCTAAAAAAACCAAAATTCAATAACCAGGAGGAAATAATTGCTCTGTTCAGAAAGATTCAGACTTCCATTGCTAAAGAA
TCCGCAACCTCTGATGATGAATCCTGCAAGGATGAACAACAAGGAGCCGATTCTATTTTAGAGACTCTTCGTGAATCAAGGAAGCAAGTGAAAGGCAAAACTTCTAAGAA
GGCAGGAGCTAAAGTGTTGAGAAGTAAAGGCACGTCTGAAGAGAAGGAAATGCAGGATCCTTCACCACCAGCTGCAGATTTCCAGTTGGTTCGACCACTCTCTAAATTTG
TGAAGAGATCACCAATTCCGTCTCCCCCACGAGGCCGTGGTTCACACCTTAGAGTTGAGGCATCTCAGGCCATAGCTGAAAGCAGGGAGTTGAAGTTCACAAGTATAGAG
AATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCCAAATTGAAGAAAAATGAGCTCATGGAAGTCCTGGGATCCTAA
mRNA sequence
ATGGAAGCAATAGTTTTCCAGCCTCGGATTCTAATCCGCTTTCCCAATTTGGTTTCTTTGGGAAGGAGACCCACGTTCGCTTCGAAAGACAATGCAGATGTTTATCCCAG
TAAGAGCATTCAACTATCTGTTACAAACAATTGGCCAGATGGAAATGCAGGGAATCGGCCTCCTCGTAGAATCTCTGCGCCAGGAAAAAGGAGGAAGAATGAACCTTCTT
CGAGGAAAACAGAAGCCCACAAGGAGGAAGACCTAAAAAAACCAAAATTCAATAACCAGGAGGAAATAATTGCTCTGTTCAGAAAGATTCAGACTTCCATTGCTAAAGAA
TCCGCAACCTCTGATGATGAATCCTGCAAGGATGAACAACAAGGAGCCGATTCTATTTTAGAGACTCTTCGTGAATCAAGGAAGCAAGTGAAAGGCAAAACTTCTAAGAA
GGCAGGAGCTAAAGTGTTGAGAAGTAAAGGCACGTCTGAAGAGAAGGAAATGCAGGATCCTTCACCACCAGCTGCAGATTTCCAGTTGGTTCGACCACTCTCTAAATTTG
TGAAGAGATCACCAATTCCGTCTCCCCCACGAGGCCGTGGTTCACACCTTAGAGTTGAGGCATCTCAGGCCATAGCTGAAAGCAGGGAGTTGAAGTTCACAAGTATAGAG
AATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCCAAATTGAAGAAAAATGAGCTCATGGAAGTCCTGGGATCCTAA
Protein sequence
MEAIVFQPRILIRFPNLVSLGRRPTFASKDNADVYPSKSIQLSVTNNWPDGNAGNRPPRRISAPGKRRKNEPSSRKTEAHKEEDLKKPKFNNQEEIIALFRKIQTSIAKE
SATSDDESCKDEQQGADSILETLRESRKQVKGKTSKKAGAKVLRSKGTSEEKEMQDPSPPAADFQLVRPLSKFVKRSPIPSPPRGRGSHLRVEASQAIAESRELKFTSIE
NMKLTELKALAKSRGIKGYSKLKKNELMEVLGS
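The protein sequence can be checked against the CDS by translating it. The sketch below uses the standard genetic code, which is an assumption for this nuclear gene, and a hypothetical `translate` helper; it depends on no external library. As an example it translates only the last line of the CDS as printed above, which appears to start on a codon boundary since its translation matches the last line of the protein.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence and drop a trailing stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# Last line of the CDS as listed in this record.
cds_tail = (
    "AATATGAAACTTACCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCC"
    "AAATTGAAGAAAAATGAGCTCATGGAAGTCCTGGGATCCTAA"
)
protein_tail = translate(cds_tail)
print(protein_tail)
```

Passing the full CDS, joined into one string, to the same function should give the complete protein for comparison with the sequence above.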