CuGenDBv2

Sed0009144 (gene) of Chayote v1 genome

Gene ID: Sed0009144
Organism: Sechium edule (Chayote v1)
Description: Rho_N domain-containing protein
Genome location: LG08:35902002..35903606
RNA-Seq Expression: Sed0009144
Synteny: Sed0009144
Gene Ontology terms: GO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domains: IPR011112 - Rho termination factor, N-terminal
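
The genome location field can be split into its parts to get the length of the annotated gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the helper name parse_location is illustrative and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG08:35902002..35903606")
# Assumption: 1-based, inclusive coordinates, hence the +1.
span_length = end - start + 1
print(seqid, start, end, span_length)  # LG08 35902002 35903606 1605
```

Under that assumption the gene spans 1605 bp on LG08.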


Homology
GenBank top hits | e value | % identity
KGN65614.1 hypothetical protein Csa_019894 [Cucumis sativus] | 1.1e-69 | 65.5
Query:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEII
        MEAVVF  R LIRFPN +S RRRP F  KD+AD   YPSK+IQ SVS +  DGNAG +PPRR       RK+E SS+K  TPK+EE ++K ++ +QEE+I
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEII

Query:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGE
        ALFRKIQTSIAKESA++ D++S KDEN   SILETL ESRKQ+KG K SK+AG +VL   G +EEKEM D +  PA DFKL RPPS FVKRSPIP     
Subjt:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGE

Query:  NGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
              VD S+AI+ESRELKFPS +NMKLTELKALAKSRGIKGYSKLKKNEL+EILRS
Subjt:  NGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

XP_022139843.1 uncharacterized protein LOC111010657 isoform X2 [Momordica charantia] | 1.8e-69 | 65.76
Query:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPR------RTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIAL
        MEAVVFQSR L RFPN VSF RRPIF LK+IA      SK IQ+SV+SNG  GNAG +PPR      RTRKNE +  +     E L+ PKS NQEEIIAL
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPR------RTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIAL

Query:  FRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGL
        FRKIQTSIAK+SATT D+DS++DE G +SILE+L ESRKQ+KG + SK+AG++VL + G +EE EM   T PA +FKL RPPS FVKRSPIPS  G NG 
Subjt:  FRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGL

Query:  LNGVDE--SKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
         + + E  S+AI+ESRE+KFPS++NMKLTELKA+AKSRGIKGYSKLKKNELLE+LRS
Subjt:  LNGVDE--SKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

XP_038893910.1 SAP-like protein BP-73 isoform X1 [Benincasa hispida] | 8.2e-70 | 61.21
Query:  MEAVVFQSRALIRFPNFVSFRRRPIF---------------------DLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKKT
        MEAV+FQ R LIRFP  VS  RRP F                     +L DIAD   YPSK IQLSVS+N  DG AG +PPRR      TRKNE SS+KT
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIF---------------------DLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKKT

Query:  P--KNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGT---KISKRAGIEVLTKPGTTEEKEMPDQTLPAI
            NEE L+K K  NQEEIIALFRKI+TSIAKESA+++D++S KDE+G +SILETL ESRKQ+K     K SK+AG + L   GT+EEKE+ D + PA 
Subjt:  P--KNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGT---KISKRAGIEVLTKPGTTEEKEMPDQTLPAI

Query:  DFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
        DF+L RPPS FVKRSPIPS    NG    VD ++AI+ESRELKFPSI NMKLTELKALAKSRGIKGYSKLKKNEL+E+L S
Subjt:  DFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

XP_038893911.1 SAP-like protein BP-73 isoform X2 [Benincasa hispida] | 4.4e-71 | 62.23
Query:  MEAVVFQSRALIRFPNFVSFRRRPIF---------------------DLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKKT
        MEAV+FQ R LIRFP  VS  RRP F                     +L DIAD   YPSK IQLSVS+N  DG AG +PPRR      TRKNE SS+KT
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIF---------------------DLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKKT

Query:  P--KNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFK
            NEE L+K K  NQEEIIALFRKI+TSIAKESA+++D++S KDE+G +SILETL ESRKQ+KG K SK+AG + L   GT+EEKE+ D + PA DF+
Subjt:  P--KNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFK

Query:  LTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
        L RPPS FVKRSPIPS    NG    VD ++AI+ESRELKFPSI NMKLTELKALAKSRGIKGYSKLKKNEL+E+L S
Subjt:  LTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

XP_038893912.1 SAP-like protein BP-73 isoform X3 [Benincasa hispida] | 3.6e-73 | 66.15
Query:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKKTP--KNEEGLQKPKSKNQEEII
        MEAV+FQ R LIRFP  VS  RRP F  KDIAD   YPSK IQLSVS+N  DG AG +PPRR      TRKNE SS+KT    NEE L+K K  NQEEII
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKKTP--KNEEGLQKPKSKNQEEII

Query:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGT---KISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSST
        ALFRKI+TSIAKESA+++D++S KDE+G +SILETL ESRKQ+K     K SK+AG + L   GT+EEKE+ D + PA DF+L RPPS FVKRSPIPS  
Subjt:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGT---KISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSST

Query:  GENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
          NG    VD ++AI+ESRELKFPSI NMKLTELKALAKSRGIKGYSKLKKNEL+E+L S
Subjt:  GENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

TrEMBL top hits | e value | % identity
A0A0A0LX66 Rho_N domain-containing protein | 5.2e-70 | 65.5
Query:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEII
        MEAVVF  R LIRFPN +S RRRP F  KD+AD   YPSK+IQ SVS +  DGNAG +PPRR       RK+E SS+K  TPK+EE ++K ++ +QEE+I
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEII

Query:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGE
        ALFRKIQTSIAKESA++ D++S KDEN   SILETL ESRKQ+KG K SK+AG +VL   G +EEKEM D +  PA DFKL RPPS FVKRSPIP     
Subjt:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGE

Query:  NGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
              VD S+AI+ESRELKFPS +NMKLTELKALAKSRGIKGYSKLKKNEL+EILRS
Subjt:  NGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

A0A1S3BFQ9 SAP-like protein BP-73 | 1.8e-62 | 61.42
Query:  VFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEIIALFR
        +F+ +  +RF +        + +L DIAD   YPSK+IQLSVS+N  DGNA  +PPRR      TRK+E SS+K   PKNEE ++K K  +QEEIIALFR
Subjt:  VFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEIIALFR

Query:  KIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGENGLL
        KIQ SIAKESA++ D++S+KDE+G  SILETL E RKQ+KG K SK+AG +V    GT+EEKEM D +  PA DFKL RPPS FVKRSPIP         
Subjt:  KIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGENGLL

Query:  NGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
          VD S+AI+ESRELKFPSI+NMKL ELKALAKSRGIKGYSKLKKNEL+EILRS
Subjt:  NGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

A0A5A7UMB7 SAP-like protein BP-73 | 1.8e-62 | 61.42
Query:  VFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEIIALFR
        +F+ +  +RF +        + +L DIAD   YPSK+IQLSVS+N  DGNA  +PPRR      TRK+E SS+K   PKNEE ++K K  +QEEIIALFR
Subjt:  VFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRR------TRKNEHSSKK--TPKNEEGLQKPKSKNQEEIIALFR

Query:  KIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGENGLL
        KIQ SIAKESA++ D++S+KDE+G  SILETL E RKQ+KG K SK+AG +V    GT+EEKEM D +  PA DFKL RPPS FVKRSPIP         
Subjt:  KIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQT-LPAIDFKLTRPPSNFVKRSPIPSSTGENGLL

Query:  NGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
          VD S+AI+ESRELKFPSI+NMKL ELKALAKSRGIKGYSKLKKNEL+EILRS
Subjt:  NGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X2 | 8.9e-70 | 65.76
Query:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPR------RTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIAL
        MEAVVFQSR L RFPN VSF RRPIF LK+IA      SK IQ+SV+SNG  GNAG +PPR      RTRKNE +  +     E L+ PKS NQEEIIAL
Subjt:  MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPR------RTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIAL

Query:  FRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGL
        FRKIQTSIAK+SATT D+DS++DE G +SILE+L ESRKQ+KG + SK+AG++VL + G +EE EM   T PA +FKL RPPS FVKRSPIPS  G NG 
Subjt:  FRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGL

Query:  LNGVDE--SKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
         + + E  S+AI+ESRE+KFPS++NMKLTELKA+AKSRGIKGYSKLKKNELLE+LRS
Subjt:  LNGVDE--SKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

A0A6J1K6B8 uncharacterized protein LOC111490473 | 1.8e-62 | 61.09
Query:  MEAVVFQSRALIRFPNFVSF-RRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQP------PRRTRKNEHSSKKTPKNE-EGLQKPKSKNQEEII
        MEAVVFQSR LIRFPN VSF RRRPIF LK+IADG  Y S SIQL+VSSNG DGNAG+QP      P RTRKN  S +KT  ++ E ++KPKS NQEEII
Subjt:  MEAVVFQSRALIRFPNFVSF-RRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQP------PRRTRKNEHSSKKTPKNE-EGLQKPKSKNQEEII

Query:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGEN
        ALFRKIQTSIA+E+A++ D+DSNKDE+G +SILE L ESRKQ+KG K  K AG++ L +  T+E          A +FKL RPPSNFVKRSPIP+  G N
Subjt:  ALFRKIQTSIAKESATTDDDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGEN

Query:  GLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
        G                    +++NMKL ELKA+AKSRGIKGYSKLKKNELLE+L S
Subjt:  GLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

SwissProt top hits | e value | % identity
Q8L4E7 SAP-like protein BP-73 | 1.3e-06 | 54
Query:  VDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL
        +DES   +    L  P +  +K+TEL+ LAKSRGIKGYSK+KKN+L+E+L
Subjt:  VDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL

Arabidopsis top hits | e value | % identity
AT1G06190.1 Rho termination factor | 9.9e-05 | 45.45
Query:  LNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS
        L+  DE    +E   +K   +  +KL EL+ +AKSRG+KG SK+KK EL+E+L S
Subjt:  LNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEILRS

AT4G18740.1 Rho termination factor | 3.4e-21 | 38.05
Query:  GNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENG-----VQSILETLLESRKQMKGTKISKRAGIE
        G + Y    R++K     KK   ++     P   NQEEII+L ++IQ+SI+K  +   +++ N DE+       ++IL+ L +SRK+ +           
Subjt:  GNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENG-----VQSILETLLESRKQMKGTKISKRAGIE

Query:  VLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSS-TGENGLLNGVDESKAISE--SRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELL
             G T  KE P    P    +L RPPS+FVKR+P+ SS +G  G L   +  KA+ +   +E K   I+ MKL ELK +AK+RGIKGYSKL+K+ELL
Subjt:  VLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSS-TGENGLLNGVDESKAISE--SRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELL

Query:  EILRS
        E++RS
Subjt:  EILRS

AT4G18740.2 Rho termination factor | 5.2e-14 | 33.17
Query:  GNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENG-----VQSILETLLESRKQMKGTKISKRAGIE
        G + Y    R++K     KK   ++     P   NQEEII+L ++IQ+SI+K  +   +++ N DE+       ++IL+ L +SRK+ +           
Subjt:  GNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENG-----VQSILETLLESRKQMKGTKISKRAGIE

Query:  VLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL
             G T  KE P    P    +L RPPS+FVKR+P+ SS                                 ELK +AK+RGIKGYSKL+K+ELLE++
Subjt:  VLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL

Query:  RS
        RS
Subjt:  RS

AT4G18740.3 Rho termination factor | 5.2e-14 | 33.17
Query:  GNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENG-----VQSILETLLESRKQMKGTKISKRAGIE
        G + Y    R++K     KK   ++     P   NQEEII+L ++IQ+SI+K  +   +++ N DE+       ++IL+ L +SRK+ +           
Subjt:  GNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTDDDDSNKDENG-----VQSILETLLESRKQMKGTKISKRAGIE

Query:  VLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL
             G T  KE P    P    +L RPPS+FVKR+P+ SS                                 ELK +AK+RGIKGYSKL+K+ELLE++
Subjt:  VLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL

Query:  RS
        RS
Subjt:  RS
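
The % identity values in the hit lists can be checked against the unlabeled middle row of each alignment. The following is a minimal sketch, assuming BLAST-style conventions: letters on the middle row mark identical residues, '+' marks conservative substitutions, and the alignment length is the number of columns in the Query rows (gap characters included). The function name midline_stats is illustrative and not part of CuGenDB. It uses the SwissProt hit Q8L4E7 from this page as input.

```python
def midline_stats(query_rows, midline_rows):
    """Return (identical, positives, aligned_columns) for one alignment.

    query_rows / midline_rows: the Query rows and the unlabeled middle
    rows of the alignment, one string per block, without the row labels.
    """
    # Use the Query rows for the length, since trailing spaces on the
    # middle row may have been lost when the page was copied.
    aligned = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    positives = identical + sum(row.count("+") for row in midline_rows)
    return identical, positives, aligned


# Alignment rows of the SwissProt hit Q8L4E7, copied from the record.
query = ["VDESKAISESRELKFPSIDNMKLTELKALAKSRGIKGYSKLKKNELLEIL"]
midline = ["+DES   +    L  P +  +K+TEL+ LAKSRGIKGYSK+KKN+L+E+L"]

identical, positives, aligned = midline_stats(query, midline)
percent_identity = 100 * identical / aligned
print(identical, aligned, percent_identity)  # 27 50 54.0
```

For this hit the count gives 27 identical residues over 50 aligned columns, which agrees with the listed identity of 54.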


Sequences
CDS sequence
ATGGAAGCAGTAGTTTTCCAGTCTCGAGCTCTAATCCGATTTCCCAATTTTGTCTCTTTTAGAAGGAGACCCATTTTCGATTTGAAAGACATTGCAGATGGTTATCCATA
TCCTAGCAAGAGTATTCAACTATCTGTTTCAAGCAATGGAGCAGATGGAAATGCAGGGTATCAGCCTCCTCGAAGAACAAGGAAGAATGAACATTCCTCAAAGAAAACAC
CTAAGAATGAGGAAGGCCTACAAAAACCCAAATCAAAAAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGAATCCGCAACCACCGAC
GATGATGATTCCAACAAGGATGAAAATGGAGTCCAGTCTATTTTGGAAACTCTTCTTGAATCAAGGAAGCAAATGAAAGGCACAAAAATTTCAAAGAGGGCAGGAATTGA
AGTGTTAACAAAACCAGGCACGACTGAGGAGAAGGAAATGCCTGATCAGACTTTGCCAGCTATAGATTTTAAGTTAACACGACCACCATCTAACTTTGTGAAGAGATCAC
CAATCCCATCTTCCACAGGCGAAAATGGTTTGCTTAATGGAGTGGATGAATCTAAGGCCATATCTGAAAGCAGGGAGTTGAAGTTCCCAAGCATTGATAATATGAAACTT
ACCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCCAAATTGAAGAAAAATGAGCTCCTGGAAATCCTAAGATCCTAA
mRNA sequence
ATGGAAGCAGTAGTTTTCCAGTCTCGAGCTCTAATCCGATTTCCCAATTTTGTCTCTTTTAGAAGGAGACCCATTTTCGATTTGAAAGACATTGCAGATGGTTATCCATA
TCCTAGCAAGAGTATTCAACTATCTGTTTCAAGCAATGGAGCAGATGGAAATGCAGGGTATCAGCCTCCTCGAAGAACAAGGAAGAATGAACATTCCTCAAAGAAAACAC
CTAAGAATGAGGAAGGCCTACAAAAACCCAAATCAAAAAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGAATCCGCAACCACCGAC
GATGATGATTCCAACAAGGATGAAAATGGAGTCCAGTCTATTTTGGAAACTCTTCTTGAATCAAGGAAGCAAATGAAAGGCACAAAAATTTCAAAGAGGGCAGGAATTGA
AGTGTTAACAAAACCAGGCACGACTGAGGAGAAGGAAATGCCTGATCAGACTTTGCCAGCTATAGATTTTAAGTTAACACGACCACCATCTAACTTTGTGAAGAGATCAC
CAATCCCATCTTCCACAGGCGAAAATGGTTTGCTTAATGGAGTGGATGAATCTAAGGCCATATCTGAAAGCAGGGAGTTGAAGTTCCCAAGCATTGATAATATGAAACTT
ACCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCCAAATTGAAGAAAAATGAGCTCCTGGAAATCCTAAGATCCTAA
Protein sequence
MEAVVFQSRALIRFPNFVSFRRRPIFDLKDIADGYPYPSKSIQLSVSSNGADGNAGYQPPRRTRKNEHSSKKTPKNEEGLQKPKSKNQEEIIALFRKIQTSIAKESATTD
DDDSNKDENGVQSILETLLESRKQMKGTKISKRAGIEVLTKPGTTEEKEMPDQTLPAIDFKLTRPPSNFVKRSPIPSSTGENGLLNGVDESKAISESRELKFPSIDNMKL
TELKALAKSRGIKGYSKLKKNELLEILRS
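
The protein sequence can be cross-checked against the CDS by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name translate and the table layout are illustrative and not taken from CuGenDB. Only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in the compact TCAG ordering:
# codons are enumerated TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 10 codons and last 7 codons of the CDS listed in this record.
print(translate("ATGGAAGCAGTAGTTTTCCAGTCTCGAGCT"))  # MEAVVFQSRA
print(translate("CTGGAAATCCTAAGATCCTAA"))           # LEILRS
```

Both fragments match the start (MEAVVFQSRA) and end (LEILRS) of the protein sequence listed in this record; passing the full CDS, with its line breaks, to translate should allow the same comparison over the whole sequence.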