; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004187 (gene) of Snake gourd v1 genome

Gene IDTan0004187
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRho_N domain-containing protein
Genome locationLG05:76488304..76489662
RNA-Seq ExpressionTan0004187
SyntenyTan0004187
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal
IPR036269 - Rho termination factor, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139843.1 uncharacterized protein LOC111010657 isoform X2 [Momordica charantia]2.2e-9178.74Show/hide
Query:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALF
        MEAV+FQSRTL RFPNLVSFGRRPIFALK+IA    S+ IQ+SV+SN   GNAG RPPRRSS PGRTRKNEP+R   A    EDLK PKSNNQEEIIALF
Subjt:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALF

Query:  RKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEE---YHTSPAADFKLVRPPSKFVKRSPIPSPPGGNG--LH
        RKIQTSIAK+SA+T DEDS++DE GAE ILE+LRESRKQVKG+ SK+AGVKVLRRK  SEE   YHTSPAA+FKLVRPPSKFVKRSPIPSPPG NG    
Subjt:  RKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEE---YHTSPAADFKLVRPPSKFVKRSPIPSPPGGNG--LH

Query:  IRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        +R E SQAIAESRE+KFPS+ENMKL+ELKA+AKSRGIKGYSKLKKNELLELLRS
Subjt:  IRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

XP_022994883.1 uncharacterized protein LOC111490473 [Cucurbita maxima]2.1e-8174.1Show/hide
Query:  MEAVIFQSRTLIRFPNLVSF-GRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPS-RKTDAHKNEEDLKKPKSNNQEEIIA
        MEAV+FQSRTLIRFPNLVSF  RRPIF LK+IADGY S SIQL+VSSN  DGNAGH+P RRSSAPGRTRKN PS RKTD HKN ED+KKPKSNNQEEIIA
Subjt:  MEAVIFQSRTLIRFPNLVSF-GRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPS-RKTDAHKNEEDLKKPKSNNQEEIIA

Query:  LFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRV
        LFRKIQTSIA+E+AS+ DEDSNKDE G E ILE L ESRKQVKGK  K AGVK LRRK TSE      AA+FKLVRPPS FVKRSPIP+P GGNG H+  
Subjt:  LFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRV

Query:  ETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
                       ++ENMKL ELKA+AKSRGIKGYSKLKKNELLELL S
Subjt:  ETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

XP_038893910.1 SAP-like protein BP-73 isoform X1 [Benincasa hispida]5.9e-9273.12Show/hide
Query:  MEAVIFQSRTLIRFPNLVSFGRRPIFA---------------------LKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEP-SRKTDA
        MEAVIFQ R LIRFP LVS GRRP FA                     L DIAD YPS+ IQLSVS+NRPDG AG+RPPRR SAPG+TRKNEP SRKT+A
Subjt:  MEAVIFQSRTLIRFPNLVSFGRRPIFA---------------------LKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEP-SRKTDA

Query:  HKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVK----GKVSKRAGVKVLRRKSTSEEYH----TSPAADF
        H NEEDLKK K NNQEEIIALFRKI+TSIAKESAS+NDE+S KDEHGAE ILETLRESRKQVK    GK SK+AG K LR + TSEE      + PAADF
Subjt:  HKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVK----GKVSKRAGVKVLRRKSTSEEYH----TSPAADF

Query:  KLVRPPSKFVKRSPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        +LVRPPSKFVKRSPIPSPP GNG H RV+ +QAIAESRELKFPSI+NMKL+ELKALAKSRGIKGYSKLKKNEL+ELL S
Subjt:  KLVRPPSKFVKRSPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

XP_038893911.1 SAP-like protein BP-73 isoform X2 [Benincasa hispida]1.1e-9374.18Show/hide
Query:  MEAVIFQSRTLIRFPNLVSFGRRPIFA---------------------LKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEP-SRKTDA
        MEAVIFQ R LIRFP LVS GRRP FA                     L DIAD YPS+ IQLSVS+NRPDG AG+RPPRR SAPG+TRKNEP SRKT+A
Subjt:  MEAVIFQSRTLIRFPNLVSFGRRPIFA---------------------LKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEP-SRKTDA

Query:  HKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYH----TSPAADFKLVR
        H NEEDLKK K NNQEEIIALFRKI+TSIAKESAS+NDE+S KDEHGAE ILETLRESRKQVKGK SK+AG K LR + TSEE      + PAADF+LVR
Subjt:  HKNEEDLKKPKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYH----TSPAADFKLVR

Query:  PPSKFVKRSPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        PPSKFVKRSPIPSPP GNG H RV+ +QAIAESRELKFPSI+NMKL+ELKALAKSRGIKGYSKLKKNEL+ELL S
Subjt:  PPSKFVKRSPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

XP_038893912.1 SAP-like protein BP-73 isoform X3 [Benincasa hispida]1.1e-9579.07Show/hide
Query:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEP-SRKTDAHKNEEDLKKPKSNNQEEIIAL
        MEAVIFQ R LIRFP LVS GRRP FA KDIAD YPS+ IQLSVS+NRPDG AG+RPPRR SAPG+TRKNEP SRKT+AH NEEDLKK K NNQEEIIAL
Subjt:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEP-SRKTDAHKNEEDLKKPKSNNQEEIIAL

Query:  FRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVK----GKVSKRAGVKVLRRKSTSEEYH----TSPAADFKLVRPPSKFVKRSPIPSPPGG
        FRKI+TSIAKESAS+NDE+S KDEHGAE ILETLRESRKQVK    GK SK+AG K LR + TSEE      + PAADF+LVRPPSKFVKRSPIPSPP G
Subjt:  FRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVK----GKVSKRAGVKVLRRKSTSEEYH----TSPAADFKLVRPPSKFVKRSPIPSPPGG

Query:  NGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        NG H RV+ +QAIAESRELKFPSI+NMKL+ELKALAKSRGIKGYSKLKKNEL+ELL S
Subjt:  NGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

TrEMBL top hitse value%identityAlignment
A0A0A0LX66 Rho_N domain-containing protein3.2e-8069.8Show/hide
Query:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNE-PSRKTDAHKNEEDLKKPKSNNQEEIIAL
        MEAV+F  R LIRFPNL+S  RRP FA KD+AD YPS++IQ SVS +RPDGNAG+RPPRR+S PG+ RK+E  SRKT+  K+EE +KK ++N+QEE+IAL
Subjt:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNE-PSRKTDAHKNEEDLKKPKSNNQEEIIAL

Query:  FRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYH-----TSPAADFKLVRPPSKFVKRSPIPSPPGGNGL
        FRKIQTSIAKESAS+ DE+S KDE+ +  ILETLRESRKQ+KGK SK+AG KVLR K  SEE         PAADFKLVRPPSKFVKRSPIP        
Subjt:  FRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYH-----TSPAADFKLVRPPSKFVKRSPIPSPPGGNGL

Query:  HIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
         ++V+ SQAIAESRELKFPS ENMKL+ELKALAKSRGIKGYSKLKKNEL+E+LRS
Subjt:  HIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

A0A5A7UMB7 SAP-like protein BP-732.4e-7568.53Show/hide
Query:  IFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNE-PSRKTDAHKNEEDLKKPKSNNQEEIIALFRKI
        +F+ +T +RF +L++        L DIAD YPS++IQLSVS+NRPDGNA +RPPRR+S PG+TRK+E  SRK +A KNEE++KK K N+QEEIIALFRKI
Subjt:  IFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNE-PSRKTDAHKNEEDLKKPKSNNQEEIIALFRKI

Query:  QTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYH-----TSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRV
        Q SIAKESAS+ DE+S+KDEHGA  ILETLRE RKQ+KGK SK+AG KV R K TSEE         PAADFKLVRPPSKFVKRSPIP          +V
Subjt:  QTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYH-----TSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRV

Query:  ETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        + SQAIAESRELKFPSIENMKL+ELKALAKSRGIKGYSKLKKNEL+E+LRS
Subjt:  ETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

A0A6J1CF22 uncharacterized protein LOC111010657 isoform X16.8e-7871.6Show/hide
Query:  IFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQ
        +F   T  RF +L++        L +IA    S+ IQ+SV+SN   GNAG RPPRRSS PGRTRKNEP+R   A    EDLK PKSNNQEEIIALFRKIQ
Subjt:  IFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQ

Query:  TSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEE---YHTSPAADFKLVRPPSKFVKRSPIPSPPGGNG--LHIRVE
        TSIAK+SA+T DEDS++DE GAE ILE+LRESRKQVKG+ SK+AGVKVLRRK  SEE   YHTSPAA+FKLVRPPSKFVKRSPIPSPPG NG    +R E
Subjt:  TSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEE---YHTSPAADFKLVRPPSKFVKRSPIPSPPGGNG--LHIRVE

Query:  TSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
         SQAIAESRE+KFPS+ENMKL+ELKA+AKSRGIKGYSKLKKNELLELLRS
Subjt:  TSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X21.1e-9178.74Show/hide
Query:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALF
        MEAV+FQSRTL RFPNLVSFGRRPIFALK+IA    S+ IQ+SV+SN   GNAG RPPRRSS PGRTRKNEP+R   A    EDLK PKSNNQEEIIALF
Subjt:  MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALF

Query:  RKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEE---YHTSPAADFKLVRPPSKFVKRSPIPSPPGGNG--LH
        RKIQTSIAK+SA+T DEDS++DE GAE ILE+LRESRKQVKG+ SK+AGVKVLRRK  SEE   YHTSPAA+FKLVRPPSKFVKRSPIPSPPG NG    
Subjt:  RKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEE---YHTSPAADFKLVRPPSKFVKRSPIPSPPGGNG--LH

Query:  IRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        +R E SQAIAESRE+KFPS+ENMKL+ELKA+AKSRGIKGYSKLKKNELLELLRS
Subjt:  IRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

A0A6J1K6B8 uncharacterized protein LOC1114904731.0e-8174.1Show/hide
Query:  MEAVIFQSRTLIRFPNLVSF-GRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPS-RKTDAHKNEEDLKKPKSNNQEEIIA
        MEAV+FQSRTLIRFPNLVSF  RRPIF LK+IADGY S SIQL+VSSN  DGNAGH+P RRSSAPGRTRKN PS RKTD HKN ED+KKPKSNNQEEIIA
Subjt:  MEAVIFQSRTLIRFPNLVSF-GRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPS-RKTDAHKNEEDLKKPKSNNQEEIIA

Query:  LFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRV
        LFRKIQTSIA+E+AS+ DEDSNKDE G E ILE L ESRKQVKGK  K AGVK LRRK TSE      AA+FKLVRPPS FVKRSPIP+P GGNG H+  
Subjt:  LFRKIQTSIAKESASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRV

Query:  ETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
                       ++ENMKL ELKA+AKSRGIKGYSKLKKNELLELL S
Subjt:  ETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor2.9e-0452.27Show/hide
Query:  ESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        E+ E     +  +KL EL+ +AKSRG+KG SK+KK EL+ELL S
Subjt:  ESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

AT4G18740.1 Rho termination factor2.0e-2141.76Show/hide
Query:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAE-----PILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKR
        P  +NQEEII+L ++IQ+SI+K  +   +E+ N DE   E      IL+ L +SRK+ +G  S +                  P    +L RPPS FVKR
Subjt:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAE-----PILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKR

Query:  SPIPSPPGGNGLHIRVETS-QAIAE--SRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        +P+ S   G    + V  S +A+ +   +E K   IE MKL+ELK +AK+RGIKGYSKL+K+ELLEL+RS
Subjt:  SPIPSPPGGNGLHIRVETS-QAIAE--SRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

AT4G18740.2 Rho termination factor2.1e-1536.53Show/hide
Query:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAE-----PILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKR
        P  +NQEEII+L ++IQ+SI+K  +   +E+ N DE   E      IL+ L +SRK+ +G  S +                  P    +L RPPS FVKR
Subjt:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAE-----PILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKR

Query:  SPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        +P+ S   G                              ELK +AK+RGIKGYSKL+K+ELLEL+RS
Subjt:  SPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS

AT4G18740.3 Rho termination factor2.1e-1536.53Show/hide
Query:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAE-----PILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKR
        P  +NQEEII+L ++IQ+SI+K  +   +E+ N DE   E      IL+ L +SRK+ +G  S +                  P    +L RPPS FVKR
Subjt:  PKSNNQEEIIALFRKIQTSIAKESASTNDEDSNKDEHGAE-----PILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKR

Query:  SPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS
        +P+ S   G                              ELK +AK+RGIKGYSKL+K+ELLEL+RS
Subjt:  SPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKLSELKALAKSRGIKGYSKLKKNELLELLRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCAGTAATTTTTCAGTCTCGAACTCTAATCCGTTTTCCCAATTTGGTTTCATTCGGAAGGAGGCCGATTTTCGCTTTGAAAGACATTGCAGATGGTTATCCCAG
CAGGAGCATTCAACTATCTGTTTCAAGCAATAGGCCAGATGGAAATGCAGGGCATCGGCCTCCTCGTAGAAGTTCTGCGCCTGGAAGAACCAGGAAGAATGAACCCTCGA
GGAAAACAGATGCCCATAAGAATGAGGAAGACTTAAAAAAACCAAAATCAAATAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGAA
TCTGCAAGCACCAATGATGAGGATTCCAACAAGGATGAACATGGAGCCGAGCCTATTTTGGAGACTCTTCGTGAATCAAGGAAGCAAGTAAAAGGCAAAGTTTCAAAGAG
GGCAGGAGTTAAAGTTTTGAGAAGAAAAAGTACGTCTGAAGAGTATCATACTTCGCCAGCTGCAGATTTCAAGTTAGTACGACCACCATCCAAATTTGTGAAGAGATCAC
CAATCCCGTCTCCTCCAGGAGGAAATGGTTTGCATATTAGAGTGGAGACATCTCAAGCCATAGCTGAAAGCAGGGAGTTAAAGTTCCCAAGTATAGAGAATATGAAACTT
TCCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCCAAATTGAAGAAAAATGAGCTCCTGGAACTCCTGAGATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCAGTAATTTTTCAGTCTCGAACTCTAATCCGTTTTCCCAATTTGGTTTCATTCGGAAGGAGGCCGATTTTCGCTTTGAAAGACATTGCAGATGGTTATCCCAG
CAGGAGCATTCAACTATCTGTTTCAAGCAATAGGCCAGATGGAAATGCAGGGCATCGGCCTCCTCGTAGAAGTTCTGCGCCTGGAAGAACCAGGAAGAATGAACCCTCGA
GGAAAACAGATGCCCATAAGAATGAGGAAGACTTAAAAAAACCAAAATCAAATAACCAGGAGGAAATAATTGCTCTCTTCAGAAAGATACAGACTTCCATTGCTAAGGAA
TCTGCAAGCACCAATGATGAGGATTCCAACAAGGATGAACATGGAGCCGAGCCTATTTTGGAGACTCTTCGTGAATCAAGGAAGCAAGTAAAAGGCAAAGTTTCAAAGAG
GGCAGGAGTTAAAGTTTTGAGAAGAAAAAGTACGTCTGAAGAGTATCATACTTCGCCAGCTGCAGATTTCAAGTTAGTACGACCACCATCCAAATTTGTGAAGAGATCAC
CAATCCCGTCTCCTCCAGGAGGAAATGGTTTGCATATTAGAGTGGAGACATCTCAAGCCATAGCTGAAAGCAGGGAGTTAAAGTTCCCAAGTATAGAGAATATGAAACTT
TCCGAGCTGAAAGCACTAGCAAAATCTAGAGGAATTAAGGGTTACTCCAAATTGAAGAAAAATGAGCTCCTGGAACTCCTGAGATCCTAA
Protein sequenceShow/hide protein sequence
MEAVIFQSRTLIRFPNLVSFGRRPIFALKDIADGYPSRSIQLSVSSNRPDGNAGHRPPRRSSAPGRTRKNEPSRKTDAHKNEEDLKKPKSNNQEEIIALFRKIQTSIAKE
SASTNDEDSNKDEHGAEPILETLRESRKQVKGKVSKRAGVKVLRRKSTSEEYHTSPAADFKLVRPPSKFVKRSPIPSPPGGNGLHIRVETSQAIAESRELKFPSIENMKL
SELKALAKSRGIKGYSKLKKNELLELLRS