; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026434 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026434
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRho_N domain-containing protein
Genome locationscaffold402:2338919..2340042
RNA-Seq ExpressionMS026434
SyntenyMS026434
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
InterPro domainsIPR011112 - Rho termination factor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446702.1 PREDICTED: SAP-like protein BP-73 [Cucumis melo]7.8e-7466.92Show/hide
Query:  GKSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKS
        GKSDG SSFP S S  LSQF LF K+TH RFESLLNLA+IA V  SK IQ+SV++N   GNA  RPPRR+S PG+TRK+E +  + EA    E++K  K 
Subjt:  GKSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKS

Query:  NNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPI
        N+QEEIIALFRKIQ SIAK+SA++ DE+SH+DE GA SILE+LRE RKQ+KG+TSKKAG KV R KG SEE EM+  S  PAA+FKLVRPPSKFVKRSPI
Subjt:  NNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPI

Query:  PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        P            + + SQAIAESRE+KFPS+ENMKL ELKA+AKSRGIKGYSKLKKNEL+E+LRS
Subjt:  PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

XP_011648505.1 uncharacterized protein LOC105434499, partial [Cucumis sativus]2.1e-7165.66Show/hide
Query:  KSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVT-SNGGGNAGQRPPRRSSGPGRTRKNEPNRNEA-----AEDLKNPKSN
        KSDG SSFP SNS   SQF LF K+TH RFESLLNLA++A V  SK IQ SV+ S   GNAG RPPRR+S PG+ RK+E +  +       E +K  ++N
Subjt:  KSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVT-SNGGGNAGQRPPRRSSGPGRTRKNEPNRNEA-----AEDLKNPKSN

Query:  NQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPIP
        +QEE+IALFRKIQTSIAK+SA++ DE+S +DE    SILE+LRESRKQ+KG+TSKKAG KVLR KG SEE EM+  S  PAA+FKLVRPPSKFVKRSPIP
Subjt:  NQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPIP

Query:  SPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
                   L+ + SQAIAESRE+KFPS ENMKLTELKA+AKSRGIKGYSKLKKNEL+E+LRS
Subjt:  SPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

XP_022139842.1 uncharacterized protein LOC111010657 isoform X1 [Momordica charantia]2.6e-13899.63Show/hide
Query:  MQFGYSKNCIFPGKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDL
        MQFGYSKNCIF GKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDL
Subjt:  MQFGYSKNCIFPGKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDL

Query:  KNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKR
        KNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKR
Subjt:  KNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKR

Query:  SPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        SPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
Subjt:  SPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

XP_022139843.1 uncharacterized protein LOC111010657 isoform X2 [Momordica charantia]4.4e-10999.55Show/hide
Query:  LAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILES
        L EIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILES
Subjt:  LAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILES

Query:  LRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVA
        LRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVA
Subjt:  LRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVA

Query:  KSRGIKGYSKLKKNELLELLRS
        KSRGIKGYSKLKKNELLELLRS
Subjt:  KSRGIKGYSKLKKNELLELLRS

XP_038893911.1 SAP-like protein BP-73 isoform X2 [Benincasa hispida]1.7e-6870.39Show/hide
Query:  SLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHED
        +LLNLA+IA V  SKGIQ+SV++N   G AG RPPRR S PG+TRKNEP+  + EA    EDLK  K NNQEEIIALFRKI+TSIAK+SA++ DE+S +D
Subjt:  SLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHED

Query:  EIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS-PAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVE
        E GAESILE+LRESRKQVKG++SKKAG K LR +G SEE E++  S PAA+F+LVRPPSKFVKRSPIPSPP  NGS    R + +QAIAESRE+KFPS++
Subjt:  EIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS-PAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVE

Query:  NMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        NMKLTELKA+AKSRGIKGYSKLKKNEL+ELL S
Subjt:  NMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

TrEMBL top hitse value%identityAlignment
A0A1S3BFQ9 SAP-like protein BP-733.8e-7466.92Show/hide
Query:  GKSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKS
        GKSDG SSFP S S  LSQF LF K+TH RFESLLNLA+IA V  SK IQ+SV++N   GNA  RPPRR+S PG+TRK+E +  + EA    E++K  K 
Subjt:  GKSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKS

Query:  NNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPI
        N+QEEIIALFRKIQ SIAK+SA++ DE+SH+DE GA SILE+LRE RKQ+KG+TSKKAG KV R KG SEE EM+  S  PAA+FKLVRPPSKFVKRSPI
Subjt:  NNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPI

Query:  PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        P            + + SQAIAESRE+KFPS+ENMKL ELKA+AKSRGIKGYSKLKKNEL+E+LRS
Subjt:  PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

A0A5A7UMB7 SAP-like protein BP-733.8e-7466.92Show/hide
Query:  GKSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKS
        GKSDG SSFP S S  LSQF LF K+TH RFESLLNLA+IA V  SK IQ+SV++N   GNA  RPPRR+S PG+TRK+E +  + EA    E++K  K 
Subjt:  GKSDGGSSFPVSNST-LSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSN-GGGNAGQRPPRRSSGPGRTRKNEPN--RNEA---AEDLKNPKS

Query:  NNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPI
        N+QEEIIALFRKIQ SIAK+SA++ DE+SH+DE GA SILE+LRE RKQ+KG+TSKKAG KV R KG SEE EM+  S  PAA+FKLVRPPSKFVKRSPI
Subjt:  NNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTS--PAAEFKLVRPPSKFVKRSPI

Query:  PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        P            + + SQAIAESRE+KFPS+ENMKL ELKA+AKSRGIKGYSKLKKNEL+E+LRS
Subjt:  PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

A0A6J1CF22 uncharacterized protein LOC111010657 isoform X11.3e-13899.63Show/hide
Query:  MQFGYSKNCIFPGKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDL
        MQFGYSKNCIF GKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDL
Subjt:  MQFGYSKNCIFPGKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDL

Query:  KNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKR
        KNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKR
Subjt:  KNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKR

Query:  SPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        SPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
Subjt:  SPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

A0A6J1CGN3 uncharacterized protein LOC111010657 isoform X22.1e-10999.55Show/hide
Query:  LAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILES
        L EIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILES
Subjt:  LAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILES

Query:  LRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVA
        LRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVA
Subjt:  LRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVA

Query:  KSRGIKGYSKLKKNELLELLRS
        KSRGIKGYSKLKKNELLELLRS
Subjt:  KSRGIKGYSKLKKNELLELLRS

A0A6J1GSF6 uncharacterized protein LOC1114567104.3e-6262.36Show/hide
Query:  KSDGGSSFPVSN-STLSQFGLF-WKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGG-GNAGQRPPRRSSGPGRTRKNEPNRNEA----AEDLKNPKSN
        KS+G SS  VSN +   QFGLF  +ETHF FESLLNLAEIA    SK IQ++V+SNG  G  G +P RRSS PGRTRKN  +  +      ED+K PKSN
Subjt:  KSDGGSSFPVSN-STLSQFGLF-WKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGG-GNAGQRPPRRSSGPGRTRKNEPNRNEA----AEDLKNPKSN

Query:  NQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSP
        NQEEIIALFRKIQTSIA+++A++ DE+S++DE G ESILE+L ESRKQVKG+T K AGVK LRR G SE         AAEFKLVRPPS FVKRSPIPSP
Subjt:  NQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSP

Query:  PGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
         G NG+                     +VENMKL ELKAVAKSRGIKGYSKLKKNELLELL S
Subjt:  PGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

SwissProt top hitse value%identityAlignment
Q8L4E7 SAP-like protein BP-733.6e-0565.71Show/hide
Query:  PSVENMKLTELKAVAKSRGIKGYSKLKKNELLELL
        P +  +K+TEL+ +AKSRGIKGYSK+KKN+L+ELL
Subjt:  PSVENMKLTELKAVAKSRGIKGYSKLKKNELLELL

Arabidopsis top hitse value%identityAlignment
AT1G06190.1 Rho termination factor1.8e-0452.27Show/hide
Query:  ESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        E+ E     +  +KL EL+ +AKSRG+KG SK+KK EL+ELL S
Subjt:  ESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

AT4G18740.1 Rho termination factor9.6e-2241.38Show/hide
Query:  NPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAE-----SILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSK
        NP  +NQEEII+L ++IQ+SI+K  +   +E+ + DE   E     +IL+ L +SRK+ +G TS            + E+P      P  + +L RPPS 
Subjt:  NPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAE-----SILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSK

Query:  FVKRSPI-PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        FVKR+P+  S  G  G       + +      +E K   +E MKL ELK VAK+RGIKGYSKL+K+ELLEL+RS
Subjt:  FVKRSPI-PSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

AT4G18740.2 Rho termination factor1.6e-1638.15Show/hide
Query:  NPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAE-----SILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSK
        NP  +NQEEII+L ++IQ+SI+K  +   +E+ + DE   E     +IL+ L +SRK+ +G TS            + E+P      P  + +L RPPS 
Subjt:  NPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAE-----SILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSK

Query:  FVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        FVKR+P+ S    +G R                            ELK VAK+RGIKGYSKL+K+ELLEL+RS
Subjt:  FVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS

AT4G18740.3 Rho termination factor1.6e-1638.15Show/hide
Query:  NPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAE-----SILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSK
        NP  +NQEEII+L ++IQ+SI+K  +   +E+ + DE   E     +IL+ L +SRK+ +G TS            + E+P      P  + +L RPPS 
Subjt:  NPKSNNQEEIIALFRKIQTSIAKDSATTKDEDSHEDEIGAE-----SILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSK

Query:  FVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS
        FVKR+P+ S    +G R                            ELK VAK+RGIKGYSKL+K+ELLEL+RS
Subjt:  FVKRSPIPSPPGRNGSRSQLREEPSQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTTGGATATTCCAAGAACTGCATTTTTCCGGGAAAATCAGATGGAGGCAGTAGTTTTCCAGTCTCGAACTCTACGCTTTCCCAATTTGGTCTCTTTTGGAAGGA
GACCCATTTTCGCTTTGAAAGCTTGTTAAATTTGGCAGAGATTGCAGCTGTGAATTCTAGCAAGGGAATTCAAGTATCTGTTACGAGCAATGGGGGTGGAAATGCAGGGC
AACGGCCTCCCCGTAGAAGTTCTGGGCCAGGAAGGACCAGAAAGAATGAACCCAATAGAAACGAGGCAGCTGAAGACCTAAAAAATCCCAAATCAAATAACCAGGAGGAG
ATAATTGCTCTCTTCAGAAAGATTCAGACTTCCATTGCCAAGGACTCTGCAACCACCAAAGATGAAGATTCCCACGAGGATGAGATTGGAGCCGAGTCTATTTTGGAGAG
TCTTCGTGAATCAAGGAAGCAAGTCAAAGGCAGAACTTCAAAGAAGGCAGGAGTTAAAGTGTTGAGAAGAAAGGGCATCTCTGAAGAGCCGGAAATGTATCATACTTCGC
CAGCAGCAGAATTCAAGTTAGTTCGACCACCATCTAAATTCGTGAAGAGATCACCAATCCCATCTCCCCCAGGAAGAAATGGTTCACGATCACAGCTTAGGGAGGAGCCT
TCTCAGGCCATAGCTGAAAGCAGGGAAATGAAGTTCCCAAGTGTAGAGAATATGAAACTTACCGAGCTGAAAGCAGTAGCAAAATCTAGAGGAATTAAGGGTTATTCCAA
ATTGAAGAAAAATGAGCTCCTGGAACTTCTGAGATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTTGGATATTCCAAGAACTGCATTTTTCCGGGAAAATCAGATGGAGGCAGTAGTTTTCCAGTCTCGAACTCTACGCTTTCCCAATTTGGTCTCTTTTGGAAGGA
GACCCATTTTCGCTTTGAAAGCTTGTTAAATTTGGCAGAGATTGCAGCTGTGAATTCTAGCAAGGGAATTCAAGTATCTGTTACGAGCAATGGGGGTGGAAATGCAGGGC
AACGGCCTCCCCGTAGAAGTTCTGGGCCAGGAAGGACCAGAAAGAATGAACCCAATAGAAACGAGGCAGCTGAAGACCTAAAAAATCCCAAATCAAATAACCAGGAGGAG
ATAATTGCTCTCTTCAGAAAGATTCAGACTTCCATTGCCAAGGACTCTGCAACCACCAAAGATGAAGATTCCCACGAGGATGAGATTGGAGCCGAGTCTATTTTGGAGAG
TCTTCGTGAATCAAGGAAGCAAGTCAAAGGCAGAACTTCAAAGAAGGCAGGAGTTAAAGTGTTGAGAAGAAAGGGCATCTCTGAAGAGCCGGAAATGTATCATACTTCGC
CAGCAGCAGAATTCAAGTTAGTTCGACCACCATCTAAATTCGTGAAGAGATCACCAATCCCATCTCCCCCAGGAAGAAATGGTTCACGATCACAGCTTAGGGAGGAGCCT
TCTCAGGCCATAGCTGAAAGCAGGGAAATGAAGTTCCCAAGTGTAGAGAATATGAAACTTACCGAGCTGAAAGCAGTAGCAAAATCTAGAGGAATTAAGGGTTATTCCAA
ATTGAAGAAAAATGAGCTCCTGGAACTTCTGAGATCCTAA
Protein sequenceShow/hide protein sequence
MQFGYSKNCIFPGKSDGGSSFPVSNSTLSQFGLFWKETHFRFESLLNLAEIAAVNSSKGIQVSVTSNGGGNAGQRPPRRSSGPGRTRKNEPNRNEAAEDLKNPKSNNQEE
IIALFRKIQTSIAKDSATTKDEDSHEDEIGAESILESLRESRKQVKGRTSKKAGVKVLRRKGISEEPEMYHTSPAAEFKLVRPPSKFVKRSPIPSPPGRNGSRSQLREEP
SQAIAESREMKFPSVENMKLTELKAVAKSRGIKGYSKLKKNELLELLRS