; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010607 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010607
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAgglutinin domain-containing protein
Genome locationChr06:23920634..23921500
RNA-Seq ExpressionHG10010607
SyntenyHG10010607
Gene Ontology termsGO:0005576 - extracellular region (cellular component)
InterPro domainsIPR004991 - Aerolysin-like toxin
IPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140621.1 uncharacterized protein LOC101217825 [Cucumis sativus]3.4e-8357.54Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        + FRS+++ DP V  EVI TPDGH  +KNV   KF+    +  WIVLDD+ ST K+DP   FWP++++HNVVALRN   N  C   ++    + +    +
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
           I ++AKLEVI+ VLSR IY+V F+L DAR YNEKPLLMTSTIVEN NS+++ F+IKLSY+DTTTSTW  ++N   G KM  +T +PKV EG+IE   
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        ++SED YTWG+T QMK  AEVVHEV VPA TKVKAS+MAT+ASCD+PFSY QRDKL +G Y T+RYHDG+YNV +SYN+ FV E+
Subjt:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

XP_008460185.1 PREDICTED: uncharacterized protein LOC103499071 [Cucumis melo]1.7e-9061.05Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        +    S+  DP + +EVI TPDGH  +KN+ + KF+ R  +  WIVLD++SSTAKDDP  LFWP++LDHNVVALR+  +   C  ++     + +   A 
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
        L+ I D AKLEV++ VLSR+IYNV F+L DAR YNEKPLLMTSTIVENNNS+D+KF+IKLSY+DTTTSTW  ++N   G KM  ET +PKVSE +IE   
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        Q+SED YTWG+T QMK  AEVVHEV VPA TKVKAS++AT+ASCD+PFSY QRDKL +G Y T+RYHDG+YNV NSYNFHFV E+
Subjt:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

XP_022155408.1 uncharacterized protein LOC111022556 [Momordica charantia]8.6e-8754.23Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        +EFR+ D  DPG+++E++T PDGH+R+KNVPY ++W+ DP  DWI++  N ++A +D   LFWP+++D+NVVALR++ NN  CKRL+ +    +N LNA+
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
           ITDEA++EV+ELV+SR IYN+ F+L DAR+YNEKPLL+ +   EN     +K S+KLSYEDT T+TW +S++  FG K+T+ETG+PK+SEG+IE++ 
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        +  E+Y WG T+Q K L EV H+V+VPAW+KV+ SI+AT+A CDVPFSYTQRDKL+NG+ V  R  DG++   N YN+ F+AEE
Subjt:  QMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

XP_022155428.1 uncharacterized protein LOC111022575 [Momordica charantia]6.1e-10968.64Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDP-ETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNA
        ++F+ S+R DPGVR+EVITTPDGHVRIKNVPYGKF I D      I+LD+ SS    DP++LFWP++L  N VALRN+ NNCF +R++ +     N + A
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDP-ETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNA

Query:  ALDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELT
          DYITDEAK+EV+ELVLSR IYNV F+L DAR+YNE+P+ MTS +VENNNSEDQK S+KLSYEDTTTSTWSA++N +FG K+TIETG+PKVSEG++E++
Subjt:  ALDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELT

Query:  VQMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETE
         ++SE YTWGKTEQ K LAEVVH+V+VPAWTKVK SI+AT+ASCDVPFSYTQRDKL++GK+VTRRYHDGVYNV NSYNFHFV EE +
Subjt:  VQMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETE

XP_038875088.1 uncharacterized protein LOC120067616 [Benincasa hispida]5.8e-13680.21Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        +EF+SSD  DPGVRNEVI+TPDGHVRIKNVPYGKFWIRDPE +WIVLDDN STAKDDPRTLFWPV+L++NVVALRN +NNCFCKRL+++   +DN+LNAA
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
        LDYIT EA LEV ELVLSRNIYNVLF+L DAR +NE+P+ +TST+VENNNSE QKFSIKLSYEDTTTSTW+A++NG+FG KMTI+TG+PKVSEGK+E+  
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETEEV
        ++SEDYTWGKTEQMKCL+EVVHEV VPAWTKVKAS+MAT+ASCDVPFSYTQRDKL+NGKY+T RYHDGVYNV NSYNFHFVAEE EE+
Subjt:  QMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETEEV

TrEMBL top hitse value%identityAlignment
A0A0A0KDU4 Uncharacterized protein1.6e-8357.54Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        + FRS+++ DP V  EVI TPDGH  +KNV   KF+    +  WIVLDD+ ST K+DP   FWP++++HNVVALRN   N  C   ++    + +    +
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
           I ++AKLEVI+ VLSR IY+V F+L DAR YNEKPLLMTSTIVEN NS+++ F+IKLSY+DTTTSTW  ++N   G KM  +T +PKV EG+IE   
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        ++SED YTWG+T QMK  AEVVHEV VPA TKVKAS+MAT+ASCD+PFSY QRDKL +G Y T+RYHDG+YNV +SYN+ FV E+
Subjt:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

A0A1S3CD76 uncharacterized protein LOC1034990718.1e-9161.05Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        +    S+  DP + +EVI TPDGH  +KN+ + KF+ R  +  WIVLD++SSTAKDDP  LFWP++LDHNVVALR+  +   C  ++     + +   A 
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
        L+ I D AKLEV++ VLSR+IYNV F+L DAR YNEKPLLMTSTIVENNNS+D+KF+IKLSY+DTTTSTW  ++N   G KM  ET +PKVSE +IE   
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        Q+SED YTWG+T QMK  AEVVHEV VPA TKVKAS++AT+ASCD+PFSY QRDKL +G Y T+RYHDG+YNV NSYNFHFV E+
Subjt:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

A0A5D3DMB4 Agglutinin domain-containing protein8.1e-9161.05Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        +    S+  DP + +EVI TPDGH  +KN+ + KF+ R  +  WIVLD++SSTAKDDP  LFWP++LDHNVVALR+  +   C  ++     + +   A 
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
        L+ I D AKLEV++ VLSR+IYNV F+L DAR YNEKPLLMTSTIVENNNS+D+KF+IKLSY+DTTTSTW  ++N   G KM  ET +PKVSE +IE   
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        Q+SED YTWG+T QMK  AEVVHEV VPA TKVKAS++AT+ASCD+PFSY QRDKL +G Y T+RYHDG+YNV NSYNFHFV E+
Subjt:  QMSED-YTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

A0A6J1DMV7 uncharacterized protein LOC1110225564.1e-8754.23Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA
        +EFR+ D  DPG+++E++T PDGH+R+KNVPY ++W+ DP  DWI++  N ++A +D   LFWP+++D+NVVALR++ NN  CKRL+ +    +N LNA+
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAA

Query:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV
           ITDEA++EV+ELV+SR IYN+ F+L DAR+YNEKPLL+ +   EN     +K S+KLSYEDT T+TW +S++  FG K+T+ETG+PK+SEG+IE++ 
Subjt:  LDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTV

Query:  QMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE
        +  E+Y WG T+Q K L EV H+V+VPAW+KV+ SI+AT+A CDVPFSYTQRDKL+NG+ V  R  DG++   N YN+ F+AEE
Subjt:  QMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEE

A0A6J1DMX7 uncharacterized protein LOC1110225753.0e-10968.64Show/hide
Query:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDP-ETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNA
        ++F+ S+R DPGVR+EVITTPDGHVRIKNVPYGKF I D      I+LD+ SS    DP++LFWP++L  N VALRN+ NNCF +R++ +     N + A
Subjt:  MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDP-ETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNA

Query:  ALDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELT
          DYITDEAK+EV+ELVLSR IYNV F+L DAR+YNE+P+ MTS +VENNNSEDQK S+KLSYEDTTTSTWSA++N +FG K+TIETG+PKVSEG++E++
Subjt:  ALDYITDEAKLEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELT

Query:  VQMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETE
         ++SE YTWGKTEQ K LAEVVH+V+VPAWTKVK SI+AT+ASCDVPFSYTQRDKL++GK+VTRRYHDGVYNV NSYNFHFV EE +
Subjt:  VQMSEDYTWGKTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETE

SwissProt top hitse value%identityAlignment
Q66S21 Natterin-21.2e-0624.22Show/hide
Query:  NVANNCFCKRLTQNSGAYD----NALNAALDYITDEAK--------LEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYE
        N  + C   ++    GAY        N  L Y+ D A+        L + + V+ + + +V +      +   KP +M  + V N + ++   ++ L+ +
Subjt:  NVANNCFCKRLTQNSGAYD----NALNAALDYITDEAK--------LEVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYE

Query:  DTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTVQMSEDYTWG--KTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYV
         +T   W  + + +FG   T+  GIP VS   +E+++Q + D+  G  KTE    +  V   V VP       S++A     D+PF+ T       GK  
Subjt:  DTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTVQMSEDYTWG--KTEQMKCLAEVVHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYV

Query:  TRRYHDGVYNVENSYNFHFVAEE
        T+    GVY        H   E+
Subjt:  TRRYHDGVYNVENSYNFHFVAEE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTCAGAAGCTCAGATCGGGATGATCCAGGAGTGAGGAATGAGGTGATAACGACACCGGACGGTCACGTTCGTATAAAAAACGTTCCTTACGGGAAGTTTTGGAT
TCGAGATCCCGAAACAGATTGGATTGTATTGGATGATAATTCATCAACTGCAAAGGATGATCCAAGAACATTGTTTTGGCCTGTCAGACTTGATCACAATGTGGTGGCCC
TTCGTAATGTTGCCAATAATTGCTTTTGCAAGAGGCTGACTCAAAACAGTGGAGCATATGATAATGCTCTTAATGCTGCTCTTGATTACATTACTGATGAAGCAAAGTTG
GAAGTAATTGAGCTTGTTCTTTCACGTAATATTTATAATGTTCTATTCAATCTTTTTGATGCTAGGATTTATAATGAGAAGCCCCTTTTGATGACAAGCACTATTGTTGA
GAACAACAATTCTGAAGATCAAAAGTTCAGCATCAAACTCTCTTATGAAGATACTACCACCTCTACCTGGTCTGCCAGCCTCAACGGTTCGTTCGGTGCCAAGATGACGA
TCGAGACTGGCATTCCGAAAGTTTCCGAAGGAAAGATTGAACTGACAGTTCAGATGTCTGAGGATTACACATGGGGAAAAACAGAACAAATGAAATGTCTAGCTGAGGTT
GTTCATGAGGTTATTGTGCCTGCTTGGACTAAAGTTAAGGCTAGTATAATGGCAACTAAAGCTAGTTGTGATGTTCCTTTCTCCTACACTCAACGCGACAAACTCATCAA
TGGAAAGTATGTTACTCGACGATACCACGACGGTGTTTATAACGTCGAAAATTCCTATAATTTTCACTTTGTTGCTGAAGAGACTGAAGAGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTCAGAAGCTCAGATCGGGATGATCCAGGAGTGAGGAATGAGGTGATAACGACACCGGACGGTCACGTTCGTATAAAAAACGTTCCTTACGGGAAGTTTTGGAT
TCGAGATCCCGAAACAGATTGGATTGTATTGGATGATAATTCATCAACTGCAAAGGATGATCCAAGAACATTGTTTTGGCCTGTCAGACTTGATCACAATGTGGTGGCCC
TTCGTAATGTTGCCAATAATTGCTTTTGCAAGAGGCTGACTCAAAACAGTGGAGCATATGATAATGCTCTTAATGCTGCTCTTGATTACATTACTGATGAAGCAAAGTTG
GAAGTAATTGAGCTTGTTCTTTCACGTAATATTTATAATGTTCTATTCAATCTTTTTGATGCTAGGATTTATAATGAGAAGCCCCTTTTGATGACAAGCACTATTGTTGA
GAACAACAATTCTGAAGATCAAAAGTTCAGCATCAAACTCTCTTATGAAGATACTACCACCTCTACCTGGTCTGCCAGCCTCAACGGTTCGTTCGGTGCCAAGATGACGA
TCGAGACTGGCATTCCGAAAGTTTCCGAAGGAAAGATTGAACTGACAGTTCAGATGTCTGAGGATTACACATGGGGAAAAACAGAACAAATGAAATGTCTAGCTGAGGTT
GTTCATGAGGTTATTGTGCCTGCTTGGACTAAAGTTAAGGCTAGTATAATGGCAACTAAAGCTAGTTGTGATGTTCCTTTCTCCTACACTCAACGCGACAAACTCATCAA
TGGAAAGTATGTTACTCGACGATACCACGACGGTGTTTATAACGTCGAAAATTCCTATAATTTTCACTTTGTTGCTGAAGAGACTGAAGAGGTTTGA
Protein sequenceShow/hide protein sequence
MEFRSSDRDDPGVRNEVITTPDGHVRIKNVPYGKFWIRDPETDWIVLDDNSSTAKDDPRTLFWPVRLDHNVVALRNVANNCFCKRLTQNSGAYDNALNAALDYITDEAKL
EVIELVLSRNIYNVLFNLFDARIYNEKPLLMTSTIVENNNSEDQKFSIKLSYEDTTTSTWSASLNGSFGAKMTIETGIPKVSEGKIELTVQMSEDYTWGKTEQMKCLAEV
VHEVIVPAWTKVKASIMATKASCDVPFSYTQRDKLINGKYVTRRYHDGVYNVENSYNFHFVAEETEEV