; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g1245 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g1245
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUsp domain-containing protein
Genome locationMC03:18401697..18404371
RNA-Seq ExpressionMC03g1245
SyntenyMC03g1245
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019978.1 hypothetical protein SDJN02_18946, partial [Cucurbita argyrosperma subsp. argyrosperma]2.04e-10774.67Show/hide
Query:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN
        MA FWVCS P  K    +  +  +K ELLRS   GEEE SS+SSK+ SENGNRVMVVVDWS+EA+GAL+WTLSHA+ + DTI+L+HVLK+   QGF F N
Subjt:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN

Query:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC
        KVD  K AY+LLFSMRNMCLKRRPEVQVE+ALLEGK+RGP+IVEEAKKHKLSLLVLGQRKRPILRRLL RWAT   RR RKKK  TCR T EYCIQNSSC
Subjt:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC

Query:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA
        +TIAVRKKS++IGGYLITTKRH+NFWLLA
Subjt:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA

XP_022137570.1 uncharacterized protein LOC111008987 [Momordica charantia]1.44e-154100Show/hide
Query:  MAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKA
        MAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKA
Subjt:  MAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKA

Query:  YELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKK
        YELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKK
Subjt:  YELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKK

Query:  SRQIGGYLITTKRHRNFWLLA
        SRQIGGYLITTKRHRNFWLLA
Subjt:  SRQIGGYLITTKRHRNFWLLA

XP_022924122.1 uncharacterized protein LOC111431650 [Cucurbita moschata]5.83e-10774.67Show/hide
Query:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN
        MA FWVCS P  K    +  +  +K ELLRS   GEEE SS+SSK+ SENGNRVMVVVDWS+EA+GAL+WTLSHA+ +HDTI+L+HVLK+   QGF F N
Subjt:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN

Query:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC
        KV+  K AY+LLFSMRNMCLKRRPEVQVE+ALLEGKERGP+IVEEAKKHKLSLLVLGQRKR ILRRLL RWAT   RR RKKK  TCR T EYCIQNSSC
Subjt:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC

Query:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA
        +TIAVRKKS++IGGYLITTKRH+NFWLLA
Subjt:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA

XP_023001141.1 uncharacterized protein LOC111495368 [Cucurbita maxima]3.90e-10573.36Show/hide
Query:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN
        MA FWVCS P  K    +  +  +K ELL S   GEEE SS+SSK+ SENGNRVMVVVDWS+EA+GAL+WTLSHA+ +HDTI+L++VLK+   +GF F N
Subjt:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN

Query:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC
        KV+  K AY+LLFSMRNMCLKRRPEVQVE+ALLEGKERGP+IVEEAKKHKLSLLVLGQRKRPILRRLL RWAT   RR RKKK  +CR T EYCIQNSSC
Subjt:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC

Query:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA
        +TIAVRKKS++IGGYLITTKRH+NFWLLA
Subjt:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA

XP_023520317.1 uncharacterized protein LOC111783631 [Cucurbita pepo subsp. pepo]1.36e-10573.36Show/hide
Query:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN
        MA FWVCS P +K    +  +  +K ELL S   GEEE SS+SSK  S NGNRVMVVVDWS+EA+GAL+WTLSHA+ +HDTI+L+HVLK+   QGF F N
Subjt:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN

Query:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC
        KV+  K AY+LLFSMRNMCLKRRPEV VE+ALLEGKERGP+IVEEA+KHKLSLLVLGQRKRPILRRLL RWAT   RR RKKK  TCR T EYCIQNSSC
Subjt:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC

Query:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA
        +TIAVRKKS++IGGYLITTKRH+NFWLLA
Subjt:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA

TrEMBL top hitse value%identityAlignment
A0A1S3BY66 uncharacterized protein LOC1034944674.36e-9872.38Show/hide
Query:  KKNELLRSG---EEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQ-----GFGFNNKVDYIKAYELLFSMRNMC
        KK ELL SG   E + SS++ K++ +NGNRVMVVVDWS+EA+ AL+WTLSHA+  +DTI+L+HVLK+   Q     GF F NKV+YIKA++LLFSMR+MC
Subjt:  KKNELLRSG---EEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQ-----GFGFNNKVDYIKAYELLFSMRNMC

Query:  LKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITT
        LK +PEVQVE+ALLEGKERGP+IVEEAKKHKLSLLVLGQRKRP+LRRL NRWA R  RR RKKKK TCR T EYCIQNSSC+TIAVRKKS++IGGYLITT
Subjt:  LKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITT

Query:  KRHRNFWLLA
        K H+NFWLLA
Subjt:  KRHRNFWLLA

A0A5D3BIQ2 Putative Adenine nucleotide alpha hydrolases-like superfamily protein1.93e-9767.38Show/hide
Query:  MAGFWVCSGPNKKN----KKKKKKKNELLRSG---EEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQ-----G
        MA F +CS   ++N      +  +K ELL SG   E + SS++ K++ +NGNRVMVVVDWS+EA+ AL+WTLSHA+  +DTI+L+HVLK+   Q     G
Subjt:  MAGFWVCSGPNKKN----KKKKKKKNELLRSG---EEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQ-----G

Query:  FGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQ
        F F NKV+YIKA++LLFSMR+MCLK +PEVQVE+ALLEGKERGP+IVEEAKKHKLSLLVLGQRKRP+LRRL NRWA R  RR RKKKK TCR T EYCIQ
Subjt:  FGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQ

Query:  NSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA
        NSSC+TIAVRKKS++IGGYLITTK H+NFWLLA
Subjt:  NSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA

A0A6J1C707 uncharacterized protein LOC1110089876.99e-155100Show/hide
Query:  MAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKA
        MAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKA
Subjt:  MAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKA

Query:  YELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKK
        YELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKK
Subjt:  YELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKK

Query:  SRQIGGYLITTKRHRNFWLLA
        SRQIGGYLITTKRHRNFWLLA
Subjt:  SRQIGGYLITTKRHRNFWLLA

A0A6J1E832 uncharacterized protein LOC1114316502.82e-10774.67Show/hide
Query:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN
        MA FWVCS P  K    +  +  +K ELLRS   GEEE SS+SSK+ SENGNRVMVVVDWS+EA+GAL+WTLSHA+ +HDTI+L+HVLK+   QGF F N
Subjt:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN

Query:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC
        KV+  K AY+LLFSMRNMCLKRRPEVQVE+ALLEGKERGP+IVEEAKKHKLSLLVLGQRKR ILRRLL RWAT   RR RKKK  TCR T EYCIQNSSC
Subjt:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC

Query:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA
        +TIAVRKKS++IGGYLITTKRH+NFWLLA
Subjt:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA

A0A6J1KHT2 uncharacterized protein LOC1114953681.89e-10573.36Show/hide
Query:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN
        MA FWVCS P  K    +  +  +K ELL S   GEEE SS+SSK+ SENGNRVMVVVDWS+EA+GAL+WTLSHA+ +HDTI+L++VLK+   +GF F N
Subjt:  MAGFWVCSGPNKK----NKKKKKKKNELLRS---GEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNN

Query:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC
        KV+  K AY+LLFSMRNMCLKRRPEVQVE+ALLEGKERGP+IVEEAKKHKLSLLVLGQRKRPILRRLL RWAT   RR RKKK  +CR T EYCIQNSSC
Subjt:  KVDYIK-AYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSC

Query:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA
        +TIAVRKKS++IGGYLITTKRH+NFWLLA
Subjt:  LTIAVRKKSRQIGGYLITTKRHRNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.0e-3743.01Show/hide
Query:  SVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVD----------YIKAYELLFSMRNMCLKRRPEVQVEIALLEGK
        S+   G R++VVVD   EA+ AL WTLSH     D+ILLLH LK    Q     NK +            +A + + +++ MC  +RPEV+ E+  ++G 
Subjt:  SVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVD----------YIKAYELLFSMRNMCLKRRPEVQVEIALLEGK

Query:  ERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA
        E+GP IV+EA++ + SLLVLGQ+K+    RLL  WA+++       +  T    VEYCI NS C+ IAVRK+ +++GGY +TTKRH++FWLLA
Subjt:  ERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein8.8e-3342.62Show/hide
Query:  SVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEA
        S+   G R++VVVD   EA+ AL WTLSH     D+ILLLH LK    Q     NK    +  E     +    +   +V+ E+  ++G E+GP IV+EA
Subjt:  SVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEA

Query:  KKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA
        ++ + SLLVLGQ+K+    RLL  WA+++       +  T    VEYCI NS C+ IAVRK+ +++GGY +TTKRH++FWLLA
Subjt:  KKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.2e-3545.71Show/hide
Query:  MVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLE-GKERGPVIVEEAKKHKLSLL
        MVVVD + + + ALQW L+H +   D I LLHV +T   Q      +    +A+EL+  ++N C  ++P V+ EI ++E  +E+G  IVEE+KK    +L
Subjt:  MVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLE-GKERGPVIVEEAKKHKLSLL

Query:  VLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA
        VLGQRKR    R++ +W T+    G         G VEYCI NS C+ IAVRKKS   GGYLITTKRH++FWLLA
Subjt:  VLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.0e-4048.91Show/hide
Query:  VSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEG--KERGPVIVEE
        ++E GNRVMVVVD  I + GAL+W L H L + D + LL+  K   K G   N K + +K  EL+ +++ +C  +RP ++VEI  L+G  KE+G  IVEE
Subjt:  VSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEG--KERGPVIVEE

Query:  AKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA
        AK+ ++SLLV+G+ K+P + RLL RW  +  RRGR        GT++YC++ +SC+TIAV+ K+R++GGYLITTKRH+NFWLLA
Subjt:  AKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.4e-4148.09Show/hide
Query:  SENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEG--KERGPVIVEEA
        +E GNRVMVVVD ++ + GAL+W ++H L   DT+ LL+  K   K      N+   +K  EL+ +++ +C  +RP ++VEI  LEG  K++G  IVEE+
Subjt:  SENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTILLLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEG--KERGPVIVEEA

Query:  KKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA
        KK ++SLLV+GQ K+P + RLL RWA +  RRG +       G ++YC++N+SC+TIAV+ K+R++GGYLITTKRH+NFWLLA
Subjt:  KKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYCIQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATTTATGTTGCTTCGTTTTGCCACTAACTTTAAAGCTACATAAATGGGTGGGGGAAAAAATAGTGGGTGTTGAGTTTCAGAGGGTGAGAGTGAGGATTTTCATGGCTGG
CTTTTGGGTGTGTTCTGGACCCAACAAGAAGAATAAGAAGAAGAAGAAGAAGAAGAATGAGCTGTTGAGAAGTGGGGAAGAAGAGTTGAGCTCCAACAGCTCAAAATCTG
TGTCTGAAAATGGAAACAGAGTAATGGTGGTTGTGGATTGGAGCATTGAGGCTCAGGGGGCTCTCCAATGGACTCTTTCTCATGCCCTTCACACCCATGACACCATTCTT
CTTCTTCATGTTCTCAAGACTTCCAACAAACAAGGTTTTGGGTTTAATAATAAGGTGGATTACATAAAGGCCTATGAGCTCCTCTTTTCTATGAGGAATATGTGCCTAAA
GAGAAGGCCTGAGGTGCAAGTAGAGATAGCATTGCTGGAAGGGAAGGAAAGAGGTCCAGTAATTGTGGAAGAGGCAAAGAAGCATAAACTGTCCCTTTTGGTACTTGGTC
AAAGAAAGAGGCCGATCCTACGACGTCTGTTGAACAGATGGGCAACCAGAAGCGATCGGAGAGGGAGGAAGAAGAAGAAAACAACTTGTCGAGGGACGGTAGAATATTGC
ATTCAGAACTCATCTTGCTTGACAATTGCAGTGAGGAAGAAGAGCAGACAGATTGGAGGATATTTGATCACAACCAAACGTCACAGGAACTTCTGGCTCTTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
TGGATTTATGTTGCTTCGTTTTGCCACTAACTTTAAAGCTACATAAATGGGTGGGGGAAAAAATAGTGGGTGTTGAGTTTCAGAGGGTGAGAGTGAGGATTTTCATGGCT
GGCTTTTGGGTGTGTTCTGGACCCAACAAGAAGAATAAGAAGAAGAAGAAGAAGAAGAATGAGCTGTTGAGAAGTGGGGAAGAAGAGTTGAGCTCCAACAGCTCAAAATC
TGTGTCTGAAAATGGAAACAGAGTAATGGTGGTTGTGGATTGGAGCATTGAGGCTCAGGGGGCTCTCCAATGGACTCTTTCTCATGCCCTTCACACCCATGACACCATTC
TTCTTCTTCATGTTCTCAAGACTTCCAACAAACAAGGTTTTGGGTTTAATAATAAGGTGGATTACATAAAGGCCTATGAGCTCCTCTTTTCTATGAGGAATATGTGCCTA
AAGAGAAGGCCTGAGGTGCAAGTAGAGATAGCATTGCTGGAAGGGAAGGAAAGAGGTCCAGTAATTGTGGAAGAGGCAAAGAAGCATAAACTGTCCCTTTTGGTACTTGG
TCAAAGAAAGAGGCCGATCCTACGACGTCTGTTGAACAGATGGGCAACCAGAAGCGATCGGAGAGGGAGGAAGAAGAAGAAAACAACTTGTCGAGGGACGGTAGAATATT
GCATTCAGAACTCATCTTGCTTGACAATTGCAGTGAGGAAGAAGAGCAGACAGATTGGAGGATATTTGATCACAACCAAACGTCACAGGAACTTCTGGCTCTTGGCTTGA
TTCTCTTTTCTTTCCCCTTTTTCTGTAACATATTCTCTCATCTTTTATCTTTTAGGGTTAAAAAGTGGATATCATATTCAGTCCTTCAAACTTGGACTTGCTATACTATA
GAGTGATTGAAACTGTTGCTTGCATTTATAGGTAAAGTTGGGATTTTCTCACATCAATCAGATACAAAAGGTTTTGAAATTGTAGCTTTTGTCCCGCAGAGAGTTATAGC
AAGATCCACCTCTAAAATTCTGTTGTTCAAGCATATTTTTAAACAAAATTCGATAACTGAGGTCGAGGTAGATTATCATCGGACCCCCTTTTAATGGAACGACAGATACA
GAATAATTCATTAAGCTAAGATTATCTACCAACCAAGATATAAAATGGAACAGAAACTTGTTAAGTGTGTCCTTCAATCTGTATTTTCAACTTGAATTCATCAATTACAT
AAAAAACCAACAAATGTAGATGAACGTATTTGGATTATACACAGACAATGCCTTCAATTCCGGAAGGACACTTTTCCAGTGGATCAACATCTTCTCTTCTCCCTTTTTCA
AGTCTGCAATGCTTAATGTTGAGGTTGAACCAGGAAACAATAGAAGCTATATATCTCCATTTTCAGACTTTCAGTCATTTTGAGGCTATTTGATGTGGCCCAATGGAGTA
ACCATGTCTACATGCAATTTCGTGCAGTTGATCATAACCAGCAAGAACCCCAGCCCCTGCAACACCAACAAGCATATTGGCAGTTACTCCTCGAAAAAGAGCGGTAATAC
CCTCAACGCGAATGATCTCTGAAAATGCATGGAAAGGACTACGGTACTTCAGGGTTTGTCCAGAGGTAAGCATCATTCTCCGTCGCAAAGTGTCGAAAGGATAAGCACAT
ACCCCAGAAAAGGTTGTGATACTCCACCCCAATAAGAAACTAGCAAAAAAACTTCCCTGCATATAACATTAAACTATACAGAATTTCAAGCCATTTGAAAGAGAGGTATA
AAATAGGCGAGAAAAGAAATTGAAGATATATCTAGGAGTTGTATATCATCCATGCTTATGACAATCAGAGATGGTATTTCTCCAAGAATCAGGTCCCAAGCCAATCATTT
GTGTACTGACAATTATGAAATGTAAATGAACGTAATTGGTTTATGGATATTGTTGAAAGTTATATGACGTAATTTCCATATCTAGTATCAGAACTGCTATAAGCATGAGA
CCAGAATCTTCTGGTTAAAGGTTTGAGATACAAGTTAATTGTCTACCTAGAAGAATATAGACCGAAATTAATTGATGTTCTAAATAGAATAGATGCTTACCTCAAAGTGC
CCAACCAAAACAAGTGGCTTTAAAGTGTCGTAAATCCCAAAATACAATCCCCTGTACAAAGTGATTCCCATGATTGAAACACTGAATCCTCGGTAAAGTCCAACAATTCC
ATCACTCGAAAAGGTTTTTCTGTAGACATCCAGAAGCCCTTTAAACTGGCGCTGACCGTTACCGCCACAATCCTTTGCATCTGTGGCTAGTCGAGTGCGTGCATAATCCA
AATGATACAGAAACAAAGATGTAGTTGCTCCTGCAGCACTTCCTGAAGCAACATTTCCAGCAAACCACTTGACGTATCCATCATTCTCCTTTGAG
Protein sequenceShow/hide protein sequence
DLCCFVLPLTLKLHKWVGEKIVGVEFQRVRVRIFMAGFWVCSGPNKKNKKKKKKKNELLRSGEEELSSNSSKSVSENGNRVMVVVDWSIEAQGALQWTLSHALHTHDTIL
LLHVLKTSNKQGFGFNNKVDYIKAYELLFSMRNMCLKRRPEVQVEIALLEGKERGPVIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRSDRRGRKKKKTTCRGTVEYC
IQNSSCLTIAVRKKSRQIGGYLITTKRHRNFWLLA