; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g25170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g25170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr4:18267827..18274431
RNA-Seq ExpressionMoc04g25170
SyntenyMoc04g25170
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151685.1 uncharacterized protein LOC111019601 [Momordica charantia]1.2e-5342.75Show/hide
Query:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV
        D + ++     ++FP+ FH +CTYHL  N++SRFKN    KL+ +  YA+ +SQF                KYL+E+G ER ARCYQV RRY NMTTNS 
Subjt:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV

Query:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----
         SVN+ L+EAR L +T +++ I ++LQRWFH+RR  WST++T+HSDY E+ L  QFEKSR Y VK ID C F V+D  L+G VD+N R  TC +F     
Subjt:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----

Query:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS
                                  D+L     +PI+ +GHM EW++P D+ P  + P   +KR  RR T RI S
Subjt:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS

XP_022152153.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia]2.8e-5036.42Show/hide
Query:  KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY------------------------------
        K ASSWV+ANLIK+   GT + Y++KHI +DV  +FGV+ISYDKA+RA+ELAY I+RG PEDSY H+ AY                              
Subjt:  KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY------------------------------

Query:  ----------------------------------------------------------------------------------------VRDVFPNTFHGM
                                                                                                  ++FP+ FH +
Subjt:  ----------------------------------------------------------------------------------------VRDVFPNTFHGM

Query:  CTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKH
        CTYHL  N++ RFKN A  KL+ +  YA+ +S F                KYL+E+G +R ARCYQV RRY NMTTNS ESVN++LREAR L +T ++  
Subjt:  CTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKH

Query:  IRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSI
        IR++LQRWFH+RR +WST++T+HS+Y E+ L  QFEKSR Y VK +
Subjt:  IRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSI

XP_022154925.1 uncharacterized protein LOC111022071 [Momordica charantia]3.9e-7634.94Show/hide
Query:  DDRNNTYVGGQLKGLLVPQGISFNQLLDRLYRLINVRQSECDLVLRVPYELVYSSPPLYLTNDDDLR--TDQRTVADSPDHHHE-PTESICQPTVDLSLD
        +D  N+YVGG+LKG++VP  +++ +L +RL+RL+ V Q+  DLV+RVPY L   SPP+++T+DDDLR    Q  V+  P      P ESI   +  L+ D
Subjt:  DDRNNTYVGGQLKGLLVPQGISFNQLLDRLYRLINVRQSECDLVLRVPYELVYSSPPLYLTNDDDLR--TDQRTVADSPDHHHE-PTESICQPTVDLSLD

Query:  DETA---------------------------------NHATCTRVR----------------------LNVERRSLHQQTMGGLDVHVGRLFISKNDIRL
         E +                                  HA+   V                       + +E+   +    GG ++HVG++F+SK D+R+
Subjt:  DETA---------------------------------NHATCTRVR----------------------LNVERRSLHQQTMGGLDVHVGRLFISKNDIRL

Query:  VLATMAICNNKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVH
        VL+  A+ +N+E+KV RST+S F VRC+D       ++                             K ASSWV+ANLIKD   GTG+ Y++KHI +DV 
Subjt:  VLATMAICNNKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVH

Query:  TKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FH----GMCTYHL-----------------------SRNLRSRFK------
         ++GV+ISYDKA+RA+ELAY I+RG P+DSY H+ AY   ++   P T FH    G   +                           +L+ +FK      
Subjt:  TKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FH----GMCTYHL-----------------------SRNLRSRFK------

Query:  --------------------NEAATKLF--------------------SNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNM
                            ++A+ K F                     +  YA+ +SQF                KYL+E+G ER ARCYQV RRY N 
Subjt:  --------------------NEAATKLF--------------------SNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNM

Query:  TTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVED
        TTNS ESVN++LREAR L +T +++ IR++LQRWFH+RR +WST++T+HSDY E+ L  QFEKSR Y VK +D C F VED
Subjt:  TTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVED

XP_022156308.1 uncharacterized protein LOC111023235 [Momordica charantia]2.7e-5643.48Show/hide
Query:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV
        D +  +     ++FP+ FH +CTYHL  N+++RFKN A  KL+ +  YA+ +SQF                KYL+E+G ER ARCYQV RRY NMTTNS 
Subjt:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV

Query:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----
        ESVN++LREAR L +T +++ I D+LQRWFH+RR +WST++T+HSDY E+ L  QFEKSR Y VK +D C F VED  L+  VD+N R CTC +F     
Subjt:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----

Query:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS
                                  D+L     +PI+ +GHM EW++P D++P  + P   +KR  R  T RI S
Subjt:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS

XP_022159086.1 uncharacterized protein LOC111025530 [Momordica charantia]8.2e-6638.44Show/hide
Query:  NKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISY
        N+E+K  RST+S F VRC+D       ++                             K ASSWV+ANLIKD    T + Y++KHI +DV  +F V+ISY
Subjt:  NKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISY

Query:  DKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FHGMCTYHLSRNLRSRF------------------KNEAATKLFSN---------PE---
        DKA+RA+ELAY I+RG  EDSY H+ AY   ++   P T FH      + R+  +R                   +++A+ K F           PE   
Subjt:  DKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FHGMCTYHLSRNLRSRF------------------KNEAATKLFSN---------PE---

Query:  --------YAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWST
                YA+ +SQF                KYL+E+G ER ARCYQV RRY NMTTNS ESVN +LREA  L +T +++ IRD+LQRWFH RR +WST
Subjt:  --------YAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWST

Query:  RSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-------------------------------DAL-----QPIMS
        ++T+HSDY E+ L  QFEKSR Y VK +D C F VED  L+G VD+N R CTC +F                               D+L     +PI+ 
Subjt:  RSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-------------------------------DAL-----QPIMS

Query:  IGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS
        IGHM EW++P +++   + P   +KR  RR T +I S
Subjt:  IGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS

TrEMBL top hitse value%identityAlignment
A0A6J1DBW0 uncharacterized protein LOC1110196016.0e-5442.75Show/hide
Query:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV
        D + ++     ++FP+ FH +CTYHL  N++SRFKN    KL+ +  YA+ +SQF                KYL+E+G ER ARCYQV RRY NMTTNS 
Subjt:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV

Query:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----
         SVN+ L+EAR L +T +++ I ++LQRWFH+RR  WST++T+HSDY E+ L  QFEKSR Y VK ID C F V+D  L+G VD+N R  TC +F     
Subjt:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----

Query:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS
                                  D+L     +PI+ +GHM EW++P D+ P  + P   +KR  RR T RI S
Subjt:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS

A0A6J1DE35 protein FAR-RED ELONGATED HYPOCOTYL 3-like1.1e-5036.71Show/hide
Query:  KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY------------------------------
        K ASSWV+ANLIK+   GT + Y++KHI +DV  +FGV+ISYDKA+RA+ELAY I+RG PEDSY H+ AY                              
Subjt:  KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY------------------------------

Query:  ----------------------------------------------------------------------------------------VRDVFPNTFHGM
                                                                                                  ++FP+ FH +
Subjt:  ----------------------------------------------------------------------------------------VRDVFPNTFHGM

Query:  CTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKH
        CTYHL  NL+ RFKN A  KL+ +  YA+ +S F                KYL+E+G +R ARCYQV RRY NMTTNS ESVN++LREAR L +T ++  
Subjt:  CTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKH

Query:  IRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSI
        IR++LQRWFH+RR +WST++T+HS+Y E+ L  QFEKSR Y VK +
Subjt:  IRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSI

A0A6J1DL08 uncharacterized protein LOC1110220711.9e-7634.94Show/hide
Query:  DDRNNTYVGGQLKGLLVPQGISFNQLLDRLYRLINVRQSECDLVLRVPYELVYSSPPLYLTNDDDLR--TDQRTVADSPDHHHE-PTESICQPTVDLSLD
        +D  N+YVGG+LKG++VP  +++ +L +RL+RL+ V Q+  DLV+RVPY L   SPP+++T+DDDLR    Q  V+  P      P ESI   +  L+ D
Subjt:  DDRNNTYVGGQLKGLLVPQGISFNQLLDRLYRLINVRQSECDLVLRVPYELVYSSPPLYLTNDDDLR--TDQRTVADSPDHHHE-PTESICQPTVDLSLD

Query:  DETA---------------------------------NHATCTRVR----------------------LNVERRSLHQQTMGGLDVHVGRLFISKNDIRL
         E +                                  HA+   V                       + +E+   +    GG ++HVG++F+SK D+R+
Subjt:  DETA---------------------------------NHATCTRVR----------------------LNVERRSLHQQTMGGLDVHVGRLFISKNDIRL

Query:  VLATMAICNNKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVH
        VL+  A+ +N+E+KV RST+S F VRC+D       ++                             K ASSWV+ANLIKD   GTG+ Y++KHI +DV 
Subjt:  VLATMAICNNKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVH

Query:  TKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FH----GMCTYHL-----------------------SRNLRSRFK------
         ++GV+ISYDKA+RA+ELAY I+RG P+DSY H+ AY   ++   P T FH    G   +                           +L+ +FK      
Subjt:  TKFGVHISYDKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FH----GMCTYHL-----------------------SRNLRSRFK------

Query:  --------------------NEAATKLF--------------------SNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNM
                            ++A+ K F                     +  YA+ +SQF                KYL+E+G ER ARCYQV RRY N 
Subjt:  --------------------NEAATKLF--------------------SNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNM

Query:  TTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVED
        TTNS ESVN++LREAR L +T +++ IR++LQRWFH+RR +WST++T+HSDY E+ L  QFEKSR Y VK +D C F VED
Subjt:  TTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVED

A0A6J1DQ99 uncharacterized protein LOC1110232351.3e-5643.48Show/hide
Query:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV
        D +  +     ++FP+ FH +CTYHL  N+++RFKN A  KL+ +  YA+ +SQF                KYL+E+G ER ARCYQV RRY NMTTNS 
Subjt:  DSYTHMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSV

Query:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----
        ESVN++LREAR L +T +++ I D+LQRWFH+RR +WST++T+HSDY E+ L  QFEKSR Y VK +D C F VED  L+  VD+N R CTC +F     
Subjt:  ESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-----

Query:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS
                                  D+L     +PI+ +GHM EW++P D++P  + P   +KR  R  T RI S
Subjt:  --------------------------DAL-----QPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS

A0A6J1E2V3 uncharacterized protein LOC1110255304.0e-6638.44Show/hide
Query:  NKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISY
        N+E+K  RST+S F VRC+D       ++                             K ASSWV+ANLIKD    T + Y++KHI +DV  +F V+ISY
Subjt:  NKEFKVKRSTRSLFAVRCVDRPLGPLASS-----------------------------KWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISY

Query:  DKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FHGMCTYHLSRNLRSRF------------------KNEAATKLFSN---------PE---
        DKA+RA+ELAY I+RG  EDSY H+ AY   ++   P T FH      + R+  +R                   +++A+ K F           PE   
Subjt:  DKAYRAKELAYPILRGWPEDSYTHMDAY---VRDVFPNT-FHGMCTYHLSRNLRSRF------------------KNEAATKLFSN---------PE---

Query:  --------YAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWST
                YA+ +SQF                KYL+E+G ER ARCYQV RRY NMTTNS ESVN +LREA  L +T +++ IRD+LQRWFH RR +WST
Subjt:  --------YAFWESQF----------------KYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKHIRDMLQRWFHDRRNYWST

Query:  RSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-------------------------------DAL-----QPIMS
        ++T+HSDY E+ L  QFEKSR Y VK +D C F VED  L+G VD+N R CTC +F                               D+L     +PI+ 
Subjt:  RSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDF-------------------------------DAL-----QPIMS

Query:  IGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS
        IGHM EW++P +++   + P   +KR  RR T +I S
Subjt:  IGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAGCTGTCCGTTCAGGTCTACAATTTCCTTACAAGTCTGTTCAAGTGCAGCTTGTGTAGCAAGAAGTTCATTTTCTTCGACGGACGAGGGACTGCTTGGCTTGTA
TGACAACATGATGCTGGCTAAGAATGTTTCTTCTAAATGGTTCCTTTGTTGTTTCTCTGGAATCGGAGAAGTCGCAGCTGCGTTGGAAAGAGGAGGTGGCTGTGAATGGA
CGGATGCGTCAGTTCAAGAAATGACGACTGCAATCGACGATGATAGAAACAATACCTACGTCGGCGGTCAACTGAAAGGATTACTAGTGCCACAGGGTATTTCGTTTAAT
CAGTTACTTGACCGTTTGTACAGACTGATCAACGTGAGGCAGTCCGAATGCGATCTCGTCCTCAGAGTTCCTTATGAATTGGTGTACAGTTCGCCCCCATTGTACCTAAC
GAACGACGACGATCTACGGACCGATCAACGTACCGTTGCTGACAGCCCAGATCATCATCATGAGCCGACGGAGTCCATATGTCAACCAACAGTTGATCTCTCTCTCGACG
ATGAAACCGCTAACCATGCTACATGCACACGTGTGAGGCTGAATGTGGAGCGTCGGTCACTACATCAACAGACAATGGGCGGACTAGACGTGCATGTTGGCAGACTGTTC
ATATCGAAAAATGACATCCGACTAGTATTAGCAACTATGGCAATATGTAATAATAAGGAATTCAAGGTTAAGAGATCAACGCGAAGTTTGTTTGCCGTGCGATGCGTTGA
CCGTCCATTAGGTCCCCTTGCTAGCTCAAAATGGGCTAGTAGTTGGGTTATTGCCAACTTGATTAAAGATAGTCAAATCGGTACCGGGCAGACATACCAAGTAAAACACA
TAATTGATGACGTGCACACTAAGTTTGGTGTACACATTAGTTATGACAAAGCATACCGTGCAAAGGAACTTGCGTATCCTATATTGAGGGGATGGCCAGAAGATTCTTAC
ACGCACATGGATGCATACGTCAGAGACGTATTTCCTAACACGTTCCATGGAATGTGTACGTACCATTTGTCGAGGAACCTGAGATCAAGGTTTAAGAACGAGGCGGCCAC
TAAGCTGTTTTCAAACCCGGAGTATGCGTTCTGGGAGTCCCAATTCAAATATTTGGAGGAGATGGGGTTTGAACGAAGGGCTCGTTGCTACCAAGTCAAGAGAAGGTACG
TGAACATGACAACCAATAGTGTCGAGTCTGTAAATTCTATCTTGAGAGAGGCTAGGGGGTTGCTAGTCACTTTGGTCCTCAAACATATAAGGGACATGTTGCAGCGGTGG
TTCCACGATAGGCGAAACTATTGGTCCACACGGAGTACGACTCATTCGGACTATGTCGAGCAATATCTAACATCTCAGTTTGAAAAGTCTCGTACTTATAAGGTGAAATC
GATCGACGTTTGCAAATTCGATGTTGAAGACCGGGATTTGAATGGGATTGTAGATATAAATACGCGCAAATGCACATGCAAGGATTTCGATGCATTACAGCCAATTATGT
CGATCGGCCACATGTTAGAATGGAGGCAACCAGACGACTTTGAGCCGAGGACTATCCATCCATCTAGTAAGATTAAGCGGGTCAGTCGTAGGCATACTAACCGAATCCCC
TCAGTTGACACATGCAAATTTGTTGGGTTGTGGGATATAAGGATGGTGGCCGTAGGGAATGTCTGTACAACAGATGCATGCACAAGAGAAGAGCACATGCTAACTGATTT
GGTGTCCGTCAGGTCAAGACTTTATGGAATGCTTGTTGAAGTCCATGAATACATCGAACGAAGTATCGAAATGGATTACACTTTCACGTACTTGCAGGCTAGGTCGGATG
AGCTCGCCGCCCGGCTAGACGATCTCCAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGAAGCTGTCCGTTCAGGTCTACAATTTCCTTACAAGTCTGTTCAAGTGCAGCTTGTGTAGCAAGAAGTTCATTTTCTTCGACGGACGAGGGACTGCTTGGCTTGTA
TGACAACATGATGCTGGCTAAGAATGTTTCTTCTAAATGGTTCCTTTGTTGTTTCTCTGGAATCGGAGAAGTCGCAGCTGCGTTGGAAAGAGGAGGTGGCTGTGAATGGA
CGGATGCGTCAGTTCAAGAAATGACGACTGCAATCGACGATGATAGAAACAATACCTACGTCGGCGGTCAACTGAAAGGATTACTAGTGCCACAGGGTATTTCGTTTAAT
CAGTTACTTGACCGTTTGTACAGACTGATCAACGTGAGGCAGTCCGAATGCGATCTCGTCCTCAGAGTTCCTTATGAATTGGTGTACAGTTCGCCCCCATTGTACCTAAC
GAACGACGACGATCTACGGACCGATCAACGTACCGTTGCTGACAGCCCAGATCATCATCATGAGCCGACGGAGTCCATATGTCAACCAACAGTTGATCTCTCTCTCGACG
ATGAAACCGCTAACCATGCTACATGCACACGTGTGAGGCTGAATGTGGAGCGTCGGTCACTACATCAACAGACAATGGGCGGACTAGACGTGCATGTTGGCAGACTGTTC
ATATCGAAAAATGACATCCGACTAGTATTAGCAACTATGGCAATATGTAATAATAAGGAATTCAAGGTTAAGAGATCAACGCGAAGTTTGTTTGCCGTGCGATGCGTTGA
CCGTCCATTAGGTCCCCTTGCTAGCTCAAAATGGGCTAGTAGTTGGGTTATTGCCAACTTGATTAAAGATAGTCAAATCGGTACCGGGCAGACATACCAAGTAAAACACA
TAATTGATGACGTGCACACTAAGTTTGGTGTACACATTAGTTATGACAAAGCATACCGTGCAAAGGAACTTGCGTATCCTATATTGAGGGGATGGCCAGAAGATTCTTAC
ACGCACATGGATGCATACGTCAGAGACGTATTTCCTAACACGTTCCATGGAATGTGTACGTACCATTTGTCGAGGAACCTGAGATCAAGGTTTAAGAACGAGGCGGCCAC
TAAGCTGTTTTCAAACCCGGAGTATGCGTTCTGGGAGTCCCAATTCAAATATTTGGAGGAGATGGGGTTTGAACGAAGGGCTCGTTGCTACCAAGTCAAGAGAAGGTACG
TGAACATGACAACCAATAGTGTCGAGTCTGTAAATTCTATCTTGAGAGAGGCTAGGGGGTTGCTAGTCACTTTGGTCCTCAAACATATAAGGGACATGTTGCAGCGGTGG
TTCCACGATAGGCGAAACTATTGGTCCACACGGAGTACGACTCATTCGGACTATGTCGAGCAATATCTAACATCTCAGTTTGAAAAGTCTCGTACTTATAAGGTGAAATC
GATCGACGTTTGCAAATTCGATGTTGAAGACCGGGATTTGAATGGGATTGTAGATATAAATACGCGCAAATGCACATGCAAGGATTTCGATGCATTACAGCCAATTATGT
CGATCGGCCACATGTTAGAATGGAGGCAACCAGACGACTTTGAGCCGAGGACTATCCATCCATCTAGTAAGATTAAGCGGGTCAGTCGTAGGCATACTAACCGAATCCCC
TCAGTTGACACATGCAAATTTGTTGGGTTGTGGGATATAAGGATGGTGGCCGTAGGGAATGTCTGTACAACAGATGCATGCACAAGAGAAGAGCACATGCTAACTGATTT
GGTGTCCGTCAGGTCAAGACTTTATGGAATGCTTGTTGAAGTCCATGAATACATCGAACGAAGTATCGAAATGGATTACACTTTCACGTACTTGCAGGCTAGGTCGGATG
AGCTCGCCGCCCGGCTAGACGATCTCCAAGAATGA
Protein sequenceShow/hide protein sequence
MRSCPFRSTISLQVCSSAACVARSSFSSTDEGLLGLYDNMMLAKNVSSKWFLCCFSGIGEVAAALERGGGCEWTDASVQEMTTAIDDDRNNTYVGGQLKGLLVPQGISFN
QLLDRLYRLINVRQSECDLVLRVPYELVYSSPPLYLTNDDDLRTDQRTVADSPDHHHEPTESICQPTVDLSLDDETANHATCTRVRLNVERRSLHQQTMGGLDVHVGRLF
ISKNDIRLVLATMAICNNKEFKVKRSTRSLFAVRCVDRPLGPLASSKWASSWVIANLIKDSQIGTGQTYQVKHIIDDVHTKFGVHISYDKAYRAKELAYPILRGWPEDSY
THMDAYVRDVFPNTFHGMCTYHLSRNLRSRFKNEAATKLFSNPEYAFWESQFKYLEEMGFERRARCYQVKRRYVNMTTNSVESVNSILREARGLLVTLVLKHIRDMLQRW
FHDRRNYWSTRSTTHSDYVEQYLTSQFEKSRTYKVKSIDVCKFDVEDRDLNGIVDINTRKCTCKDFDALQPIMSIGHMLEWRQPDDFEPRTIHPSSKIKRVSRRHTNRIP
SVDTCKFVGLWDIRMVAVGNVCTTDACTREEHMLTDLVSVRSRLYGMLVEVHEYIERSIEMDYTFTYLQARSDELAARLDDLQE