CuGenDBv2

Moc08g17110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g17110
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:13060510..13065157
RNA-Seq Expression: Moc08g17110
Synteny: Moc08g17110
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
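
The genome location field above follows a chromosome:start..end pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. The function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr8:13060510..13065157")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4648 bp on chr8.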


Homology
GenBank top hits
KAF7827992.1 LINE-type retrotransposon LIb DNA [Senna tora] (e value: 5.4e-31; % identity: 20.41)
Query:  IWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETEMNEISDFISK---------
        +W   GE +L+ L ND+F+A  +   D+D  L  GPW ILD+YL +R W+  F P    I + A WV +PD P+ELY+   ++ I +FI K         
Subjt:  IWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETEMNEISDFISK---------

Query:  --------------------------------------------CWVF-HKLEECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQ
                                                    C V+ H LE C +R   K K         +    ++   +E               
Subjt:  --------------------------------------------CWVF-HKLEECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQ

Query:  VSQPKPSPGHGPWLLVDHSKRKGGGKPPRTRSERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDRVKSLQNKFPFDYQGPVETQDSIGRRKPC
               P +G W+ V  ++R     P R RS                NQ P +E+  V         A    K  Q +   D    +  Q+  G+    
Subjt:  VSQPKPSPGHGPWLLVDHSKRKGGGKPPRTRSERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDRVKSLQNKFPFDYQGPVETQDSIGRRKPC

Query:  WMDKDYNPRDMDSNQEDDLDPE-------SPIPCKGF--------------------------FDSVGASTSETLSRKSDYHQAQIMNLGDVNVVMHQRD
           +++    M++ Q++D   E       SP   KG                             + G +T +  +++S   + +  + G    V  QR 
Subjt:  WMDKDYNPRDMDSNQEDDLDPE-------SPIPCKGF--------------------------FDSVGASTSETLSRKSDYHQAQIMNLGDVNVVMHQRD

Query:  EDAPQAMER--------AGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRI
        + + +  E+        A G +F+   +++ + + TD++ + EP+ SG  A          +   I+A  + GGIW LW  + ++   + +++Q     I
Subjt:  EDAPQAMER--------AGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRI

Query:  TKGNSSS-VFTAIYRSPQRASRRDLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSR
             S     A+Y +P   +R  +W  +++IA+  S P ++ G FNEI                                    P +TW GP  QG  +
Subjt:  TKGNSSS-VFTAIYRSPQRASRRDLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSR

Query:  IFERLDR------------DTAGFCAS--NRNKKPFRIRIETLKDNNDN------------WVTEDSAFKELVVEHFQNMYGVSDMGVKL-ITDSTFPIL
        +++RLDR            D +  C +  + +  P  +  E    +  N            W  E    + +++E ++N++   D G    ++   +P +
Subjt:  IFERLDR------------DTAGFCAS--NRNKKPFRIRIETLKDNNDN------------WVTEDSAFKELVVEHFQNMYGVSDMGVKL-ITDSTFPIL

Query:  PTKSIGSLIKEVSLEEVRSAMAHDPCPTCASSSSNSLHAHFSVPSFVGAWNEISYWMRWGIGNGCLVKFWSDRWINRKNLIEIMGDIGILELEKYRPVRD
         T +  S+ K    EE++ A+ +                    P   G +  + Y   W +          D+WI   + +  M    + +      + D
Subjt:  PTKSIGSLIKEVSLEEVRSAMAHDPCPTCASSSSNSLHAHFSVPSFVGAWNEISYWMRWGIGNGCLVKFWSDRWINRKNLIEIMGDIGILELEKYRPVRD

Query:  YVLETGEW-NWEAFTGKVNQSVLLAIASIRPPD
        YV E+G+W N+E F G + +   + I S+ PP+
Subjt:  YVLETGEW-NWEAFTGKVNQSVLLAIASIRPPD

MCH80356.1 hypothetical protein [Trifolium medium] (e value: 1.0e-29; % identity: 22.6)
Query:  IWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETE-MNEISDFISKCWVFHKLE
        +W  KG   +I L  D+F+ +L  +ED  + + DGPW I D+YL+++ W+P F P+  TID   +WV IPD P+E Y+  E   E++   S C+V H + 
Subjt:  IWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETE-MNEISDFISKCWVFHKLE

Query:  ECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKR-----KGGGKPPRTRSERGIFS---SKNLNSK
                +     N                    +   ++   +MQ S      G GPW++V  ++R     +GG +     +  G      ++N  ++
Subjt:  ECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKR-----KGGGKPPRTRSERGIFS---SKNLNSK

Query:  WNANQDPIEELTDVLDTIVDPSE---------ALDRVKSLQNKFPFDYQGPVETQ------DSIGRRKPCWMDKDYNPRDMDSNQEDDLDPESPIPCKGF
        +       +E  + +D+ +  +E          + R K   NK        +ET+       + G  +P  + +  N       + D++  E PI     
Subjt:  WNANQDPIEELTDVLDTIVDPSE---------ALDRVKSLQNKFPFDYQGPVETQ------DSIGRRKPCWMDKDYNPRDMDSNQEDDLDPESPIPCKGF

Query:  FDSVGAST-------------------------SETLSRKSDYHQAQIMNL----------------------GDVNVVMHQRDEDAPQAMERAGGPRFK
         +++   T                          +T +R  DY      NL                      G++ VV   +     +    A    F 
Subjt:  FDSVGAST-------------------------SETLSRKSDYHQAQIMNL----------------------GDVNVVMHQRDEDAPQAMERAGGPRFK

Query:  YVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSL-VELNRYNQAFHFRITKGNSSS-VFTAIYRSPQRASRR
           +  +     D+LVI+E ++      +           +     F GGI + W+H  V++ +E+N + Q  H  I  G  ++  FT++Y SP+   RR
Subjt:  YVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSL-VELNRYNQAFHFRITKGNSSS-VFTAIYRSPQRASRR

Query:  DLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR
        +LW  L  ++  I   W++ G FN+IL                                 +  KFTWRGP+  G  RIFERLDR
Subjt:  DLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR

XP_022137804.1 uncharacterized protein LOC111009151 [Momordica charantia] (e value: 2.3e-29; % identity: 42.69)
Query:  LVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQRASRRDLWKFLDSIASEISEP
        +VI+EPKISG +A+ VC SF  FS+  +EA   KGGIWV W+ + VSL+E+    QA HFR  +   S  FT +Y SPQR+S+R+LW FL S+      P
Subjt:  LVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQRASRRDLWKFLDSIASEISEP

Query:  WLLLGYFNEILVR---------------------------------PKFTWRGPLLQGYSRIFERLDRDTA
        WLL+G FN I                                    PKFTW+GPL+ G+ R+FERLDR  A
Subjt:  WLLLGYFNEILVR---------------------------------PKFTWRGPLLQGYSRIFERLDRDTA

XP_022153253.1 uncharacterized protein LOC111020790 [Momordica charantia] (e value: 1.7e-64; % identity: 57.45)
Query:  ETEMNEISDFISKCWVF-HKLEECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKRKGGGKPPRTR
        E+ M  I      C V+ HKLEECPLRC+T+TK NDNS G  D +G DSR PKESNI EFPQK  I + VSQ  PSPGHGPW+LVDHSKR GGGKPPRTR
Subjt:  ETEMNEISDFISKCWVF-HKLEECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKRKGGGKPPRTR

Query:  SERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDRVKSLQNK-------------------------------FPFDYQGPVETQDSIGRRKPC
        S RG  SSKNLN+KWN N DP EE  +VLDTIVDP+   D +KS Q K                                PFDYQG +  QD+ GRRKP 
Subjt:  SERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDRVKSLQNK-------------------------------FPFDYQGPVETQDSIGRRKPC

Query:  WMDKDYNPRDMDSNQEDDLDPESPIPCKGFFDSVG
        WMD+DY+P DMDS QEDD DP+SPIP  G+FDS G
Subjt:  WMDKDYNPRDMDSNQEDDLDPESPIPCKGFFDSVG

XP_031402735.1 uncharacterized protein LOC116212324 [Punica granatum] (e value: 4.1e-31; % identity: 34.75)
Query:  AGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQ
        AG  RF  V+++MI+ +  +++VI+EP+ISG  A+ VC  F+ +S   +EA  F GGIWV WQ N V +   +R+ QA H RI++ + +  FTA+Y SP 
Subjt:  AGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQ

Query:  RASRRDLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR-----------
          +RRDLW  L +I++ I+ PW++LG FN IL                                   P+FTW GP+  GY R+FERLDR           
Subjt:  RASRRDLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR-----------

Query:  --------------------DTAGFCASNRNKKPFR
                            DT GF  S++ ++PFR
Subjt:  --------------------DTAGFCASNRNKKPFR

TrEMBL top hits
A0A392LZC2 Uncharacterized protein (Fragment) (e value: 4.9e-30; % identity: 22.6)
Query:  IWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETE-MNEISDFISKCWVFHKLE
        +W  KG   +I L  D+F+ +L  +ED  + + DGPW I D+YL+++ W+P F P+  TID   +WV IPD P+E Y+  E   E++   S C+V H + 
Subjt:  IWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETE-MNEISDFISKCWVFHKLE

Query:  ECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKR-----KGGGKPPRTRSERGIFS---SKNLNSK
                +     N                    +   ++   +MQ S      G GPW++V  ++R     +GG +     +  G      ++N  ++
Subjt:  ECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKR-----KGGGKPPRTRSERGIFS---SKNLNSK

Query:  WNANQDPIEELTDVLDTIVDPSE---------ALDRVKSLQNKFPFDYQGPVETQ------DSIGRRKPCWMDKDYNPRDMDSNQEDDLDPESPIPCKGF
        +       +E  + +D+ +  +E          + R K   NK        +ET+       + G  +P  + +  N       + D++  E PI     
Subjt:  WNANQDPIEELTDVLDTIVDPSE---------ALDRVKSLQNKFPFDYQGPVETQ------DSIGRRKPCWMDKDYNPRDMDSNQEDDLDPESPIPCKGF

Query:  FDSVGAST-------------------------SETLSRKSDYHQAQIMNL----------------------GDVNVVMHQRDEDAPQAMERAGGPRFK
         +++   T                          +T +R  DY      NL                      G++ VV   +     +    A    F 
Subjt:  FDSVGAST-------------------------SETLSRKSDYHQAQIMNL----------------------GDVNVVMHQRDEDAPQAMERAGGPRFK

Query:  YVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSL-VELNRYNQAFHFRITKGNSSS-VFTAIYRSPQRASRR
           +  +     D+LVI+E ++      +           +     F GGI + W+H  V++ +E+N + Q  H  I  G  ++  FT++Y SP+   RR
Subjt:  YVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSL-VELNRYNQAFHFRITKGNSSS-VFTAIYRSPQRASRR

Query:  DLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR
        +LW  L  ++  I   W++ G FN+IL                                 +  KFTWRGP+  G  RIFERLDR
Subjt:  DLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR

A0A6J1C8B2 uncharacterized protein LOC111009151 (e value: 1.1e-29; % identity: 42.69)
Query:  LVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQRASRRDLWKFLDSIASEISEP
        +VI+EPKISG +A+ VC SF  FS+  +EA   KGGIWV W+ + VSL+E+    QA HFR  +   S  FT +Y SPQR+S+R+LW FL S+      P
Subjt:  LVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQRASRRDLWKFLDSIASEISEP

Query:  WLLLGYFNEILVR---------------------------------PKFTWRGPLLQGYSRIFERLDRDTA
        WLL+G FN I                                    PKFTW+GPL+ G+ R+FERLDR  A
Subjt:  WLLLGYFNEILVR---------------------------------PKFTWRGPLLQGYSRIFERLDRDTA

A0A6J1DGZ9 uncharacterized protein LOC111020790 (e value: 8.1e-65; % identity: 57.45)
Query:  ETEMNEISDFISKCWVF-HKLEECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKRKGGGKPPRTR
        E+ M  I      C V+ HKLEECPLRC+T+TK NDNS G  D +G DSR PKESNI EFPQK  I + VSQ  PSPGHGPW+LVDHSKR GGGKPPRTR
Subjt:  ETEMNEISDFISKCWVF-HKLEECPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKRKGGGKPPRTR

Query:  SERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDRVKSLQNK-------------------------------FPFDYQGPVETQDSIGRRKPC
        S RG  SSKNLN+KWN N DP EE  +VLDTIVDP+   D +KS Q K                                PFDYQG +  QD+ GRRKP 
Subjt:  SERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDRVKSLQNK-------------------------------FPFDYQGPVETQDSIGRRKPC

Query:  WMDKDYNPRDMDSNQEDDLDPESPIPCKGFFDSVG
        WMD+DY+P DMDS QEDD DP+SPIP  G+FDS G
Subjt:  WMDKDYNPRDMDSNQEDDLDPESPIPCKGFFDSVG

A0A6P5REG9 uncharacterized protein LOC110747435 (e value: 2.1e-28; % identity: 23.09)
Query:  WKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETEMNEISDFISKC---------
        W  KG +KLI L ND+F+A  +L+ED + VL +GPW I   YL ++ W P F P+T  I R A+W+ +    +E ++   +  I + + K          
Subjt:  WKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETEMNEISDFISKC---------

Query:  --------------------------WVFHKLEE---------CPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHG
                                   V++ +E          C L  H +          P  +  D +  K  N +E P  ++  +   +   +   G
Subjt:  --------------------------WVFHKLEE---------CPLRCHTKTKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHG

Query:  PWLLV-DHSKRKGGGKPPRTRSER------------------------------------GIFSSKNLNSKWNANQDP-IEELTDVLDTIVDPSEALDRV
        PW++V    K K G K    +S                                      G+ +S +    W  ++ P     T + DT     ++  + 
Subjt:  PWLLV-DHSKRKGGGKPPRTRSER------------------------------------GIFSSKNLNSKWNANQDP-IEELTDVLDTIVDPSEALDRV

Query:  KSLQNKFPFDYQGPVETQDSIGRRKPCW-MDKDYNPRDMDS-------------NQEDDLDPESPIPC------KGFFDSVGASTSETLSRKS--DYHQA
        ++   +     +       +IG     W +  D     M S             +Q   +DP   I        KG  D + ++ +  L +    D  + 
Subjt:  KSLQNKFPFDYQGPVETQDSIGRRKPCW-MDKDYNPRDMDS-------------NQEDDLDPESPIPC------KGFFDSVGASTSETLSRKS--DYHQA

Query:  QIMNLGDVNVVMHQRDEDAPQAMERAGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRY
        QI+++G+V      R+      +E A   +FK  L ++I+ +  D+L I EP+I GN A ++  S     Y  +  + F GG+W+LW  + + +  L   
Subjt:  QIMNLGDVNVVMHQRDEDAPQAMERAGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRY

Query:  NQAFHFRIT-KGNSSSVFTAIYRSPQRASRRDLWKFLDSIASEISEPWLLLGYFNEILVRPKFTWRGPL--LQGYSRIFERLDRDTAGFCASN---RNKK
        +Q     ++  GN S +F+AIY SP  A R  LW +L+ +AS    PWL+ G FN++L         PL  L+G+ + F+  D    G+  +     NK+
Subjt:  NQAFHFRIT-KGNSSSVFTAIYRSPQRASRRDLWKFLDSIASEISEPWLLLGYFNEILVRPKFTWRGPL--LQGYSRIFERLDRDTAGFCASN---RNKK

Query:  PF
         F
Subjt:  PF

A0A6P8E5K3 uncharacterized protein LOC116212324 (e value: 2.0e-31; % identity: 34.75)
Query:  AGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQ
        AG  RF  V+++MI+ +  +++VI+EP+ISG  A+ VC  F+ +S   +EA  F GGIWV WQ N V +   +R+ QA H RI++ + +  FTA+Y SP 
Subjt:  AGGPRFKYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQ

Query:  RASRRDLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR-----------
          +RRDLW  L +I++ I+ PW++LG FN IL                                   P+FTW GP+  GY R+FERLDR           
Subjt:  RASRRDLWKFLDSIASEISEPWLLLGYFNEIL---------------------------------VRPKFTWRGPLLQGYSRIFERLDR-----------

Query:  --------------------DTAGFCASNRNKKPFR
                            DT GF  S++ ++PFR
Subjt:  --------------------DTAGFCASNRNKKPFR

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCAATTTGGAAATCTAAAGGGGAATTCAAGTTGATCAGTCTCAGCAATGATTTCTTTATTGCGCACCTGGAATTGAAAGAGGATCGAGACAGAGTCCTTATGGACGG
CCCTTGGAAGATTTTAGACTACTATCTGGCGATTCGTGGCTGGTCTCCCAAATTCAGGCCATCAACAGTGACTATTGACAGGGCTGCTTTGTGGGTTTGCATCCCTGATT
GCCCGATGGAACTTTATAATGAGACAGAGATGAATGAAATCAGTGATTTCATAAGCAAGTGTTGGGTTTTTCACAAATTGGAGGAGTGTCCCTTGAGATGCCATACTAAG
ACGAAGGCCAATGATAATTCATTTGGAACACCAGACCCCATTGGAGGAGATTCGAGGGATCCAAAGGAAAGCAATATCCTGGAGTTTCCACAAAAGTCTTTAATAGCAAT
GCAAGTTTCTCAGCCAAAACCTTCACCCGGACATGGACCTTGGTTGCTAGTGGATCATTCCAAGAGAAAGGGAGGAGGAAAACCACCTCGTACAAGGTCAGAGAGAGGTA
TATTCTCATCCAAAAACTTGAATTCTAAGTGGAATGCAAATCAGGACCCAATTGAAGAGCTCACGGATGTCCTTGACACAATTGTTGATCCATCTGAGGCGCTTGATCGT
GTAAAATCTCTTCAAAATAAATTTCCTTTTGATTATCAGGGGCCTGTTGAAACTCAAGATTCAATTGGAAGACGAAAACCTTGTTGGATGGACAAAGATTATAACCCTAG
GGACATGGACTCAAACCAAGAGGACGATCTTGATCCAGAATCACCAATCCCTTGCAAGGGGTTTTTCGATTCTGTTGGGGCTTCAACAAGTGAGACGTTGAGTAGGAAGA
GCGATTACCATCAGGCCCAGATAATGAATCTGGGGGATGTCAATGTTGTTATGCATCAGAGAGATGAAGATGCACCTCAGGCAATGGAGAGGGCTGGTGGCCCCCGTTTT
AAATATGTTCTTAGGGATATGATCCAAGCTTACAATACGGATGTTCTAGTAATCTTAGAACCCAAAATTAGTGGTAACGTGGCTAATAGAGTTTGTGGTAGTTTTGCTAG
ATTTTCTTATACTCACATTGAAGCTTTAAGATTTAAGGGTGGCATATGGGTGTTGTGGCAACATAATAGTGTTTCACTTGTTGAGTTAAATAGATACAATCAAGCCTTTC
ATTTTAGAATCACTAAAGGTAACTCCTCGAGTGTGTTTACTGCTATATATAGAAGCCCACAGAGGGCTTCAAGAAGAGACTTATGGAAGTTTCTTGATTCAATTGCTTCT
GAAATTTCTGAGCCTTGGCTGCTTTTAGGATACTTTAATGAGATTTTAGTAAGGCCTAAGTTCACCTGGAGAGGGCCTTTGTTACAAGGTTATTCTAGAATCTTTGAAAG
GCTTGATAGAGATACTGCTGGATTTTGTGCTAGTAATCGTAATAAAAAACCTTTTAGGATTCGCATAGAGACACTAAAAGATAACAATGACAATTGGGTTACAGAGGATT
CAGCCTTTAAGGAGCTGGTAGTTGAGCATTTTCAAAATATGTATGGAGTCTCAGATATGGGTGTTAAACTGATTACGGATTCTACCTTCCCTATCCTCCCAACAAAATCT
ATTGGGAGTCTGATTAAGGAGGTTTCTTTGGAAGAAGTTCGTTCAGCAATGGCACATGACCCTTGTCCAACCTGTGCTTCAAGCTCTTCCAACTCACTCCATGCACATTT
TTCGGTTCCCAGTTTTGTTGGAGCATGGAATGAAATTAGCTATTGGATGAGGTGGGGAATTGGTAATGGGTGTCTGGTCAAATTCTGGAGCGATCGTTGGATAAATAGGA
AAAATCTTATAGAAATAATGGGGGACATTGGCATTCTGGAGTTAGAGAAGTATCGTCCAGTTAGGGATTATGTATTAGAAACAGGAGAGTGGAATTGGGAAGCGTTCACT
GGAAAAGTTAACCAATCAGTGCTGCTGGCCATTGCAAGCATCAGACCACCAGATGAGTGA
mRNA sequence
ATGGCAATTTGGAAATCTAAAGGGGAATTCAAGTTGATCAGTCTCAGCAATGATTTCTTTATTGCGCACCTGGAATTGAAAGAGGATCGAGACAGAGTCCTTATGGACGG
CCCTTGGAAGATTTTAGACTACTATCTGGCGATTCGTGGCTGGTCTCCCAAATTCAGGCCATCAACAGTGACTATTGACAGGGCTGCTTTGTGGGTTTGCATCCCTGATT
GCCCGATGGAACTTTATAATGAGACAGAGATGAATGAAATCAGTGATTTCATAAGCAAGTGTTGGGTTTTTCACAAATTGGAGGAGTGTCCCTTGAGATGCCATACTAAG
ACGAAGGCCAATGATAATTCATTTGGAACACCAGACCCCATTGGAGGAGATTCGAGGGATCCAAAGGAAAGCAATATCCTGGAGTTTCCACAAAAGTCTTTAATAGCAAT
GCAAGTTTCTCAGCCAAAACCTTCACCCGGACATGGACCTTGGTTGCTAGTGGATCATTCCAAGAGAAAGGGAGGAGGAAAACCACCTCGTACAAGGTCAGAGAGAGGTA
TATTCTCATCCAAAAACTTGAATTCTAAGTGGAATGCAAATCAGGACCCAATTGAAGAGCTCACGGATGTCCTTGACACAATTGTTGATCCATCTGAGGCGCTTGATCGT
GTAAAATCTCTTCAAAATAAATTTCCTTTTGATTATCAGGGGCCTGTTGAAACTCAAGATTCAATTGGAAGACGAAAACCTTGTTGGATGGACAAAGATTATAACCCTAG
GGACATGGACTCAAACCAAGAGGACGATCTTGATCCAGAATCACCAATCCCTTGCAAGGGGTTTTTCGATTCTGTTGGGGCTTCAACAAGTGAGACGTTGAGTAGGAAGA
GCGATTACCATCAGGCCCAGATAATGAATCTGGGGGATGTCAATGTTGTTATGCATCAGAGAGATGAAGATGCACCTCAGGCAATGGAGAGGGCTGGTGGCCCCCGTTTT
AAATATGTTCTTAGGGATATGATCCAAGCTTACAATACGGATGTTCTAGTAATCTTAGAACCCAAAATTAGTGGTAACGTGGCTAATAGAGTTTGTGGTAGTTTTGCTAG
ATTTTCTTATACTCACATTGAAGCTTTAAGATTTAAGGGTGGCATATGGGTGTTGTGGCAACATAATAGTGTTTCACTTGTTGAGTTAAATAGATACAATCAAGCCTTTC
ATTTTAGAATCACTAAAGGTAACTCCTCGAGTGTGTTTACTGCTATATATAGAAGCCCACAGAGGGCTTCAAGAAGAGACTTATGGAAGTTTCTTGATTCAATTGCTTCT
GAAATTTCTGAGCCTTGGCTGCTTTTAGGATACTTTAATGAGATTTTAGTAAGGCCTAAGTTCACCTGGAGAGGGCCTTTGTTACAAGGTTATTCTAGAATCTTTGAAAG
GCTTGATAGAGATACTGCTGGATTTTGTGCTAGTAATCGTAATAAAAAACCTTTTAGGATTCGCATAGAGACACTAAAAGATAACAATGACAATTGGGTTACAGAGGATT
CAGCCTTTAAGGAGCTGGTAGTTGAGCATTTTCAAAATATGTATGGAGTCTCAGATATGGGTGTTAAACTGATTACGGATTCTACCTTCCCTATCCTCCCAACAAAATCT
ATTGGGAGTCTGATTAAGGAGGTTTCTTTGGAAGAAGTTCGTTCAGCAATGGCACATGACCCTTGTCCAACCTGTGCTTCAAGCTCTTCCAACTCACTCCATGCACATTT
TTCGGTTCCCAGTTTTGTTGGAGCATGGAATGAAATTAGCTATTGGATGAGGTGGGGAATTGGTAATGGGTGTCTGGTCAAATTCTGGAGCGATCGTTGGATAAATAGGA
AAAATCTTATAGAAATAATGGGGGACATTGGCATTCTGGAGTTAGAGAAGTATCGTCCAGTTAGGGATTATGTATTAGAAACAGGAGAGTGGAATTGGGAAGCGTTCACT
GGAAAAGTTAACCAATCAGTGCTGCTGGCCATTGCAAGCATCAGACCACCAGATGAGTGA
Protein sequence
MAIWKSKGEFKLISLSNDFFIAHLELKEDRDRVLMDGPWKILDYYLAIRGWSPKFRPSTVTIDRAALWVCIPDCPMELYNETEMNEISDFISKCWVFHKLEECPLRCHTK
TKANDNSFGTPDPIGGDSRDPKESNILEFPQKSLIAMQVSQPKPSPGHGPWLLVDHSKRKGGGKPPRTRSERGIFSSKNLNSKWNANQDPIEELTDVLDTIVDPSEALDR
VKSLQNKFPFDYQGPVETQDSIGRRKPCWMDKDYNPRDMDSNQEDDLDPESPIPCKGFFDSVGASTSETLSRKSDYHQAQIMNLGDVNVVMHQRDEDAPQAMERAGGPRF
KYVLRDMIQAYNTDVLVILEPKISGNVANRVCGSFARFSYTHIEALRFKGGIWVLWQHNSVSLVELNRYNQAFHFRITKGNSSSVFTAIYRSPQRASRRDLWKFLDSIAS
EISEPWLLLGYFNEILVRPKFTWRGPLLQGYSRIFERLDRDTAGFCASNRNKKPFRIRIETLKDNNDNWVTEDSAFKELVVEHFQNMYGVSDMGVKLITDSTFPILPTKS
IGSLIKEVSLEEVRSAMAHDPCPTCASSSSNSLHAHFSVPSFVGAWNEISYWMRWGIGNGCLVKFWSDRWINRKNLIEIMGDIGILELEKYRPVRDYVLETGEWNWEAFT
GKVNQSVLLAIASIRPPDE
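
The CDS and protein sequences above appear to be related by the standard genetic code: the first codons ATG GCA ATT TGG ... read as M A I W ..., and the final codons before the TGA stop read as P D E. The sketch below illustrates that relationship on short excerpts of the CDS. The helper name translate is hypothetical, and the use of the standard codon table (rather than an alternative one) is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nt and last 12 nt of the CDS listed above.
print(translate("ATGGCAATTTGGAAATCTAAAGGG"))
print(translate("CCAGATGAGTGA"))
```

Passing the full CDS, joined into a single string without line breaks, to the same function is one way to check it against the listed protein sequence.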