; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g06170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g06170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:4376645..4382555
RNA-Seq ExpressionMoc02g06170
SyntenyMoc02g06170
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3477524.1 reverse transcriptase [Gossypium australe]7.3e-1324.43Show/hide
Query:  IYHKCSQRFFVVIEIGSVGMGSEILVKEWGKLSLKSEEEETSVDVDGAAALDIVKRLESMAVWNRVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSF
        +Y      +F   E+  V M +E+++ ++G L  +S                            R+L   PW FD  L  M   V    + ++ F    F
Subjt:  IYHKCSQRFFVVIEIGSVGMGSEILVKEWGKLSLKSEEEETSVDVDGAAALDIVKRLESMAVWNRVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSF

Query:  WVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDPSVKGVVISSEQEG---LQGGVDLSIDGIRNSVASV
        W+   ++PL  M+  IA  L  AI  T  + + V         + QYG+WLR                  VI+   +G    + GV++ +D    +    
Subjt:  WVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDPSVKGVVISSEQEG---LQGGVDLSIDGIRNSVASV

Query:  G--LDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQP----------CRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLN
        G  +D R+ E      K  + D   D ++T+  ++       P          CR    P   ++R     L   +P ++FL ETK    ++  ++ R  
Subjt:  G--LDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQP----------CRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLN

Query:  LHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI-IWKSYSWRFTG
        + GC  VD+ G SG L LM    V V++ ++S YHID+ + +  S + RFTG
Subjt:  LHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI-IWKSYSWRFTG

MBA0701569.1 hypothetical protein [Gossypium aridum]3.1e-1122.32Show/hide
Query:  TSVDVDGAAALD--IVKRLESMAVWNRVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIA---------------------
        T  +VD  A  D  I+ +       +R+L   PW F++ L +M   V   ++ +++F  V FW+   ++P+  M    A                     
Subjt:  TSVDVDGAAALD--IVKRLESMAVWNRVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIA---------------------

Query:  -----------------KRLV--------NAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQ---DNQLRGKGRDPSVKGVVISSEQEGLQGGVDLSIDG
                         +R+V          IGH+   C + V+E   +  + Q+G+W+R      NQ +G  R+    GV + +         D S+  
Subjt:  -----------------KRLV--------NAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQ---DNQLRGKGRDPSVKGVVISSEQEGLQGGVDLSIDG

Query:  IRNSVASVGLDHR-----DVEASTGAIKSGNIDPGGDGVNTNDTMRSAEAS-------DQPCRKQRKPFVG--------MIRVKCDFLSHKNPQVLFLSE
          N      +D +     +   ST  +++  +    +G   + + R  + S       + P R+ ++   G         +R    FL    P +LFL E
Subjt:  IRNSVASVGLDHR-----DVEASTGAIKSGNIDPGGDGVNTNDTMRSAEAS-------DQPCRKQRKPFVG--------MIRVKCDFLSHKNPQVLFLSE

Query:  TKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDA
        TK       R++    + GC  V S G SGGL +M    V+V+I ++S +HID+
Subjt:  TKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDA

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.0e-1131.9Show/hide
Query:  MGSEILVKEWGKLSLKSEEEETSVDVDGAA----------------------ALDIVKRLESMAVW--------------------------NRVLRGGP
        M  E L+ +W K  L SEE+E ++DVD  A                      + D++ R+  +A W                          NRV++ GP
Subjt:  MGSEILVKEWGKLSLKSEEEETSVDVDGAA----------------------ALDIVKRLESMAVW--------------------------NRVLRGGP

Query:  WFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGH-TRLECDE
        WFFDKALIV++K  S+  +S  +F++V+FW+H  DLP+  ++ ++A RL NAIG+   ++C+E
Subjt:  WFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGH-TRLECDE

XP_042939567.1 uncharacterized protein LOC122274609 [Carya illinoinensis]2.1e-1224.11Show/hide
Query:  RVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTR-LECDEVVI------------ERHGQDPSS----QY
        +V+ G PW FD  L V++      +     F KVSFW+H  +LPL CM+  + K++ +++G  R ++  E  +            E+ G+  S     QY
Subjt:  RVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTR-LECDEVVI------------ERHGQDPSS----QY

Query:  GSWLRYQDNQLR---GKGRDPSV--------------KGVVIS---SEQEGLQGGVDLSIDGIRNSVASVGLDH-------RDVEASTGAIKSG------
        G W+R      R   GK +  S               K V +S    E+E    G ++   G+  SV   G ++       ++VE   G           
Subjt:  GSWLRYQDNQLR---GKGRDPSV--------------KGVVIS---SEQEGLQGGVDLSIDGIRNSVASVGLDH-------RDVEASTGAIKSG------

Query:  ----NIDPGGDG------------VNTNDTMRSAEASDQPCRKQRKPFVG-------------------------MIRVKCDFLSHKNPQVLFLSETKSD
            N+   G+               + + M   + +    R + +P  G                          ++  C         ++FL ET S 
Subjt:  ----NIDPGGDG------------VNTNDTMRSAEASDQPCRKQRKPFVG-------------------------MIRVKCDFLSHKNPQVLFLSETKSD

Query:  GIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI--IWKSYSWRFTG
           +ERLK+RL + GCF V+S G  GGL L   ++  V I S+S++HI+A +    +   W FTG
Subjt:  GIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI--IWKSYSWRFTG

XP_042974662.1 uncharacterized protein LOC122306298 [Carya illinoinensis]3.3e-1335.71Show/hide
Query:  IRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI--IWKSYSWRFTGLRL--------
        +R   D +  ++P+VLFL ETK     MER++  +    CFTV S G SGGL L+  + VN+SI SFS +HIDA I  I     W+FT L          
Subjt:  IRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI--IWKSYSWRFTGLRL--------

Query:  -VINLLGT--------------IEGRRMHKFKNRPFRFEEMWTWYEKCGSLIER
           +LL T              I     ++++ + FRFE MW   ++CG +I R
Subjt:  -VINLLGT--------------IEGRRMHKFKNRPFRFEEMWTWYEKCGSLIER

TrEMBL top hitse value%identityAlignment
A0A2N9FGD9 Reverse transcriptase domain-containing protein1.0e-1227.68Show/hide
Query:  RVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRD
        RVL  G W FDK LI+++++      S   F + SFWV   DL + CM++ + +++ N +G  R+E  E   E  G        S +      + G+  D
Subjt:  RVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRD

Query:  PSVKGVVISSEQEGLQG-GVDLSIDGIRNSV-----ASVGLDHRDVEASTGAIKSGN------IDPGGD--GVNTNDTMRSAEA-----------SDQPC
           + +   S     +G G  L     R +        +GL+H ++  S   +  G+      I   G+    N+N  + + EA           S  P 
Subjt:  PSVKGVVISSEQEGLQG-GVDLSIDGIRNSV-----ASVGLDHRDVEASTGAIKSGN------IDPGGD--GVNTNDTMRSAEA-----------SDQPC

Query:  RKQRKPFV---------GMIRVKCDF------------------LSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCL--MRNSTVN
        R  +K  V          +  + C F                  +  K+P VLFLSETK D   +E L+   +  G F V SRG SGGL L  +R +TV+
Subjt:  RKQRKPFV---------GMIRVKCDF------------------LSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCL--MRNSTVN

Query:  VSIWSFSSYHIDAGIIW-KSYSWRFTGLRLVINLLG
        VS  S+S +HIDA + + +  +W FTG  + + + G
Subjt:  VSIWSFSSYHIDAGIIW-KSYSWRFTGLRLVINLLG

A0A2N9FHA2 Uncharacterized protein4.6e-1329.39Show/hide
Query:  VLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDP
        V+   PW FDK LI+ +++   +  S   F+  +FWV   DLP   M+ ++ +++   +G  ++E  E V+E           +W R+   +    G  P
Subjt:  VLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDP

Query:  SVKGVVISSEQE-GLQGGVDLSIDGIRNSVASVGLDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQ------PCRKQRKPFVGMIRVKCDFLS
        S+K    +   E G+ G     + G++     +G   RD+        SG +       + N T   A   D        CR    P    +      + 
Subjt:  SVKGVVISSEQE-GLQGGVDLSIDGIRNSVASVGLDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQ------PCRKQRKPFVGMIRVKCDFLS

Query:  HKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGIIWKS-YSWRFTG
         K+P VLFLS TK D   +E L+      G F V SRG SGGL +  +  + VSI S+S YHIDA + ++S  +WRFTG
Subjt:  HKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGIIWKS-YSWRFTG

A0A2N9G4F4 Uncharacterized protein3.0e-1223.44Show/hide
Query:  SEILVKEWGKLSLKSEEEETSVDVD--------------GAAALDIVKRLESMA---VWNR--VLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWV
        S  L++ W K S+ +EEE   V+                 A   D+   L  +    V +R  V+  GPW FDK L++M        +   + ++ SFW+
Subjt:  SEILVKEWGKLSLKSEEEETSVDVD--------------GAAALDIVKRLESMA---VWNR--VLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWV

Query:  HFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQL-RGKGRDPSVKGVVISSEQEGLQGGVDLSIDGIRNSVASVGLDH
           +LPL  M+  +A  +   +G        V+ E    +    +G +LR +  ++ R  G +    G  + +           S+   + +  ++    
Subjt:  HFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQL-RGKGRDPSVKGVVISSEQEGLQGGVDLSIDGIRNSVASVGLDH

Query:  RDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQPCRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGL
         D         + +  P     +   T     A +       K  V  +    D +   NP +LFL E +   IA+E ++  L  +  FTV S G  GGL
Subjt:  RDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQPCRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGL

Query:  CLMRNSTVNVSIWSFSSYHIDAGIIWKSYS--WRFTG
         +    T+++ I +FS  HIDA I   +    WR TG
Subjt:  CLMRNSTVNVSIWSFSSYHIDAGIIWKSYS--WRFTG

A0A2N9I611 Uncharacterized protein3.8e-1528.67Show/hide
Query:  RVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQL------
        RVL G PW FD+ ++++ +  S      F F     WV   DLPL  M+  + + + NA+G          IE    +    +G +LRY+   L      
Subjt:  RVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQL------

Query:  -RGKGRDPSVKGVVISSEQEG----LQGGVDLSID-GIRNSVASVGLDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQPCRKQRKPFVG----
          G G    V     + +++G    LQ G  L  D  I+ S  S G +    E     + +   +P          +R     ++   ++R    G    
Subjt:  -RGKGRDPSVKGVVISSEQEG----LQGGVDLSID-GIRNSVASVGLDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQPCRKQRKPFVG----

Query:  -----MIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGIIWKSYSWRFTG
              +R   D    K P+VLFLSETK +   ME ++  L     F V S+G SGGL L+ +  V++SI S++ +HIDA I  +   WRFTG
Subjt:  -----MIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGIIWKSYSWRFTG

A0A5B6W8D4 Reverse transcriptase3.5e-1324.43Show/hide
Query:  IYHKCSQRFFVVIEIGSVGMGSEILVKEWGKLSLKSEEEETSVDVDGAAALDIVKRLESMAVWNRVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSF
        +Y      +F   E+  V M +E+++ ++G L  +S                            R+L   PW FD  L  M   V    + ++ F    F
Subjt:  IYHKCSQRFFVVIEIGSVGMGSEILVKEWGKLSLKSEEEETSVDVDGAAALDIVKRLESMAVWNRVLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSF

Query:  WVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDPSVKGVVISSEQEG---LQGGVDLSIDGIRNSVASV
        W+   ++PL  M+  IA  L  AI  T  + + V         + QYG+WLR                  VI+   +G    + GV++ +D    +    
Subjt:  WVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDPSVKGVVISSEQEG---LQGGVDLSIDGIRNSVASV

Query:  G--LDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQP----------CRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLN
        G  +D R+ E      K  + D   D ++T+  ++       P          CR    P   ++R     L   +P ++FL ETK    ++  ++ R  
Subjt:  G--LDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQP----------CRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGIAMERLKQRLN

Query:  LHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI-IWKSYSWRFTG
        + GC  VD+ G SG L LM    V V++ ++S YHID+ + +  S + RFTG
Subjt:  LHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGI-IWKSYSWRFTG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAATGGCTCTCTATCGAGAAATCCGAGAATCGCCCATACCCAGATGCGATTTCAGATCAATAAACCACAGGGAACAAGAATGTCCGCAGAAATGGCCGATG
CACCCAGATCCCTATATCTACCACAAATGTAGCCAGCGATTCTTCGTGGTGATTGAAATTGGAAGTGTAGGGATGGGTTCTGAAATATTGGTTAAGGAATGGGGC
AAGTTGTCGTTGAAGTCAGAGGAGGAAGAGACGTCCGTAGACGTGGATGGGGCAGCGGCACTGGACATAGTAAAAAGATTGGAGTCTATGGCGGTTTGGAATAGG
GTATTGAGAGGAGGGCCGTGGTTCTTCGACAAGGCGTTGATAGTAATGGAGAAACTTGTTTCAACCCTAAAATTGTCTGCCTTTAAATTCAGCAAGGTATCATTC
TGGGTGCACTTTTGTGATTTGCCTTTGGTGTGTATGAGTGCTTCAATTGCAAAACGCCTTGTCAACGCTATAGGACATACTAGATTGGAGTGCGATGAAGTGGTG
ATTGAGAGGCACGGGCAAGATCCAAGTTCCCAATATGGGTCTTGGCTCCGCTACCAGGATAATCAGTTAAGGGGGAAGGGGCGAGACCCCTCAGTGAAGGGGGTG
GTGATTTCTTCGGAACAAGAAGGACTGCAGGGTGGAGTTGATCTTTCAATAGATGGCATAAGGAACTCTGTGGCTAGTGTGGGTTTAGATCATCGGGATGTGGAA
GCTTCGACTGGGGCCATAAAATCTGGCAACATAGACCCAGGAGGTGATGGGGTAAACACAAATGACACGATGAGATCGGCAGAGGCTAGTGACCAGCCCTGCCGG
AAGCAACGAAAACCCTTTGTTGGAATGATCCGGGTTAAGTGTGACTTCTTGAGTCACAAAAATCCCCAAGTGCTTTTCTTGTCGGAAACTAAAAGTGATGGTATA
GCAATGGAGAGACTTAAGCAAAGATTGAATCTACATGGCTGCTTCACAGTTGATAGTAGGGGGTCTAGTGGAGGACTTTGTTTGATGAGGAATTCAACTGTAAAT
GTCTCTATCTGGTCGTTCTCATCATATCATATTGATGCCGGTATTATATGGAAATCATATAGCTGGCGATTTACTGGGTTGCGGTTGGTGATCAATTTACTTGGA
ACGATAGAAGGAAGGAGGATGCACAAATTCAAGAACAGACCATTTCGATTTGAGGAGATGTGGACATGGTACGAGAAGTGTGGATCTCTGATAGAGAGAGGAGAG
AGGTGGCTGGATGGTGAAGCAGAGGGATTAAGTGTTGGGTGTGTTGCCCTAAACTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTAATGGCTCTCTATCGAGAAATCCGAGAATCGCCCATACCCAGATGCGATTTCAGATCAATAAACCACAGGGAACAAGAATGTCCGCAGAAATGGCCGATG
CACCCAGATCCCTATATCTACCACAAATGTAGCCAGCGATTCTTCGTGGTGATTGAAATTGGAAGTGTAGGGATGGGTTCTGAAATATTGGTTAAGGAATGGGGC
AAGTTGTCGTTGAAGTCAGAGGAGGAAGAGACGTCCGTAGACGTGGATGGGGCAGCGGCACTGGACATAGTAAAAAGATTGGAGTCTATGGCGGTTTGGAATAGG
GTATTGAGAGGAGGGCCGTGGTTCTTCGACAAGGCGTTGATAGTAATGGAGAAACTTGTTTCAACCCTAAAATTGTCTGCCTTTAAATTCAGCAAGGTATCATTC
TGGGTGCACTTTTGTGATTTGCCTTTGGTGTGTATGAGTGCTTCAATTGCAAAACGCCTTGTCAACGCTATAGGACATACTAGATTGGAGTGCGATGAAGTGGTG
ATTGAGAGGCACGGGCAAGATCCAAGTTCCCAATATGGGTCTTGGCTCCGCTACCAGGATAATCAGTTAAGGGGGAAGGGGCGAGACCCCTCAGTGAAGGGGGTG
GTGATTTCTTCGGAACAAGAAGGACTGCAGGGTGGAGTTGATCTTTCAATAGATGGCATAAGGAACTCTGTGGCTAGTGTGGGTTTAGATCATCGGGATGTGGAA
GCTTCGACTGGGGCCATAAAATCTGGCAACATAGACCCAGGAGGTGATGGGGTAAACACAAATGACACGATGAGATCGGCAGAGGCTAGTGACCAGCCCTGCCGG
AAGCAACGAAAACCCTTTGTTGGAATGATCCGGGTTAAGTGTGACTTCTTGAGTCACAAAAATCCCCAAGTGCTTTTCTTGTCGGAAACTAAAAGTGATGGTATA
GCAATGGAGAGACTTAAGCAAAGATTGAATCTACATGGCTGCTTCACAGTTGATAGTAGGGGGTCTAGTGGAGGACTTTGTTTGATGAGGAATTCAACTGTAAAT
GTCTCTATCTGGTCGTTCTCATCATATCATATTGATGCCGGTATTATATGGAAATCATATAGCTGGCGATTTACTGGGTTGCGGTTGGTGATCAATTTACTTGGA
ACGATAGAAGGAAGGAGGATGCACAAATTCAAGAACAGACCATTTCGATTTGAGGAGATGTGGACATGGTACGAGAAGTGTGGATCTCTGATAGAGAGAGGAGAG
AGGTGGCTGGATGGTGAAGCAGAGGGATTAAGTGTTGGGTGTGTTGCCCTAAACTCGTAG
Protein sequenceShow/hide protein sequence
MLMALYREIRESPIPRCDFRSINHREQECPQKWPMHPDPYIYHKCSQRFFVVIEIGSVGMGSEILVKEWGKLSLKSEEEETSVDVDGAAALDIVKRLESMAVWNR
VLRGGPWFFDKALIVMEKLVSTLKLSAFKFSKVSFWVHFCDLPLVCMSASIAKRLVNAIGHTRLECDEVVIERHGQDPSSQYGSWLRYQDNQLRGKGRDPSVKGV
VISSEQEGLQGGVDLSIDGIRNSVASVGLDHRDVEASTGAIKSGNIDPGGDGVNTNDTMRSAEASDQPCRKQRKPFVGMIRVKCDFLSHKNPQVLFLSETKSDGI
AMERLKQRLNLHGCFTVDSRGSSGGLCLMRNSTVNVSIWSFSSYHIDAGIIWKSYSWRFTGLRLVINLLGTIEGRRMHKFKNRPFRFEEMWTWYEKCGSLIERGE
RWLDGEAEGLSVGCVALNS