; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr7:1405432..1411500
RNA-Seq ExpressionMoc07g01830
SyntenyMoc07g01830
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4391976.1 hypothetical protein F8388_004305 [Cannabis sativa]2.2e-3740.1Show/hide
Query:  KGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTVKSGYRL
        +G RWRVGNG  + +  +KW+PRP       PF  + +  VSSL+    EW+ + + N FH+EDV  IL IP+     +D L+W ++KDG Y VKSGYR+
Subjt:  KGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTVKSGYRL

Query:  ASD-ELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPFPYMNHPEKL
        A +  L     SN      WWK  W L +P +MK FGWK  +++LP  T L +RGM ++  C  CG   ET  HA+W+    K +WK  P   +    + 
Subjt:  ASD-ELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPFPYMNHPEKL

Query:  NYMSNLL
        N M +LL
Subjt:  NYMSNLL

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]5.0e-3438.83Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV
        G  LL  G RWRVG G  + +  DKW+P P +F    P  G+ + KVS L+L    W+ + + + F   +V +IL+IP+   R DDS++WHY KDG YTV
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV

Query:  KSGYRLASDELLLGE------QSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKET
        KSG  LAS+   + E       S+++   Q W  LW L++ +K+K F W+A K FLP    L  R +   + C +CG   ET  H +WS   +K +WK  
Subjt:  KSGYRLASDELLLGE------QSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKET

Query:  PFPYMN
         F ++N
Subjt:  PFPYMN

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]2.0e-3536.73Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV
        G Q++ KG+RWR+GNG ++ I  D W+PRP +F   FP        V+ LI  +++WD  ++  +F + D   IL IPLP  + +D +LWHY K G Y+V
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV

Query:  KSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPF
        KSGY+LA         S ++A  ++W  LW+L++P K+K F W+A  + LP+   L  R +  +  C +C    ET  HA+      + +W ++PF
Subjt:  KSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPF

XP_030483251.1 uncharacterized protein LOC115699848 [Cannabis sativa]2.9e-3439.59Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV
        G ++LS+G RWRVGNG  + +  DKW+PRP     T   R     K+ + I     W    V  +FH ED+  I  IP+     +D L W Y+ +G Y V
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV

Query:  KSGYRLASDELLLGEQ-SNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPF
        KSGYR+  +  L   + SN   + +WWK LWSL++P  MK FGW+   ++LPT T L +RGM++   C  CG   ET  HA+W+ +  K +WK  P+
Subjt:  KSGYRLASDELLLGEQ-SNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPF

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]2.9e-3437.56Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV
        G++++ KG RWR+GN + V +  D W+PRP +F             V  L  P+  WD+  V   F+  D   IL++P  +  ++D +LWHYSKDG Y+V
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV

Query:  KSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKC-GGMAETPCHAIWSRNHNKHLWKETPF
        +SGYR+A+   +   QS+++A  +WWK LW LK+P K+K F WK    ++PT   LA+R + V+  C +C  G  ET  H +W+   N+ +W    F
Subjt:  KSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKC-GGMAETPCHAIWSRNHNKHLWKETPF

TrEMBL top hitse value%identityAlignment
A0A803NGM9 Uncharacterized protein1.9e-3939.51Show/hide
Query:  KETRFFCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDS
        + T  F       G+++++ G RWRVGNG+ V +  D W+PRP SF          +  V+ L   +  WD N V + F+ ED + IL++P     ++D 
Subjt:  KETRFFCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDS

Query:  LLWHYSKDGIYTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGG-MAETPCHAIWSRNH
        ++WHYSK+G YTVKSGY++A+       QS+ Q    WWK LW LK+P K+K F WK   +++PT   LA RG+DVD  C +C G + ET  HA+W    
Subjt:  LLWHYSKDGIYTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGG-MAETPCHAIWSRNH

Query:  NKHLW
        +K +W
Subjt:  NKHLW

A0A803PHH5 Uncharacterized protein7.3e-3939.44Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV
        G +++ +G RWRVGNG  + +  DKW+PRP       P   + +  VSSL+    EW+ + + N FH+EDV  IL IP+     +D L+W ++KDG Y V
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV

Query:  KSGYRLASD-ELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPFPYM
        KSGYR+A +  L     SN      WWK  WSL +P +MK FGWK  +++LP  T L +RGM ++  C  C    ET  HA+W+    K +WK  P   +
Subjt:  KSGYRLASD-ELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPFPYM

Query:  NHPEKLNYMSNLL
            + N M +LL
Subjt:  NHPEKLNYMSNLL

A0A803PQQ6 Uncharacterized protein1.9e-3941.09Show/hide
Query:  FCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHY
        F       G++LL KG RWR+GNG+ V +  D W+PRP +F          +  V  L L +  WD   +  +F+EEDV  ILN+P     V+D ++WHY
Subjt:  FCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHY

Query:  SKDGIYTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKC-GGMAETPCHAIWSRNHNKHLW
        +K+G YTV+SGYRLA D   +  QSN +   QWW+ LW  KVP K+K F WK    +LPT   L+ RG+ V   C +C GG  E   H +W  + +K +W
Subjt:  SKDGIYTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKC-GGMAETPCHAIWSRNHNKHLW

Query:  KE
        K+
Subjt:  KE

A0A803PZ98 Uncharacterized protein8.0e-3836.44Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV
        G +LL KG RW VGNG ++ I  D+W+PR   FT     +  ++  ++SL+ P+  W  NEV + FH +D+  +L+I  P     D + W  S +G+Y+V
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGIYTV

Query:  KSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPFPYMN
         SGY+L      + E SN   +  WWK +W   +  K+K F W+ F  ++PT   LA RGM +DT C  C    E  CHA+W  +  +++WK   FP + 
Subjt:  KSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKETPFPYMN

Query:  HPEKLNYMSNLLWWFWSTLPNDTWF
         P  L   +++LWW    LPN+ +F
Subjt:  HPEKLNYMSNLLWWFWSTLPNDTWF

A0A803Q852 Uncharacterized protein1.9e-3938.53Show/hide
Query:  EEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKD
        +E   G +++ +G RWRVGNG  + +  DKW+PRP    T  P   + +  VSSL+    +W+ + +   FH+EDV  IL IP+     +D+L+W ++KD
Subjt:  EEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKD

Query:  GIYTVKSGYRLASD-ELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKET
        G Y VKSGYR+A +  L     SN   +  WWK  W+L +P +MK FGWK  +++LP  + L +RGM +DT C  CG   E+  HA+W+    K +WK  
Subjt:  GIYTVKSGYRLASD-ELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLWKET

Query:  PFPYMNHPEKLNYMSNLL
        P+  +    K + M +LL
Subjt:  PFPYMNHPEKLNYMSNLL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.4e-1027.17Show/hide
Query:  LLSKGIRWRVGNGDQVFIQGDKWVP-RPGSFTTTFPFRGDSDKKVS-SLILPNSEWDRNEV----TNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGI
        ++S G+ W  G+G Q+    D+WV  +P           D D  V+  L +P   WD  ++    TNN   E    +L++    R   D L W +S+DG 
Subjt:  LLSKGIRWRVGNGDQVFIQGDKWVP-RPGSFTTTFPFRGDSDKKVS-SLILPNSEWDRNEV----TNNFHEEDVQSILNIPLPKRRVDDSLLWHYSKDGI

Query:  YTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAI
        ++V+S Y     E+L  ++     M  ++  LW ++VP ++KTF W      + T      R +     C  C G  E+  H +
Subjt:  YTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAI

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein9.4e-1526.87Show/hide
Query:  GEQLLSKGIRWRVGNGDQVFIQGDKWV----PRPGSFTTTFPFRGDSDKKVSSLILPNSE---WDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYS
        G  LL KG R  +G+G  + I  D  V    PRP +   T+      +  +++L         WD ++++    + D   I  I L K +  D ++W+Y+
Subjt:  GEQLLSKGIRWRVGNGDQVFIQGDKWV----PRPGSFTTTFPFRGDSDKKVSSLILPNSE---WDRNEVTNNFHEEDVQSILNIPLPKRRVDDSLLWHYS

Query:  KDGIYTVKSGYRLASDE--LLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLW
          G YTV+SGY L + +    +   +           +W+L +  K+K F W+A    L T   L  RGM +D  C +C    E+  HA+++       W
Subjt:  KDGIYTVKSGYRLASDE--LLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHNKHLW

Query:  K
        +
Subjt:  K

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.9e-0533.9Show/hide
Query:  WWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWS
        W  ++WSLK+  K+K   WKA  + LP G  L +R + ++  C +C    ET  H +++
Subjt:  WWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWS

AT4G29090.1 Ribonuclease H-like superfamily protein5.0e-1626.21Show/hide
Query:  FCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWV-PRPGSFTTTFP-------FRGDSDKKVSSLILPNS-EWDRNEVTNNFHEEDVQSILNIPLPKRR
        F  +  +  +++L +G R  VGNG+ + I   KW+  +P S                 S  KVS LI  +  EW ++ +   F E + + I  +    RR
Subjt:  FCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWV-PRPGSFTTTFP-------FRGDSDKKVSSLILPNS-EWDRNEVTNNFHEEDVQSILNIPLPKRR

Query:  VDDSLLWHYSKDGIYTVKSGYRLASDELLLGEQSNSQAMFQ-----WWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPC
        + DS  W Y+  G YTVKSGY + +   ++ ++S+ Q + +      ++++W  +   K++ F WK   + LP    LA R +  ++ C +C    ET  
Subjt:  VDDSLLWHYSKDGIYTVKSGYRLASDELLLGEQSNSQAMFQ-----WWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPC

Query:  HAIWSRNHNKHLW--KETPFPYMNHPEKLNYMSNLLWWFWSTLPNDTW
        H ++     +  W     P P         Y+ NL W F     N  W
Subjt:  HAIWSRNHNKHLW--KETPFPYMNHPEKLNYMSNLLWWFWSTLPNDTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAAGTCTCATCTCTATCAAGTTGAATTCCTCAAATTACCTATTGTGGAAGTCACAGGTGCTACCGCTTATACGAACACTTGGCCTGGAACATCACCTTACCGAAA
AGGCTCCCACAGAATCATTGCTGAAGATGTGTTAGCATTGATTGAAGGTACTGAAACTGCAAAAGAGAGGCAGAAGAAAAAGCTAGCCAAATTCAAGTTAATCAAGCTTT
CTACTCAAACCGAGGAAGTGGTAATGGGAGAGGACAACAATTCTCCTCTCGAGGAAGAGGATTTCATCATAGTAGGCAACCAAAGGATCACAACAAATCGAAGCAAGGAT
AACAATATCTATGCTGATTCTGGAGCAACGTCACACGTAGTGAATGATCCAGGCCTCGACAAGGAAACAAGATTCTTTTGTTTGGAAGAGCAGTATATGGGGGAACAACT
TCTTTCTAAAGGTATCCGGTGGAGGGTTGGAAATGGTGATCAAGTTTTTATCCAAGGAGACAAATGGGTGCCTCGGCCAGGGTCTTTCACTACTACGTTTCCCTTTAGGG
GAGATTCCGATAAGAAAGTCTCTTCTCTCATTCTTCCAAATAGTGAGTGGGACAGAAATGAGGTTACAAATAATTTTCATGAGGAGGATGTGCAGTCAATCCTCAACATT
CCACTCCCAAAGAGAAGGGTTGATGACTCTTTATTATGGCATTACTCCAAGGATGGTATCTATACAGTGAAAAGTGGGTATCGATTGGCTAGTGATGAGCTACTGTTAGG
GGAGCAGTCGAATTCTCAGGCTATGTTTCAATGGTGGAAGGAGTTGTGGTCGCTTAAAGTGCCTAGCAAAATGAAAACGTTTGGGTGGAAAGCTTTTAAAGATTTCCTCC
CGACGGGAACAACCCTTGCTAATCGAGGAATGGATGTAGACACTAAATGCTTTAAATGTGGTGGAATGGCCGAGACCCCATGTCATGCTATATGGAGTCGTAATCACAAT
AAACACCTTTGGAAGGAAACACCTTTTCCCTATATGAATCACCCTGAGAAACTTAATTATATGTCTAATCTGTTATGGTGGTTTTGGTCAACTCTCCCAAACGATACGTG
GTTTGTTCTTAGCTATGTGTTGTGCAATTTGGGGAAACAGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCAAGTCTCATCTCTATCAAGTTGAATTCCTCAAATTACCTATTGTGGAAGTCACAGGTGCTACCGCTTATACGAACACTTGGCCTGGAACATCACCTTACCGAAA
AGGCTCCCACAGAATCATTGCTGAAGATGTGTTAGCATTGATTGAAGGTACTGAAACTGCAAAAGAGAGGCAGAAGAAAAAGCTAGCCAAATTCAAGTTAATCAAGCTTT
CTACTCAAACCGAGGAAGTGGTAATGGGAGAGGACAACAATTCTCCTCTCGAGGAAGAGGATTTCATCATAGTAGGCAACCAAAGGATCACAACAAATCGAAGCAAGGAT
AACAATATCTATGCTGATTCTGGAGCAACGTCACACGTAGTGAATGATCCAGGCCTCGACAAGGAAACAAGATTCTTTTGTTTGGAAGAGCAGTATATGGGGGAACAACT
TCTTTCTAAAGGTATCCGGTGGAGGGTTGGAAATGGTGATCAAGTTTTTATCCAAGGAGACAAATGGGTGCCTCGGCCAGGGTCTTTCACTACTACGTTTCCCTTTAGGG
GAGATTCCGATAAGAAAGTCTCTTCTCTCATTCTTCCAAATAGTGAGTGGGACAGAAATGAGGTTACAAATAATTTTCATGAGGAGGATGTGCAGTCAATCCTCAACATT
CCACTCCCAAAGAGAAGGGTTGATGACTCTTTATTATGGCATTACTCCAAGGATGGTATCTATACAGTGAAAAGTGGGTATCGATTGGCTAGTGATGAGCTACTGTTAGG
GGAGCAGTCGAATTCTCAGGCTATGTTTCAATGGTGGAAGGAGTTGTGGTCGCTTAAAGTGCCTAGCAAAATGAAAACGTTTGGGTGGAAAGCTTTTAAAGATTTCCTCC
CGACGGGAACAACCCTTGCTAATCGAGGAATGGATGTAGACACTAAATGCTTTAAATGTGGTGGAATGGCCGAGACCCCATGTCATGCTATATGGAGTCGTAATCACAAT
AAACACCTTTGGAAGGAAACACCTTTTCCCTATATGAATCACCCTGAGAAACTTAATTATATGTCTAATCTGTTATGGTGGTTTTGGTCAACTCTCCCAAACGATACGTG
GTTTGTTCTTAGCTATGTGTTGTGCAATTTGGGGAAACAGAAATAA
Protein sequenceShow/hide protein sequence
MCKSHLYQVEFLKLPIVEVTGATAYTNTWPGTSPYRKGSHRIIAEDVLALIEGTETAKERQKKKLAKFKLIKLSTQTEEVVMGEDNNSPLEEEDFIIVGNQRITTNRSKD
NNIYADSGATSHVVNDPGLDKETRFFCLEEQYMGEQLLSKGIRWRVGNGDQVFIQGDKWVPRPGSFTTTFPFRGDSDKKVSSLILPNSEWDRNEVTNNFHEEDVQSILNI
PLPKRRVDDSLLWHYSKDGIYTVKSGYRLASDELLLGEQSNSQAMFQWWKELWSLKVPSKMKTFGWKAFKDFLPTGTTLANRGMDVDTKCFKCGGMAETPCHAIWSRNHN
KHLWKETPFPYMNHPEKLNYMSNLLWWFWSTLPNDTWFVLSYVLCNLGKQK