; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g26170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g26170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:19020185..19025410
RNA-Seq ExpressionMoc04g26170
SyntenyMoc04g26170
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]1.4e-2027.27Show/hide
Query:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNLVCPTDLARKTSRCMNLLFALGRSKVGSYKEKICVATSRVQKAIANLHYHPTRDD
        + EFR A+ DC LLDLG +G PFTW+NRR  A ++F                 +K    +  L    + + G  ++++    ++++    +  ++   D+
Subjt:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNLVCPTDLARKTSRCMNLLFALGRSKVGSYKEKICVATSRVQKAIANLHYHPTRDD

Query:  LFAAEASLEAILLEEEF-----------------------------RINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCW
        L   E  ++ IL +EE                              + N I  +LD  G W  D  E+E +   +F++LF+++ P+ E +          
Subjt:  LFAAEASLEAILLEEEF-----------------------------RINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCW

Query:  IDDQMNQALTRPFTEQDILCALKQIHPYKAPGPDGLSGAFYR---GVAKESLL
        ++++MN  L  PF E++I+ AL Q+ P KAPGPDGL  AF++   G  KE ++
Subjt:  IDDQMNQALTRPFTEQDILCALKQIHPYKAPGPDGLSGAFYR---GVAKESLL

XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia]2.6e-1935.9Show/hide
Query:  DHLPIQDFEEMVVFLWGIWSLRNG--LRSGGARLVGSLGSWTAQYVASFRFVRRSP--PLSSTSRCSRPRCRWRSPISGSFKLNTDAVFCSISLRAGLGI
        D +     EE+ VFLW IW+ RN     +G   L+ ++  W A Y+  ++  +  P  PL    R  R    W  P++  FK+N DA F   +  AGL I
Subjt:  DHLPIQDFEEMVVFLWGIWSLRNG--LRSGGARLVGSLGSWTAQYVASFRFVRRSP--PLSSTSRCSRPRCRWRSPISGSFKLNTDAVFCSISLRAGLGI

Query:  VIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNLRWRCISLVLLVLVSLIEKE
        +IRDST +VLL+A  ++ H   V LAE  A ++G+ LA+++G  P  +E+DS +   LL  D  D SE+G+L S++R    SL +    S + +E
Subjt:  VIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNLRWRCISLVLLVLVSLIEKE

XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]6.9e-2540.62Show/hide
Query:  LLADLQDHLPIQDFEEMVVFLWGIWSLRNGLRSGGARLV-GSLGSWTAQYVASFRF-----------VRRSPPLSSTSRCSRPRCRWRSPISGSFKLNTD
        +L D +D L  +DFEE+VVFLW +W+ RN       R+    L  W + Y+A+F+            V +S   SS    ++    W     G FKL TD
Subjt:  LLADLQDHLPIQDFEEMVVFLWGIWSLRNGLRSGGARLV-GSLGSWTAQYVASFRF-----------VRRSPPLSSTSRCSRPRCRWRSPISGSFKLNTD

Query:  AVFCSISLRAGLG-IVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNLR
        A F SI   AGLG I+IRD  G VL +ATKYL H  SVD AEA A  +GL +A+++G  PI++E+DSLR   L  RD   LS+ G ++  ++
Subjt:  AVFCSISLRAGLG-IVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNLR

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]4.2e-2238.16Show/hide
Query:  WWKSKFSHLVHGRSVP---NIFVLLADLQDHLPIQDFEEMVVFLWGIWSLRN----GLRSGGARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRCSRPRC
        W  SKFSH +H   VP   ++   +    D +  Q    +VV LW IW+ RN        GG+  +  L SW+  Y+  ++  +RS   S+   C  PR 
Subjt:  WWKSKFSHLVHGRSVP---NIFVLLADLQDHLPIQDFEEMVVFLWGIWSLRN----GLRSGGARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRCSRPRC

Query:  R-WRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVG
          WR P +   K+N DA F   S  AG+G++IRDSTG V L A + L     VD  E FA+ +G+ LAV++GF    +E+DSLR   LL  D  D SEVG
Subjt:  R-WRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVG

Query:  LLLSNLR
        +L S ++
Subjt:  LLLSNLR

XP_030940247.1 uncharacterized protein LOC115965211 [Quercus lobata]1.4e-2031.67Show/hide
Query:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCN---LVCPTDLARKTSRCMNLLFALGRS--KVGSYKEKICVATSRVQKAIANLHYH
        MR+FR  +D+CG  DLGF G  FTW  R     +V+ERLDR + N   +      + K +    +    G     +     ++ +   ++ +  + LH+ 
Subjt:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCN---LVCPTDLARKTSRCMNLLFALGRS--KVGSYKEKICVATSRVQKAIANLHYH

Query:  PTRDD---LFAAEASLEAILLEEEFRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCA
           D     F  +AS       + +R N I  L + +G W     ++  +   Y+S LFT+S PS+  L  VL ++P  + D MN  L +PF +Q++  A
Subjt:  PTRDD---LFAAEASLEAILLEEEFRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCA

Query:  LKQIHPYKAPGPDGLSGAFYR
        LKQ+ P KAPGPDG+   FY+
Subjt:  LKQIHPYKAPGPDGLSGAFYR

TrEMBL top hitse value%identityAlignment
A0A2N9FTD3 Uncharacterized protein5.4e-2320.8Show/hide
Query:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRV--------------------------------------------------LCNLVCPT
        M+ FR A+DDC  +DLG+ G+ FTW N R   A ++ERL R+                                                  + +  C  
Subjt:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRV--------------------------------------------------LCNLVCPT

Query:  DLAR----------------KTSRCMNLLFALGRSKVGSYKEKICVATSRVQKAIANLHYHPTRDDLFAAEASLEAILLEEE------------------
         +A                 K ++C N L    +S  GS ++++      ++ A        +   +      +  +L +EE                  
Subjt:  DLAR----------------KTSRCMNLLFALGRSKVGSYKEKICVATSRVQKAIANLHYHPTRDDLFAAEASLEAILLEEE------------------

Query:  -----------FRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCALKQIHPYKAPGPD
                    R N I  L D  G W+ +P +++ M+++YF ++F +S PS  ++  VL   P  I D MN+AL++P+T   +  ALKQ+ P  APGPD
Subjt:  -----------FRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCALKQIHPYKAPGPD

Query:  GLSGAFYRG---VAKESLLLYEQASGQTINFEKSIISFSLNTVAASQVSLIYCSVYALSSTIFGVAIF------YAAE----------------------
        G    FY+    +  E ++   +    ++N EK + +F+ + + A QV  +  +   L   +F    F      YA +                      
Subjt:  GLSGAFYRG---VAKESLLLYEQASGQTINFEKSIISFSLNTVAASQVSLIYCSVYALSSTIFGVAIF------YAAE----------------------

Query:  ---------------------SPPNFQ-----FFEGQNLEKPPRLARFVLPTASPSTVECVIFLLLRVIGMNAGIFNIKSGYLLAQRQLTPSGPS-----
                             SP N       + +  ++  PP++  F+    S S +   + L  R +  NA            + ++   G +     
Subjt:  ---------------------SPPNFQ-----FFEGQNLEKPPRLARFVLPTASPSTVECVIFLLLRVIGMNAGIFNIKSGYLLAQRQLTPSGPS-----

Query:  LSNLAWILRWWKSKFSHLVHGRSVPNIFVLLADLQDHLPIQDFEEM----VVFLWGIWSLRNGLRSGGARLVGSLGSWTAQYVASFR-FVRRSPPLSSTS
        +++  W  + W  +F+  +             DL   + +++  E+     +  W +W  RN  R   +          ++ +A ++ F+  + P S + 
Subjt:  LSNLAWILRWWKSKFSHLVHGRSVPNIFVLLADLQDHLPIQDFEEM----VVFLWGIWSLRNGLRSGGARLVGSLGSWTAQYVASFR-FVRRSPPLSSTS

Query:  RCSRPRCRWRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLER
        +  RP   W  P  G +K N D      +  AGLG++IRDS G ++    + +P+ GSV++ EA A +  ++ A++ G   I +E DS + I  + +
Subjt:  RCSRPRCRWRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLER

A0A2N9HFT1 Uncharacterized protein1.0e-2129.2Show/hide
Query:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNL----------VCPTDLARKTSRCMNLLFALGRSKVGSYK---EKICVATSRVQK
        M+ FR A+DDCG +DLG+ G+PFTW N R   A V+E+LDR++ +           V   D      + + L   +  +++ +     E++ ++ +   +
Subjt:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNL----------VCPTDLARKTSRCMNLLFALGRSKVGSYK---EKICVATSRVQK

Query:  AIANL---HYHPTRDDLFAAE----------------ASLEAILLEEE-----------------------------FRINLITSLLDVHGHWQADPMEM
         I N    H    R++L  AE                A +  +L +EE                              R N I  L D  G W+ DP ++
Subjt:  AIANL---HYHPTRDDLFAAE----------------ASLEAILLEEE-----------------------------FRINLITSLLDVHGHWQADPMEM

Query:  EGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCALKQIHPYKAPGPDGLSGAFYR
        + ++ +YF ++F SS PS  ++  VL  +P  I D MN+AL+RP+T  ++  AL+Q+ P  APGPDGL   FY+
Subjt:  EGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCALKQIHPYKAPGPDGLSGAFYR

A0A2N9HKS5 Reverse transcriptase domain-containing protein1.2e-2232.3Show/hide
Query:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNLVCPTDLARKTSRCMNLLFALGRSKV-------GSYKEKICVATSRVQKAIANLH
        M++F + ++ CGL+DLGFRG PFTW NRR   AL+ +RLDR L N      L RK    +  L ++ +S +       G     I    + + K + +  
Subjt:  MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNLVCPTDLARKTSRCMNLLFALGRSKV-------GSYKEKICVATSRVQKAIANLH

Query:  YH---PTRDDLFAAEASLEAIL---LEEEFRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQ
         H    +R    AA  S          +  R N ++ L +    W  D  ++E +  +YF  +F +S P    L   L +V   +  ++NQ L +PFT  
Subjt:  YH---PTRDDLFAAEASLEAIL---LEEEFRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQ

Query:  DILCALKQIHPYKAPGPDGLSGAFYR
        ++  AL Q+HP KAPGPDG+S  F++
Subjt:  DILCALKQIHPYKAPGPDGLSGAFYR

A0A6J1CDQ4 uncharacterized protein LOC1110105333.4e-2540.62Show/hide
Query:  LLADLQDHLPIQDFEEMVVFLWGIWSLRNGLRSGGARLV-GSLGSWTAQYVASFRF-----------VRRSPPLSSTSRCSRPRCRWRSPISGSFKLNTD
        +L D +D L  +DFEE+VVFLW +W+ RN       R+    L  W + Y+A+F+            V +S   SS    ++    W     G FKL TD
Subjt:  LLADLQDHLPIQDFEEMVVFLWGIWSLRNGLRSGGARLV-GSLGSWTAQYVASFRF-----------VRRSPPLSSTSRCSRPRCRWRSPISGSFKLNTD

Query:  AVFCSISLRAGLG-IVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNLR
        A F SI   AGLG I+IRD  G VL +ATKYL H  SVD AEA A  +GL +A+++G  PI++E+DSLR   L  RD   LS+ G ++  ++
Subjt:  AVFCSISLRAGLG-IVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNLR

A0A6J1DBJ7 uncharacterized protein LOC1110189732.0e-2238.16Show/hide
Query:  WWKSKFSHLVHGRSVP---NIFVLLADLQDHLPIQDFEEMVVFLWGIWSLRN----GLRSGGARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRCSRPRC
        W  SKFSH +H   VP   ++   +    D +  Q    +VV LW IW+ RN        GG+  +  L SW+  Y+  ++  +RS   S+   C  PR 
Subjt:  WWKSKFSHLVHGRSVP---NIFVLLADLQDHLPIQDFEEMVVFLWGIWSLRN----GLRSGGARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRCSRPRC

Query:  R-WRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVG
          WR P +   K+N DA F   S  AG+G++IRDSTG V L A + L     VD  E FA+ +G+ LAV++GF    +E+DSLR   LL  D  D SEVG
Subjt:  R-WRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVG

Query:  LLLSNLR
        +L S ++
Subjt:  LLLSNLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.5e-0526.62Show/hide
Query:  LWGIWSLRNGLRSGG---------ARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRCSRP-RCRWRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGS
        LW +W  RN L   G          R +     W+ +        R     +S  +  R    +W++P     K NTDA +   + R G+G ++R+ +G 
Subjt:  LWGIWSLRNGLRSGG---------ARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRCSRP-RCRWRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGS

Query:  VLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERD
        VL    + LP   +V  AE  AL   +       ++ II ESD+   + LL  D
Subjt:  VLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCGAGTTTCGCAATGCTATGGATGACTGTGGCCTTTTGGACTTGGGTTTTCGGGGCTCACCATTTACTTGGACTAATCGACGGCCTTGGGCTGCTCTGGTGTTTGA
GCGTTTGGATCGAGTCCTCTGTAATCTGGTTTGTCCTACTGATCTAGCTCGCAAAACTTCCCGTTGTATGAATCTTCTATTTGCTTTGGGACGTTCAAAAGTTGGCAGCT
ATAAGGAGAAAATTTGTGTGGCGACGTCCCGGGTGCAGAAGGCAATAGCTAACCTTCATTACCACCCTACTCGGGATGATCTTTTTGCAGCTGAAGCCTCTTTGGAGGCT
ATTCTACTGGAGGAGGAGTTCCGAATCAATTTGATCACCAGTTTATTAGATGTGCATGGTCATTGGCAGGCTGATCCAATGGAAATGGAAGGTATGGTCTCTACCTATTT
TTCCTCCCTTTTCACCTCTTCTCTTCCTTCTGAGGAGACTCTTGGGCGTGTTCTAGCATCAGTGCCTTGCTGGATTGATGATCAAATGAATCAGGCGCTTACCCGGCCTT
TTACTGAGCAAGATATCCTCTGTGCACTGAAACAAATTCACCCCTACAAGGCTCCTGGTCCCGATGGGTTGTCTGGGGCATTTTATCGTGGGGTGGCAAAAGAGTCTCTA
CTTCTTTATGAGCAAGCATCGGGTCAGACTATTAATTTTGAGAAATCCATTATCTCTTTTAGTCTTAACACGGTGGCGGCTTCTCAGGTGTCTCTTATTTACTGTTCCGT
ATACGCGTTGTCATCAACAATATTTGGGGTTGCCATCTTTTATGCCGCGGAATCGCCGCCAAACTTTCAGTTTTTTGAAGGACAGAATTTGGAGAAACCTCCAAGGTTGG
CGAGGTTCGTTCTCCCCACCGCCTCCCCTTCAACAGTCGAGTGTGTGATCTTTTTACTGCTTCGGGTCATTGGCATGAACGCGGGTATTTTTAATATCAAGAGTGGTTAT
CTTTTAGCACAGAGGCAACTTACTCCTTCTGGGCCTTCGTTGTCAAATCTGGCGTGGATTTTGCGCTGGTGGAAGTCCAAATTTAGCCACCTTGTTCATGGGCGGAGTGT
TCCAAATATTTTTGTTTTACTGGCTGACTTACAGGATCATCTCCCAATACAAGATTTTGAGGAAATGGTTGTTTTTCTTTGGGGAATTTGGTCACTTCGGAATGGTCTCC
GTTCTGGTGGTGCGCGTCTAGTGGGTTCTTTAGGAAGCTGGACTGCTCAATATGTTGCCTCATTTAGATTTGTCAGGCGATCTCCACCGCTATCTTCCACTTCCCGTTGC
TCGCGCCCTCGGTGCAGGTGGCGCTCGCCCATTAGCGGTTCCTTTAAATTAAACACCGATGCTGTCTTTTGCTCTATTTCTCTGCGGGCTGGTTTGGGGATTGTGATCCG
AGATTCTACAGGAAGTGTTTTGCTCGCTGCTACAAAGTATTTGCCCCACTGTGGTTCAGTTGATCTGGCAGAAGCCTTTGCTCTTGAAAAGGGTCTGTCTTTGGCAGTCG
ATTCGGGTTTTCGGCCTATAATCATGGAGTCTGATTCTTTGCGTTGTATCACCTTGCTGGAACGGGACGTTGCGGACTTGTCGGAAGTGGGTCTTTTGCTTTCTAATCTC
CGGTGGCGTTGCATCAGTCTGGTTCTGTTGGTTTTGGTTTCACTCATCGAGAAGGAAATCGTGCGGCTGACTGTCTCACTCGTTTTGCTTTGGCGCACCAGTTTGTTGAA
GTTCTTCGTCTTGGGTTTGAGAAGTTCTTCGGCTTTGCAGCTTGTGAAGAATTTGCAACTTGATGGAGTCTTCGGCTTCAAGTCTTGTGAAGTTCTTCGGCTTATAGCTT
GGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCGAGTTTCGCAATGCTATGGATGACTGTGGCCTTTTGGACTTGGGTTTTCGGGGCTCACCATTTACTTGGACTAATCGACGGCCTTGGGCTGCTCTGGTGTTTGA
GCGTTTGGATCGAGTCCTCTGTAATCTGGTTTGTCCTACTGATCTAGCTCGCAAAACTTCCCGTTGTATGAATCTTCTATTTGCTTTGGGACGTTCAAAAGTTGGCAGCT
ATAAGGAGAAAATTTGTGTGGCGACGTCCCGGGTGCAGAAGGCAATAGCTAACCTTCATTACCACCCTACTCGGGATGATCTTTTTGCAGCTGAAGCCTCTTTGGAGGCT
ATTCTACTGGAGGAGGAGTTCCGAATCAATTTGATCACCAGTTTATTAGATGTGCATGGTCATTGGCAGGCTGATCCAATGGAAATGGAAGGTATGGTCTCTACCTATTT
TTCCTCCCTTTTCACCTCTTCTCTTCCTTCTGAGGAGACTCTTGGGCGTGTTCTAGCATCAGTGCCTTGCTGGATTGATGATCAAATGAATCAGGCGCTTACCCGGCCTT
TTACTGAGCAAGATATCCTCTGTGCACTGAAACAAATTCACCCCTACAAGGCTCCTGGTCCCGATGGGTTGTCTGGGGCATTTTATCGTGGGGTGGCAAAAGAGTCTCTA
CTTCTTTATGAGCAAGCATCGGGTCAGACTATTAATTTTGAGAAATCCATTATCTCTTTTAGTCTTAACACGGTGGCGGCTTCTCAGGTGTCTCTTATTTACTGTTCCGT
ATACGCGTTGTCATCAACAATATTTGGGGTTGCCATCTTTTATGCCGCGGAATCGCCGCCAAACTTTCAGTTTTTTGAAGGACAGAATTTGGAGAAACCTCCAAGGTTGG
CGAGGTTCGTTCTCCCCACCGCCTCCCCTTCAACAGTCGAGTGTGTGATCTTTTTACTGCTTCGGGTCATTGGCATGAACGCGGGTATTTTTAATATCAAGAGTGGTTAT
CTTTTAGCACAGAGGCAACTTACTCCTTCTGGGCCTTCGTTGTCAAATCTGGCGTGGATTTTGCGCTGGTGGAAGTCCAAATTTAGCCACCTTGTTCATGGGCGGAGTGT
TCCAAATATTTTTGTTTTACTGGCTGACTTACAGGATCATCTCCCAATACAAGATTTTGAGGAAATGGTTGTTTTTCTTTGGGGAATTTGGTCACTTCGGAATGGTCTCC
GTTCTGGTGGTGCGCGTCTAGTGGGTTCTTTAGGAAGCTGGACTGCTCAATATGTTGCCTCATTTAGATTTGTCAGGCGATCTCCACCGCTATCTTCCACTTCCCGTTGC
TCGCGCCCTCGGTGCAGGTGGCGCTCGCCCATTAGCGGTTCCTTTAAATTAAACACCGATGCTGTCTTTTGCTCTATTTCTCTGCGGGCTGGTTTGGGGATTGTGATCCG
AGATTCTACAGGAAGTGTTTTGCTCGCTGCTACAAAGTATTTGCCCCACTGTGGTTCAGTTGATCTGGCAGAAGCCTTTGCTCTTGAAAAGGGTCTGTCTTTGGCAGTCG
ATTCGGGTTTTCGGCCTATAATCATGGAGTCTGATTCTTTGCGTTGTATCACCTTGCTGGAACGGGACGTTGCGGACTTGTCGGAAGTGGGTCTTTTGCTTTCTAATCTC
CGGTGGCGTTGCATCAGTCTGGTTCTGTTGGTTTTGGTTTCACTCATCGAGAAGGAAATCGTGCGGCTGACTGTCTCACTCGTTTTGCTTTGGCGCACCAGTTTGTTGAA
GTTCTTCGTCTTGGGTTTGAGAAGTTCTTCGGCTTTGCAGCTTGTGAAGAATTTGCAACTTGATGGAGTCTTCGGCTTCAAGTCTTGTGAAGTTCTTCGGCTTATAGCTT
GGTGA
Protein sequenceShow/hide protein sequence
MREFRNAMDDCGLLDLGFRGSPFTWTNRRPWAALVFERLDRVLCNLVCPTDLARKTSRCMNLLFALGRSKVGSYKEKICVATSRVQKAIANLHYHPTRDDLFAAEASLEA
ILLEEEFRINLITSLLDVHGHWQADPMEMEGMVSTYFSSLFTSSLPSEETLGRVLASVPCWIDDQMNQALTRPFTEQDILCALKQIHPYKAPGPDGLSGAFYRGVAKESL
LLYEQASGQTINFEKSIISFSLNTVAASQVSLIYCSVYALSSTIFGVAIFYAAESPPNFQFFEGQNLEKPPRLARFVLPTASPSTVECVIFLLLRVIGMNAGIFNIKSGY
LLAQRQLTPSGPSLSNLAWILRWWKSKFSHLVHGRSVPNIFVLLADLQDHLPIQDFEEMVVFLWGIWSLRNGLRSGGARLVGSLGSWTAQYVASFRFVRRSPPLSSTSRC
SRPRCRWRSPISGSFKLNTDAVFCSISLRAGLGIVIRDSTGSVLLAATKYLPHCGSVDLAEAFALEKGLSLAVDSGFRPIIMESDSLRCITLLERDVADLSEVGLLLSNL
RWRCISLVLLVLVSLIEKEIVRLTVSLVLLWRTSLLKFFVLGLRSSSALQLVKNLQLDGVFGFKSCEVLRLIAW