; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038931 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038931
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H domain
Genome locationchr2:31158094..31158944
RNA-Seq ExpressionLag0038931
SyntenyLag0038931
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG71533.1 hypothetical protein EZV62_000112 [Acer yangbiense]8.1e-2030.15Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKN-SLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSA
        +C+   ET+ H L+ C   KE+ + + N    +    N   I+  +    Q  ++E E++ +    +W+ RNK  HN +I P     +W   ++  +R A
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKN-SLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSA

Query:  NRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAA--WSQKDRRLGLSGPMAELKAMFEGILLA----ISELCKSGGRVGLHGGYQSGHEEERAMGEVEGI
        N R  V   E   L  PRW  PP    KINTDAA  +  K    G   P+AE  A+ EG+  A     S++C     + +    Q+ + +     EV  +
Subjt:  NRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAA--WSQKDRRLGLSGPMAELKAMFEGILLA----ISELCKSGGRVGLHGGYQSGHEEERAMGEVEGI

Query:  LEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESDLEIS
        L DI+ ++  F E+SF+F PR  N VA  LAK   + +    W+   P  + +LV  D+ IS
Subjt:  LEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESDLEIS

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]4.9e-1726.04Show/hide
Query:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANR
        C   +ET +H L  C  A++IW      V   +D          ++  ++S  E E++ V    IW+ RNK          +       + L++++  ++
Subjt:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANR

Query:  RNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPM--AELKAMFEGILLA-ISELCKSGGRVGLHGGYQSGHE----------------
           V  A+   +   +WKPP +N  K+N DAA S KD+++GL   +  AE K +  GI  A   E         +H G Q  ++                
Subjt:  RNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPM--AELKAMFEGILLA-ISELCKSGGRVGLHGGYQSGHE----------------

Query:  -----EERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPS
              + +  E+  IL D+      F ++ FSF PR CN  A  LAK A  + +++ WV +FP+
Subjt:  -----EERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPS

XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]2.9e-1733.56Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR---
        +C   IETT+H LF C RAKE+W  +  + F + DF NS+ D  + + +  S  + +++ V   AIWNDRN +     IP  +I+ +WI TY+  F+   
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR---

Query:  ---SANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGL
             + R     + + + ++  W PPP  + KIN DAA  +   R G+
Subjt:  ---SANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.2e-2329.62Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQI---SNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR
        +C    E+  H  F C RA++IW+ +      + AE++   S ++ W  + +Q   ++L + A+T   IWNDRN L H + + P++ KC W+  +L+S  
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQI---SNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR

Query:  SANRRNPVPVAES-PKLVDPRWKPPPRNFWKINTDAAWSQKDRRLG---------------------LSGPMAELKAMFEGI-LLAISELCKSGGRVGLH
         A   N  P  +S  + V   W+P      K+NTDAA        G                     LS  +AE++ + EG+   A S            
Subjt:  SANRRNPVPVAES-PKLVDPRWKPPPRNFWKINTDAAWSQKDRRLG---------------------LSGPMAELKAMFEGI-LLAISELCKSGGRVGLH

Query:  GGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSE-TWVWSFPSWLLSLVESDLEIS-AHVA
           Q    E    G+ +  + +I AL   F  +SFS S R+CN  A  LAK    S ++   W+++FP+WLL LV+ D   + AHVA
Subjt:  GGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSE-TWVWSFPSWLLSLVESDLEIS-AHVA

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]7.9e-1524.91Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSAN
        +C S  ET +H LF C RAK +W+  +  +  +   ++S  D  + +    S  ELE+  V   +IW++RN ++H   +        +  +YL  F+ A 
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSAN

Query:  RRNPVPVAESPKLVD----------PRWKPPPRNFWKINTDAAWSQKDRRLGLSGPM--------AELKAMFE-----------GILLAISELCKSGGRV
         +N  PV  S               P+W  PPR   K+NTDAA  ++   +G+   +        A L   F            G+ L+++ L      V
Subjt:  RRNPVPVAESPKLVD----------PRWKPPPRNFWKINTDAAWSQKDRRLGLSGPM--------AELKAMFE-----------GILLAISELCKSGGRV

Query:  G--------LHGGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLV
                 +  G ++ H     +     +L +I  L+  F         R  N  A  LAK A    T   W+ +FPS L++L+
Subjt:  G--------LHGGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLV

TrEMBL top hitse value%identityAlignment
A0A2N9I509 Uncharacterized protein3.8e-1523.36Show/hide
Query:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANR
        CL   ET +H L+GC  A+ +WK+ S  +    D + +  +             LE+   T+ A+WN RN+ + +  +P +   C+        F  +  
Subjt:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANR

Query:  RNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLG----LSGPMAELKAMFEGILLAISELCKSGGRVGLHG---GYQSG--------------
        +    +A S    D RW+PP +  +K+N          R+G    +   +  + A  E  +L+  ++ ++   V L      +  G              
Subjt:  RNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLG----LSGPMAELKAMFEGILLAISELCKSGGRVGLHG---GYQSG--------------

Query:  ---HEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESD
            +    +  +  I++D+    P F +LSFSF  + CN  A VLA  A  S     W+   P+ +  LV+SD
Subjt:  ---HEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESD

A0A5C7IQ65 RNase H domain-containing protein3.9e-2030.15Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKN-SLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSA
        +C+   ET+ H L+ C   KE+ + + N    +    N   I+  +    Q  ++E E++ +    +W+ RNK  HN +I P     +W   ++  +R A
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKN-SLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSA

Query:  NRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAA--WSQKDRRLGLSGPMAELKAMFEGILLA----ISELCKSGGRVGLHGGYQSGHEEERAMGEVEGI
        N R  V   E   L  PRW  PP    KINTDAA  +  K    G   P+AE  A+ EG+  A     S++C     + +    Q+ + +     EV  +
Subjt:  NRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAA--WSQKDRRLGLSGPMAELKAMFEGILLA----ISELCKSGGRVGLHGGYQSGHEEERAMGEVEGI

Query:  LEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESDLEIS
        L DI+ ++  F E+SF+F PR  N VA  LAK   + +    W+   P  + +LV  D+ IS
Subjt:  LEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESDLEIS

A0A6J1CTE3 uncharacterized protein LOC1110145781.4e-1733.56Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR---
        +C   IETT+H LF C RAKE+W  +  + F + DF NS+ D  + + +  S  + +++ V   AIWNDRN +     IP  +I+ +WI TY+  F+   
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR---

Query:  ---SANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGL
             + R     + + + ++  W PPP  + KIN DAA  +   R G+
Subjt:  ---SANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGL

A0A6J1DX30 uncharacterized protein LOC1110248745.9e-2429.62Show/hide
Query:  MCLSNIETTNHCLFGCIRAKEIWKQI---SNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR
        +C    E+  H  F C RA++IW+ +      + AE++   S ++ W  + +Q   ++L + A+T   IWNDRN L H + + P++ KC W+  +L+S  
Subjt:  MCLSNIETTNHCLFGCIRAKEIWKQI---SNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFR

Query:  SANRRNPVPVAES-PKLVDPRWKPPPRNFWKINTDAAWSQKDRRLG---------------------LSGPMAELKAMFEGI-LLAISELCKSGGRVGLH
         A   N  P  +S  + V   W+P      K+NTDAA        G                     LS  +AE++ + EG+   A S            
Subjt:  SANRRNPVPVAES-PKLVDPRWKPPPRNFWKINTDAAWSQKDRRLG---------------------LSGPMAELKAMFEGI-LLAISELCKSGGRVGLH

Query:  GGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSE-TWVWSFPSWLLSLVESDLEIS-AHVA
           Q    E    G+ +  + +I AL   F  +SFS S R+CN  A  LAK    S ++   W+++FP+WLL LV+ D   + AHVA
Subjt:  GGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSE-TWVWSFPSWLLSLVESDLEIS-AHVA

A0A6P9E5W8 uncharacterized protein LOC1089821751.6e-1325.82Show/hide
Query:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANR
        C    ET+ H L+GCI AK++W Q   KV       + + D W ++ +  S+EELE +A T   IW  RN     ++           +T L  +R A  
Subjt:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANR

Query:  RNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPMAELKAMFEGILLAISELCKSG---------------GRVGLHGGYQSGHEEE--
         + +    +     PRW  P    +K+N DAA +QKDR++G+   + +   +F G L A   L  S                  VG+      G  ++  
Subjt:  RNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPMAELKAMFEGILLAISELCKSG---------------GRVGLHGGYQSGHEEE--

Query:  ------RAMGEVEG-ILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESDL
                 G + G ++ED   ++ S+   S + + R+ N+ A  LAK A           + PS +L +V  ++
Subjt:  ------RAMGEVEG-ILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWLLSLVESDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.4e-1123.3Show/hide
Query:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDR--W---MKIDDQNSMEELEMIAVTSLAIWNDRNKL-FHNEDIPPIQIKCNWIRTYLES
        C  + ET NH LF C  A+ +W       + E ++ +SL     W   ++++     +   ++      +W  RN+L F  ++    ++    +R  +E 
Subjt:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDR--W---MKIDDQNSMEELEMIAVTSLAIWNDRNKL-FHNEDIPPIQIKCNWIRTYLES

Query:  FRS-ANRRNPVPVAESPKL---VDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPMAE-------------------LKAMFEGILLAISELCK-SGGRV
        F   + RR     A  P++   +  +WK PP  + K NTDA W  ++ R G+   +                     L+A  E +  A+  + + +  R+
Subjt:  FRS-ANRRNPVPVAESPKL---VDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPMAE-------------------LKAMFEGILLAISELCK-SGGRV

Query:  GLHGGYQ---SGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSF-PSWLLS
              Q   +    +     ++  LEDI  L+  F+E+ F F+PR  N VAD +A+ +      +  ++S  P WL S
Subjt:  GLHGGYQ---SGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSF-PSWLLS

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-0423.13Show/hide
Query:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEE----DFKNSLIDRWMKIDDQNSMEELEMIAVTSL-AIWNDRNKLFHNE--DIPPIQIKCNWIRT--Y
        C    E+ NH LF C  A   W+   + +   +    DF+ ++ +    + D    +  +++ V  +  IW  RN +  N+  + P   +      T  +
Subjt:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEE----DFKNSLIDRWMKIDDQNSMEELEMIAVTSL-AIWNDRNKLFHNE--DIPPIQIKCNWIRT--Y

Query:  LESFRSANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWS-QKDRRLG------------------LSGPMAELKAMFEGILLAISELCKSG-GRVG
        L + +S +++ P P  +  +     W+ PP  + K N DA +  QK    G                  L+     L+A  + +L A+ +    G  +V 
Subjt:  LESFRSANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWS-QKDRRLG------------------LSGPMAELKAMFEGILLAISELCKSG-GRVG

Query:  LHGGYQSGHEEERAMGEVEGI---------LEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWL
        + G  Q+       +  + GI         LEDI      F  + F F  RK N +A VLAK      T  +   S P WL
Subjt:  LHGGYQSGHEEERAMGEVEGI---------LEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSFPSWL

AT4G29090.1 Ribonuclease H-like superfamily protein5.3e-0922.92Show/hide
Query:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDR--WM-KIDDQNSMEE--LEMIAVTSLAIWNDRNKL-FHNEDIPPIQIKC-------NW
        C S  ET NH LF C  A+  W   S  +    ++ +S+     W+  + + N   E   +++      +W +RN+L F   +    ++          W
Subjt:  CLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDR--WM-KIDDQNSMEE--LEMIAVTSLAIWNDRNKL-FHNEDIPPIQIKC-------NW

Query:  -IRTYLESFRSANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGL---------------SGPMAELKAMFEGILLAISELCKSGGRVG
         IRT  ES  +  + N             RW+PPP  + K NTDA W++ + R G+               +  + +LK++ E  L A+     S  R  
Subjt:  -IRTYLESFRSANRRNPVPVAESPKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGL---------------SGPMAELKAMFEGILLAISELCKSGGRVG

Query:  LHGGYQSGHEEERAMGEV----------EGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSF-PSWLLSLVE
            Y     + + + E+          +  ++D+  L+  F E+ F F PR+ N +A+ +A+ +      +  ++S  PSW  S ++
Subjt:  LHGGYQSGHEEERAMGEV----------EGILEDIWALIPSFDELSFSFSPRKCNVVADVLAKRAKHSKTSETWVWSF-PSWLLSLVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTTATCGAATATTGAAACCACTAATCATTGTCTTTTTGGATGCATTCGAGCCAAGGAAATCTGGAAACAAATTTCGAACAAGGTCTTTGCAGAGGAAGACTTCAA
AAACAGTTTGATAGATCGTTGGATGAAGATCGATGACCAAAATTCAATGGAAGAGCTGGAGATGATAGCTGTAACGAGTTTGGCAATATGGAACGATCGGAACAAATTAT
TTCACAATGAAGATATCCCTCCCATTCAAATCAAGTGTAATTGGATCAGAACTTACTTGGAGAGTTTTCGAAGCGCCAATAGGAGAAATCCAGTACCAGTTGCTGAAAGT
CCAAAGCTGGTTGATCCCCGATGGAAACCTCCCCCAAGAAATTTCTGGAAAATTAACACTGATGCGGCTTGGTCCCAGAAAGACCGGAGATTGGGATTATCGGGCCCTAT
GGCGGAGCTAAAAGCTATGTTCGAAGGGATTCTGTTGGCGATCTCTGAATTGTGTAAATCTGGTGGTAGAGTCGGATTGCATGGAGGCTATCAATCTGGTCACGAAGAAG
AAAGAGCAATGGGTGAAGTGGAAGGCATTCTGGAAGACATCTGGGCCCTAATTCCATCCTTTGATGAGTTATCTTTTTCTTTCTCCCCTAGAAAATGCAATGTCGTGGCT
GATGTACTGGCTAAGAGAGCTAAACACTCAAAAACTAGTGAAACTTGGGTATGGTCTTTCCCAAGTTGGCTACTGTCGTTGGTCGAAAGTGACCTTGAAATTTCTGCCCA
TGTGGCGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTTTATCGAATATTGAAACCACTAATCATTGTCTTTTTGGATGCATTCGAGCCAAGGAAATCTGGAAACAAATTTCGAACAAGGTCTTTGCAGAGGAAGACTTCAA
AAACAGTTTGATAGATCGTTGGATGAAGATCGATGACCAAAATTCAATGGAAGAGCTGGAGATGATAGCTGTAACGAGTTTGGCAATATGGAACGATCGGAACAAATTAT
TTCACAATGAAGATATCCCTCCCATTCAAATCAAGTGTAATTGGATCAGAACTTACTTGGAGAGTTTTCGAAGCGCCAATAGGAGAAATCCAGTACCAGTTGCTGAAAGT
CCAAAGCTGGTTGATCCCCGATGGAAACCTCCCCCAAGAAATTTCTGGAAAATTAACACTGATGCGGCTTGGTCCCAGAAAGACCGGAGATTGGGATTATCGGGCCCTAT
GGCGGAGCTAAAAGCTATGTTCGAAGGGATTCTGTTGGCGATCTCTGAATTGTGTAAATCTGGTGGTAGAGTCGGATTGCATGGAGGCTATCAATCTGGTCACGAAGAAG
AAAGAGCAATGGGTGAAGTGGAAGGCATTCTGGAAGACATCTGGGCCCTAATTCCATCCTTTGATGAGTTATCTTTTTCTTTCTCCCCTAGAAAATGCAATGTCGTGGCT
GATGTACTGGCTAAGAGAGCTAAACACTCAAAAACTAGTGAAACTTGGGTATGGTCTTTCCCAAGTTGGCTACTGTCGTTGGTCGAAAGTGACCTTGAAATTTCTGCCCA
TGTGGCGCAATAA
Protein sequenceShow/hide protein sequence
MCLSNIETTNHCLFGCIRAKEIWKQISNKVFAEEDFKNSLIDRWMKIDDQNSMEELEMIAVTSLAIWNDRNKLFHNEDIPPIQIKCNWIRTYLESFRSANRRNPVPVAES
PKLVDPRWKPPPRNFWKINTDAAWSQKDRRLGLSGPMAELKAMFEGILLAISELCKSGGRVGLHGGYQSGHEEERAMGEVEGILEDIWALIPSFDELSFSFSPRKCNVVA
DVLAKRAKHSKTSETWVWSFPSWLLSLVESDLEISAHVAQ