CuGenDBv2

Sed0017704 (gene) of Chayote v1 genome

Gene ID: Sed0017704
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG03:14361371..14362294
RNA-Seq Expression: Sed0017704
Synteny: Sed0017704
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
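
The genome location above is written as `sequence:start..end`. The following is a minimal sketch of splitting such a string into its parts and computing the span. It assumes the coordinates are 1-based and inclusive, which is a common convention for this notation but is not stated in this record; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'SEQID:START..END' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'SEQID:START..END' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG03:14361371..14362294")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, `loc.span` gives the number of bases covered by the gene on linkage group LG03.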


Homology
GenBank top hits | e value | % identity
XP_023876296.1 uncharacterized protein LOC111988743 [Quercus suber] | 4.4e-27 | 31.97
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEVN--EVFKPSDATDQKTFYLP
        +D  C  C + AET  HA+W C  A+D W GS      + +   +V  LF     +L + + + F++  W  WN RN  V+  ++  P     +   YL 
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEVN--EVFKPSDATDQKTFYLP

Query:  AY-----DWDYLLDKLK------PLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETD
         +       + LL +L       P D V +K+N DAAI  E   S +G ++RN KGE MA+   + P + DSE A+ +A R  ++ A+D GF  + +E D
Subjt:  AY-----DWDYLLDKLK------PLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETD

Query:  ALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP
           V+  L++     S++G +V++++ LA    +VSF    R AN++AH LA+      E+  W+EE P
Subjt:  ALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP

XP_023904177.1 uncharacterized protein LOC112015942 [Quercus suber] | 1.9e-30 | 31.25
Query:  DSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRN--MEVNEVFKPSDATDQKTFYLPA
        ++ C  C +  E   H IW C  AQD W GS       P    ++  LF +   +L  + M  F++  W  W+ RN  +   ++ +P     +   +L  
Subjt:  DSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRN--MEVNEVFKPSDATDQKTFYLPA

Query:  Y-DWDYLLDKLKPLDAVN----------FKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA
        +      LD L+P   V           FK+N DAAI ++  RS  G V+RN  GE MA+     P +  S+ A+ +A R  M+ A+D GF  + +E D+
Subjt:  Y-DWDYLLDKLKPLDAVN----------FKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA

Query:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCI
        ++V+  L++     S VG +V +IK L      VSF W  RD N +AH LAK      E+ +WME+ P + +
Subjt:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCI

XP_023928118.1 uncharacterized protein LOC112039474 [Quercus suber] | 9.8e-27 | 31.34
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEV--NEVFKPSDATDQKTFYLP
        E+ +C  C + AE   HA+W C  A+D W GS            +V  LF    GKL   + + F+I  W  WN RN  V   ++  P         +L 
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEV--NEVFKPSDATDQKTFYLP

Query:  AYDW---------DYLLDKL-KPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA
         +             L D + +P     FK+N DA + S+   S +G ++RN  GE MA+   ++P ++DS +A+ +A R  M  A   GF  + +E D 
Subjt:  AYDW---------DYLLDKL-KPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA

Query:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP
        L V+  L       S +G ++++IK L R+F  VSF +  R ANS+A+ LA+     +E+ +WME+ P
Subjt:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP

XP_030931246.1 uncharacterized protein LOC115957168 [Quercus lobata] | 2.0e-27 | 30.11
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEV--NEVFKPSDATDQKTFYLP
        EDS C  C   +ET  H +W C  AQD W GS        + +  V +L      KL  ++++ F +  W  W  RN       +  PS    +   YL 
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEV--NEVFKPSDATDQKTFYLP

Query:  AYDWDYLLDKLKPLDA----------VNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA
         +    +   +  + A          +++K+N DAAI  +   S  G+V+RN  GE MA++  + P I DSE A+ +A R  ++ A D GF  + +E D 
Subjt:  AYDWDYLLDKLKPLDA----------VNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA

Query:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPE
          V+N++++    +S++G L  ++  LA     +S     R ANS+AH LA+     NE+  W EE P+
Subjt:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPE

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata] | 2.2e-26 | 28.52
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEVN--EVFKPSDATDQKTFYLP
        ED  C+ C    ET  HA+W C  AQD W GS            +V  LF     +L  ++ + F+ + W  WN RN  ++  ++  P     +   YL 
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEVN--EVFKPSDATDQKTFYLP

Query:  AY----------DWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA
         Y               + + +P     +K+N DAAI +E   S  G ++RN +GE MA+   + P + +SE A+A+A R  ++ A+  GF  + +E D 
Subjt:  AY----------DWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDA

Query:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEM
        ++V+  + +    +S++G + ++I+ L     + S     R AN +AH LA+   + +EE  W+EE P +
Subjt:  LHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEM

TrEMBL top hits | e value | % identity
A0A2N9ED81 Uncharacterized protein | 2.5e-28 | 30.77
Query:  DSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNME--------VNEV----------
        D  C  C+++ ET  HA+W CK  ++ W   P+      N      +LF   +  L + ++Q F    W  W+ RN +        VN++          
Subjt:  DSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNME--------VNEV----------

Query:  FKPSDATDQKTFYLPAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIE
        F  +  +   T   PA     +    KP     +K N D A   E   + +G+++RN KGE MAS C +IP     E  +A A R  + L+ DLG + ++
Subjt:  FKPSDATDQKTFYLPAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIE

Query:  METDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP
        +E D+L +VN L N   C S  G L+ + K LA++ L V F    RD N +AH +AK        + WME VP
Subjt:  METDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP

A0A2N9F767 Uncharacterized protein | 3.6e-27 | 29.82
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNME-VNEVFKPSDATDQKTFYL--
        +D+ C+ C    ETT HA+W C   +  W   P+       +     DL  +      S + + F ++CW  W  RN + +++  +P D    K   L  
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNME-VNEVFKPSDATDQKTFYL--

Query:  -----------PAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMET
                   P         + KP D    K+N D A+ +E   + +G+++RN  GE MA+ C +IP        +A A R    LA+D+G   +E+E 
Subjt:  -----------PAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMET

Query:  DALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDF
        D+  +V+ L +   C +  G L+E+ K LA++  FVSF    RD NS+AH LAK        + W+E V      D L  L KDF
Subjt:  DALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDF

A0A2N9I1D9 Uncharacterized protein | 1.4e-26 | 28.32
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNW---NCRNM------------EVNEVFK
        +D  C  C+   ET  HA+W C   ++ W   P+      NH     +LF   +  L   ++Q F    W  W   NC+ +                +  
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNW---NCRNM------------EVNEVFK

Query:  PSDATDQKTFYLPAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEME
           A  +                 KP     +K N D A   +   + +G+++RN +G AMAS C +IP     E  +A A R  ++L+ DLG + +++E
Subjt:  PSDATDQKTFYLPAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEME

Query:  TDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDF
         D+  +VN L N   C +  G LV + K +A++ L V F    RD N +AH LAK   F    + WME VP     D +  L  DF
Subjt:  TDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDF

A0A2N9J0V8 Uncharacterized protein | 7.3e-28 | 28.41
Query:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNME-----VNEVFK--PSDATDQK
        +D  C  C    E T+HA+W CK+ Q+ W+ + +      +H  +  +L+C     LR++++Q F +  W  W  RN +     + E+ +  P       
Subjt:  EDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNME-----VNEVFK--PSDATDQK

Query:  TFY--------LPAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEME
         F+         P       L K  P  A ++K N D AI +E   + +G+++RN +G  MAS   +IP     E  +A A R     A+DLG + +  E
Subjt:  TFY--------LPAYDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEME

Query:  TDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP
         D+L ++  L +   C ++ G L+ + K++A++F+   F    R+ N +AH LA+        Q WME+VP
Subjt:  TDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVP

A0A2N9J4C9 Uncharacterized protein | 1.5e-25 | 29.45
Query:  DSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRN--------MEVNEVFKPSD---AT
        D  C+ C    ET  HA+W CK  +  W   P+      N      +LF   +  L + ++Q F I  W  W  RN        + ++++   +    + 
Subjt:  DSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRN--------MEVNEVFKPSD---AT

Query:  DQKTFYLPA-----------YDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGF
         Q    +PA             W       +P     +K+N D A   E   + +G+V+RN  GE MAS C ++P     E  +A A R  ++L+ DLGF
Subjt:  DQKTFYLPA-----------YDWDYLLDKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGF

Query:  QRIEMETDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDF
        +  +ME D+  VVN L N   C +  G LV + K +A+  L V +    RD N +AH LAK        + WME VP     D +  L  DF
Subjt:  QRIEMETDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G10000.1 Ribonuclease H-like superfamily protein | 1.1e-04 | 23.26
Query:  CKRCHKLAETTFHAIWACKHAQDSWRGSPFL--HYGFPNHVTEVADLF--CWFYGKLRSKQMQKFMIMCWRNWNCRNMEV--NEVFKPSDATDQKTFYLP
        C RC    ET+ H ++ C  A   W  +P    H      + E  +L         +       F  +CW  W  RN  +  N  F   +   +      
Subjt:  CKRCHKLAETTFHAIWACKHAQDSWRGSPFL--HYGFPNHVTEVADLF--CWFYGKLRSKQMQKFMIMCWRNWNCRNMEV--NEVFKPSDATDQKTFYLP

Query:  AYDWDYL-LDKLKPLD---------AVNFKINVDAAINSEDRRSVLGMVMRNWKGE-----AMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIE
        A+    L L K++              +F   VDAA   +   +  G V +            ++ C + P  +    A+A A++  M  A+ L    + 
Subjt:  AYDWDYL-LDKLKPLD---------AVNFKINVDAAINSEDRRSVLGMVMRNWKGE-----AMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIE

Query:  METDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKM
        + +D+  +V+ LN+ +      G LV EI+S+   F  +SF +  R  NSIA   AK+
Subjt:  METDALHVVNLLNNEMHCRSKVGRLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAKM

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 5.9e-06 | 26.87
Query:  DKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDALHVVNLLNNEMHCRSKVG
        DK +   A   K N D + ++  + S L  ++RN +G  +   C +  G    + A+  A+   +  A DLG++R+E E D + V  L+ N+     ++ 
Subjt:  DKLKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDALHVVNLLNNEMHCRSKVG

Query:  RLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAK
          +E I+  ++ F  V F +  R+ N     LAK
Subjt:  RLVEEIKSLARNFLFVSFGWCGRDANSIAHKLAK


Sequences
CDS sequence
ATGGAAGAGGATTCGATTTGCAAACGATGCCACAAGCTGGCAGAGACAACTTTTCATGCTATTTGGGCGTGCAAACATGCACAAGACTCTTGGAGGGGTTCTCCGTTTCT
GCACTATGGGTTTCCAAACCATGTTACAGAGGTGGCAGACTTATTTTGTTGGTTTTATGGAAAGCTTCGGTCAAAACAAATGCAAAAATTCATGATCATGTGTTGGAGGA
ATTGGAACTGCAGAAACATGGAAGTCAATGAGGTTTTTAAGCCTAGTGATGCTACTGATCAAAAGACATTTTATTTGCCTGCTTATGATTGGGATTATTTACTTGATAAG
TTGAAGCCCCTAGACGCAGTAAATTTTAAAATTAATGTTGATGCAGCAATTAATTCAGAGGATCGTCGAAGCGTGCTTGGGATGGTGATGCGAAATTGGAAGGGGGAGGC
CATGGCTTCTTATTGTTGTCAGATTCCAGGGATTATAGATTCTGAGATAGCGAAGGCAATGGCAGTACGTCATGGCATGGACCTTGCGATGGATCTCGGATTTCAAAGAA
TTGAAATGGAGACGGATGCTTTGCATGTGGTAAATCTTCTCAACAACGAAATGCATTGTCGATCGAAAGTTGGAAGGCTTGTCGAAGAGATAAAAAGCTTAGCGAGAAAC
TTCTTGTTCGTGAGCTTTGGGTGGTGTGGACGAGATGCGAATTCTATTGCTCACAAGTTAGCTAAGATGAGAGCTTTTGAGAATGAGGAACAATTCTGGATGGAAGAAGT
TCCAGAAATGTGCATAAGAGATTATTTGTGCGAGTTGGTGAAGGATTTCCGTCGAAGGGTGTGTGACGGACTTCAACTTTAA
mRNA sequence
ATGGAAGAGGATTCGATTTGCAAACGATGCCACAAGCTGGCAGAGACAACTTTTCATGCTATTTGGGCGTGCAAACATGCACAAGACTCTTGGAGGGGTTCTCCGTTTCT
GCACTATGGGTTTCCAAACCATGTTACAGAGGTGGCAGACTTATTTTGTTGGTTTTATGGAAAGCTTCGGTCAAAACAAATGCAAAAATTCATGATCATGTGTTGGAGGA
ATTGGAACTGCAGAAACATGGAAGTCAATGAGGTTTTTAAGCCTAGTGATGCTACTGATCAAAAGACATTTTATTTGCCTGCTTATGATTGGGATTATTTACTTGATAAG
TTGAAGCCCCTAGACGCAGTAAATTTTAAAATTAATGTTGATGCAGCAATTAATTCAGAGGATCGTCGAAGCGTGCTTGGGATGGTGATGCGAAATTGGAAGGGGGAGGC
CATGGCTTCTTATTGTTGTCAGATTCCAGGGATTATAGATTCTGAGATAGCGAAGGCAATGGCAGTACGTCATGGCATGGACCTTGCGATGGATCTCGGATTTCAAAGAA
TTGAAATGGAGACGGATGCTTTGCATGTGGTAAATCTTCTCAACAACGAAATGCATTGTCGATCGAAAGTTGGAAGGCTTGTCGAAGAGATAAAAAGCTTAGCGAGAAAC
TTCTTGTTCGTGAGCTTTGGGTGGTGTGGACGAGATGCGAATTCTATTGCTCACAAGTTAGCTAAGATGAGAGCTTTTGAGAATGAGGAACAATTCTGGATGGAAGAAGT
TCCAGAAATGTGCATAAGAGATTATTTGTGCGAGTTGGTGAAGGATTTCCGTCGAAGGGTGTGTGACGGACTTCAACTTTAA
Protein sequence
MEEDSICKRCHKLAETTFHAIWACKHAQDSWRGSPFLHYGFPNHVTEVADLFCWFYGKLRSKQMQKFMIMCWRNWNCRNMEVNEVFKPSDATDQKTFYLPAYDWDYLLDK
LKPLDAVNFKINVDAAINSEDRRSVLGMVMRNWKGEAMASYCCQIPGIIDSEIAKAMAVRHGMDLAMDLGFQRIEMETDALHVVNLLNNEMHCRSKVGRLVEEIKSLARN
FLFVSFGWCGRDANSIAHKLAKMRAFENEEQFWMEEVPEMCIRDYLCELVKDFRRRVCDGLQL
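
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 33 and last 12 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGAAGAGGATTCGATTTGCAAACGATGCCAC"  # first 33 nt of the CDS
cds_end = "CTTCAACTTTAA"                         # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two translated fragments can be compared with the beginning and end of the protein sequence listed above; passing the full CDS (with line breaks removed) to `translate` performs the same check over the whole sequence.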