CuGenDBv2

Sed0019631 (gene) of Chayote v1 genome

Gene ID: Sed0019631
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG01:62926352..62929577
RNA-Seq Expression: Sed0019631
Synteny: Sed0019631
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
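
The genome location in this record has the form seqid:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the genomic span of the gene can be computed as below. The helper names parse_location and span_length are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive, so one is added
    to the difference between end and start.
    """
    _, start, end = parse_location(location)
    return abs(end - start) + 1


if __name__ == "__main__":
    # Location string copied from the record above.
    print(span_length("LG01:62926352..62929577"))
```

Under that assumption the interval covers 3226 bp; a 0-based, half-open convention would give a different value.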


Homology
GenBank top hits (e value, % identity, alignment)
XP_015385738.1 uncharacterized protein LOC107177034 [Citrus sinensis] | e value: 3.9e-19 | % identity: 25.25
Query:  NIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGS
        N++    N   R I L  +C  C +  E+T HT+  CK AKK+W   P++            +++  +     +   E  + +CW +W  RNK +     
Subjt:  NIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGS

Query:  DYDGLPMDWNL----CQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRN---------------------
         ++G  M+  L     ++ +  +R           +Y   +S N++ W+P     FK+N DAA+  +   +G+G+VIR+                     
Subjt:  DYDGLPMDWNL----CQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRN---------------------

Query:  -EKEAM----------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEE
         E EA+          +FQ + VE D   +V L+       TE+  ++ EI  +S     +S+Y   R  N  AH+LAKLA+  +E  +W+E + + VE 
Subjt:  -EKEAM----------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEE

Query:  L
        +
Subjt:  L

XP_023904177.1 uncharacterized protein LOC112015942 [Quercus suber] | e value: 2.0e-23 | % identity: 27.97
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDG---
        N  +R I     CP C + PE+  H +WEC  A+ +W  +  +  KCP G      L  ++    + +    FL+  W +W  RN  + GG     G   
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDG---

Query:  -LPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------K
           ++W      + EF +   ++     +  Q  +     W P  +P FKMN DAAI  + G SG G VIRNE                          +
Subjt:  -LPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------K

Query:  EAM------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEE
        +AM       F  + +E DS +++  L  +   ++ +G +V +I  + S L  VSF W  R  N++AHALAK A  ++E+  W+E+
Subjt:  EAM------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEE

XP_023915286.1 uncharacterized protein LOC112026812 [Quercus suber] | e value: 1.0e-19 | % identity: 26.03
Query:  NIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKH---FEEFLIMCWCVWRYRNKEVRG
        NI+  +    +R I  NE CP C   P++  H +WECK A+ +W     + L+  KG+T  + ++    D   +     FE FL++CW +W  RN+ V G
Subjt:  NIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKH---FEEFLIMCWCVWRYRNKEVRG

Query:  GGSDYDGLPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRN--------------------EK
        G     G  +    C   + EF E       TH    +  + +++ W P     +K+N D A+  ++ +SG+G++IRN                    E 
Subjt:  GGSDYDGLPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRN--------------------EK

Query:  EAMSFQR------------VEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEE
        E ++ +R            + VE D+ +++  +       + LG +V ++  ++ RL  V F    R  N +AH+LA+ A  + E+ +W+E+
Subjt:  EAMSFQR------------VEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEE

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis] | e value: 1.3e-19 | % identity: 26.39
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLW-------VTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGS
        N   R +  +  C RCG+  E+T H MW C  +KK+W       V   W+E        +F DL   V  ++ ++  E F ++CW +W+ RN+  +  G 
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLW-------VTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGS

Query:  DYDGLPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEK--------------------EAM
          +   + W+     +N F++      +  ++ TQ+    K +W P   P  K+NTDAAI  +   + +GMV+R+ +                    EA+
Subjt:  DYDGLPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEK--------------------EAM

Query:  S------------FQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVE
        +            FQ + VE DST +++ L +T   ++  G ++ +I  ++S    V +    R+ N  AH +AK A+  DE+ +W E
Subjt:  S------------FQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVE

XP_030964861.1 uncharacterized protein LOC115986145 [Quercus lobata] | e value: 1.6e-17 | % identity: 25
Query:  NIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGS
        NI+    N + R I  ++ C  C +  E+  H +WEC  AK +W +   +  KC +       L   + +      FE FL+  W +W  RN  + GG  
Subjt:  NIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGS

Query:  DYDGLPMDW--NLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEKE----AMS-------------
                W     +  + +F +   ++     +    +S     W P   P +K+N DAAI  ++G SG+G VIRN  +    AMS             
Subjt:  DYDGLPMDW--NLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEKE----AMS-------------

Query:  ---------------FQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIW
                       F  + +E D+ +++  +  +   ++ LG I+++I  ++  L +VSF    R  N +AH++A+ A  +DEE  W
Subjt:  ---------------FQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIW

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EEC8 Reverse transcriptase domain-containing protein | e value: 1.7e-20 | % identity: 27.2
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM
        N   R + ++ +C  C   PE   H +W C     +W    W +      +  FADL   V  +A Q++ E F+++ W +W+ RNK  +  G D   +  
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM

Query:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEKEAMS--FQRVEVELDSTSIVNLLQRTMKCI
          N  ++ + EF       F T        + N K W P     FK+N D A+ +E   +G+G+++RN+           E E DS ++V  L  +    
Subjt:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEKEAMS--FQRVEVELDSTSIVNLLQRTMKCI

Query:  TELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY
           G ++ +   ++  LT   F    R+ N LAHALA++A  +D+  +W+E+V    + LY
Subjt:  TELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY

A0A2N9FR17 Reverse transcriptase domain-containing protein | e value: 1.9e-19 | % identity: 25.42
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM
        N  +R I +  +C  CGK  E T H +W CK  + +W    W +      +  FADL+  V    +    E F+I+CW +W+ RNK +R           
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM

Query:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKE--------WNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE-----------------------
             Q  ++   + + KV     EYT++    K +        W P     +K+N D A+ +E   +G+ +++R+                        
Subjt:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKE--------WNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE-----------------------

Query:  ---KEAMSF------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY
           K A+ F         E E DS  IV+ L  +   +   G ++ +   ++S+L   SF    R+ N+LAHALA+ A+  +   +W+E V   +E LY
Subjt:  ---KEAMSF------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY

A0A2N9HSI4 Uncharacterized protein | e value: 5.5e-19 | % identity: 25.43
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM
        N   R + ++ +C  C   PE   H +W C     +W    W +      +  FADL   V  +A Q++ E F+++ W +W+ RNK  +  G D   +  
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM

Query:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------KEAMS
          N  ++ + EF       F T        + N K W P     FK+N D A+ +E   +G+G+++RN+                          K A+ 
Subjt:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------KEAMS

Query:  F------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY
        F         E E DS ++V  L  +       G ++ +   ++  LT   F    R+ N LAHALA++A  +D+  +W+E+V    + LY
Subjt:  F------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY

A0A2N9INY3 Reverse transcriptase domain-containing protein | e value: 1.4e-19 | % identity: 25.43
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM
        N   R I ++ +C  C   PE   H +W C   K  W    W +      +  FADL   V  +A Q++ E F++  W +W+ RNK+        D   +
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM

Query:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------KEAMS
        D N   +    +       F      T+ N     +W P  Y NFK+N D A+ +E   +G+G+++RN                           K A+ 
Subjt:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------KEAMS

Query:  F------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY
        F         E E DS ++V  L  +    +  G ++ +   ++S L    F     + N LAHALA++A  +D   +W+E+V    + LY
Subjt:  F------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY

A0A2N9IV57 Uncharacterized protein | e value: 5.5e-19 | % identity: 25.43
Query:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM
        N   R + ++ +C  C   PE   H +W C     +W    W +      +  FADL   V  +A Q++ E F+++ W +W+ RNK  +  G D   +  
Subjt:  NFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPM

Query:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------KEAMS
          N  ++ + EF       F T        + N K W P     FK+N D A+ +E   +G+G+++RN+                          K A+ 
Subjt:  DWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNE--------------------------KEAMS

Query:  F------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY
        F         E E DS ++V  L  +       G ++ +   ++  LT   F    R+ N LAHALA++A  +D+  +W+E+V    + LY
Subjt:  F------QRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 4.8e-07 | % identity: 20.81
Query:  SNIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEF-----LIMCWCVWRYRNKE
        S  ++       RG+ ++ SCPRC +  ES  H ++ C  A   W  +    ++       F + +  + +  Q     +F     + + W +W+ RN  
Subjt:  SNIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLVHWVFDSAQQKHFEEF-----LIMCWCVWRYRNKE

Query:  VRGGGSDYDGLPMDWNLCQSCINEFRETNAKVF--------------STHREY---TQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRN--
        V                     N+FRE+ +K                 +H++    T+Q + NK EW        K N DA    +   +  G +IRN  
Subjt:  VRGGGSDYDGLPMDWNLCQSCINEFRETNAKVF--------------STHREY---TQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRN--

Query:  --------------------EKEAM----------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAK
                            E +A+           + +V +E D  +++NL+   +   + L   +++I   +++   + F +  RK NKLAH LAK
Subjt:  --------------------EKEAM----------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAK

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 6.2e-07 | % identity: 22.38
Query:  SNIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLV----HWVFDSAQ-----QKHFEEFLIMCWCVWRY
        SN + +      R +    +C RC    E+  H +++C  A+  W  +    +  P G   +AD +    +WVF+        +K  +    + W +W+ 
Subjt:  SNIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPWQELKCPKGITTFADLV----HWVFDSAQ-----QKHFEEFLIMCWCVWRY

Query:  RNKEVRGGGSDYDGLPMDWNLCQSCINEFR-ETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEK------------
        RN E+   G +++   +     +  + E+R  T A+   T     Q N  +   W P  +   K NTDA    +    G+G V+RNEK            
Subjt:  RNKEVRGGGSDYDGLPMDWNLCQSCINEFR-ETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAAIREEMGSSGMGMVIRNEK------------

Query:  ----------EAM----------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAK
                  EAM           +  V  E DS  ++ +L    +    L   ++++ ++ S+ T V F +  R+ N LA  +A+
Subjt:  ----------EAM----------SFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAK


Sequences
CDS sequence
ATGGGGAAATATCACGAGAGAATTGCATTGGCAAAAGATGATCTTCAAGCTACTATTCGTTTGAACAATGAGGTTCAAGAACCGAGAAGAAGTCTCAAGAGGGAACCTGG
ATTGAGGATGCTTAAGGAATTGAGAGGGAGATTTGTAGTTATTTCCAAAAACTGTTCACTACAAATGGATTCCAACATCATCAGTTTGGTTCCGAATTTTTTGGAAAGGG
GGATTGATTTGAATGAGAGTTGCCCTAGATGTGGAAAATGGCCTGAATCAACATACCATACTATGTGGGAGTGTAAAGGGGCAAAGAAGCTATGGGTGACAACTCCTTGG
CAGGAACTTAAGTGTCCTAAGGGAATCACTACTTTTGCTGATTTGGTTCATTGGGTTTTTGATTCTGCACAACAAAAACATTTTGAAGAGTTTCTAATCATGTGTTGGTG
TGTGTGGCGATATCGAAATAAGGAGGTGAGGGGTGGAGGAAGTGATTATGATGGTTTGCCTATGGATTGGAATTTATGTCAATCCTGTATTAATGAGTTTAGGGAGACAA
ATGCAAAGGTTTTCTCTACTCATCGTGAGTATACACAACAGAACAGTTTCAACAAGAAAGAGTGGAATCCCCTTAATTATCCTAATTTCAAAATGAATACTGATGCTGCT
ATTCGAGAGGAAATGGGAAGTAGTGGAATGGGCATGGTGATTCGAAATGAAAAAGAGGCGATGAGCTTTCAGCGTGTTGAGGTCGAGTTGGATTCGACTTCAATTGTGAA
TCTTCTACAACGAACAATGAAATGCATCACTGAATTGGGCAAAATCGTGAAGGAAATTTGGAAAATTTCAAGCCGCCTTACTTTTGTGTCGTTCTATTGGTGTGGTCGAA
AGATGAATAAATTAGCACATGCTCTAGCAAAATTGGCAATTCCCATGGATGAAGAAACAATTTGGGTGGAGGAAGTCCTGATGACCGTGGAAGAGCTTTACAATTGA
mRNA sequence
ATGGGGAAATATCACGAGAGAATTGCATTGGCAAAAGATGATCTTCAAGCTACTATTCGTTTGAACAATGAGGTTCAAGAACCGAGAAGAAGTCTCAAGAGGGAACCTGG
ATTGAGGATGCTTAAGGAATTGAGAGGGAGATTTGTAGTTATTTCCAAAAACTGTTCACTACAAATGGATTCCAACATCATCAGTTTGGTTCCGAATTTTTTGGAAAGGG
GGATTGATTTGAATGAGAGTTGCCCTAGATGTGGAAAATGGCCTGAATCAACATACCATACTATGTGGGAGTGTAAAGGGGCAAAGAAGCTATGGGTGACAACTCCTTGG
CAGGAACTTAAGTGTCCTAAGGGAATCACTACTTTTGCTGATTTGGTTCATTGGGTTTTTGATTCTGCACAACAAAAACATTTTGAAGAGTTTCTAATCATGTGTTGGTG
TGTGTGGCGATATCGAAATAAGGAGGTGAGGGGTGGAGGAAGTGATTATGATGGTTTGCCTATGGATTGGAATTTATGTCAATCCTGTATTAATGAGTTTAGGGAGACAA
ATGCAAAGGTTTTCTCTACTCATCGTGAGTATACACAACAGAACAGTTTCAACAAGAAAGAGTGGAATCCCCTTAATTATCCTAATTTCAAAATGAATACTGATGCTGCT
ATTCGAGAGGAAATGGGAAGTAGTGGAATGGGCATGGTGATTCGAAATGAAAAAGAGGCGATGAGCTTTCAGCGTGTTGAGGTCGAGTTGGATTCGACTTCAATTGTGAA
TCTTCTACAACGAACAATGAAATGCATCACTGAATTGGGCAAAATCGTGAAGGAAATTTGGAAAATTTCAAGCCGCCTTACTTTTGTGTCGTTCTATTGGTGTGGTCGAA
AGATGAATAAATTAGCACATGCTCTAGCAAAATTGGCAATTCCCATGGATGAAGAAACAATTTGGGTGGAGGAAGTCCTGATGACCGTGGAAGAGCTTTACAATTGA
Protein sequence
MGKYHERIALAKDDLQATIRLNNEVQEPRRSLKREPGLRMLKELRGRFVVISKNCSLQMDSNIISLVPNFLERGIDLNESCPRCGKWPESTYHTMWECKGAKKLWVTTPW
QELKCPKGITTFADLVHWVFDSAQQKHFEEFLIMCWCVWRYRNKEVRGGGSDYDGLPMDWNLCQSCINEFRETNAKVFSTHREYTQQNSFNKKEWNPLNYPNFKMNTDAA
IREEMGSSGMGMVIRNEKEAMSFQRVEVELDSTSIVNLLQRTMKCITELGKIVKEIWKISSRLTFVSFYWCGRKMNKLAHALAKLAIPMDEETIWVEEVLMTVEELYN
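
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies and that the CDS is in frame from its first base. The translate function is a hypothetical helper written for illustration, not something provided by CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    from the record can be pasted in as-is.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the CDS listed above.
    print(translate("ATGGGGAAATATCACGAG"))
```

Pasting the full CDS into translate and comparing the result with the protein sequence listed above would confirm that the two entries are consistent.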