; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10000892 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10000892
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationChr09:10935707..10937143
RNA-Seq ExpressionHG10000892
SyntenyHG10000892
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia]1.2e-4749.74Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        MFARFWWGSS   KK+HW SW +MCLPK  GGLNFRDLEGFNQAL+AKQVWR+L  P+ L + VLK+ Y+    +L A    + S  W+  +WGRDLL+K
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRFLD-DNLFWHYEKNGFYTVKS
        GLR R+G GS++        PRP +F+PI    G  +  VAD I  N QW+  L+  +F E+D D+I  +P   +   D+  WH++K G Y VKS
Subjt:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRFLD-DNLFWHYEKNGFYTVKS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.9e-4847.47Show/hide
Query:  KKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLRYRIGAGSSVK
        +K+HW  W +MC PK CGGLNFRDLEGFNQAL+AK VWR L  P+ L + VLK  Y+    +L A   S  S  W+  LWGRDLL+KGLR R+G GS++K
Subjt:  KKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLRYRIGAGSSVK

Query:  ALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLFLNLKILEASSS
        A      PRP+TFKP+    G  +++VA FI ++  W+ + +   F  +D D+I  +P   + L D+  WHY+K G Y+V+SGYKL+++LK    S+S
Subjt:  ALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLFLNLKILEASSS

XP_024043257.1 uncharacterized protein LOC112099952 [Citrus clementina]8.9e-4644.5Show/hide
Query:  ARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGL
        A+FWWG+    K I+W  W +M   KV GGL FRDL  FNQAL+AKQ WR++  P+SL A VLK+ Y+  G I+ A +GS PS IWRS+LWGR ++ +G 
Subjt:  ARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGL

Query:  RYRIGAGSSVKALRT-LAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLFLN
        R+RIG G +VK  ++   P PSTF+PI       +++VA+ I S   W E L+ + F  +D   I  I   R   DD L WHY+K G Y+VKSGY++ + 
Subjt:  RYRIGAGSSVKALRT-LAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLFLN

Query:  LKILEASSSDDVESRCRW
        +K  +  S     +R +W
Subjt:  LKILEASSSDDVESRCRW

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]1.4e-4627.97Show/hide
Query:  ARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGL
        A+FWWGS    + IHW  W K+   K+ GGL FR+   FNQAL+AKQ WRLL  P+SL + VL++ Y+     L A+ G++ S IWRS++WGR ++ KG+
Subjt:  ARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGL

Query:  RYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPF-CRFLDDNLFWHYEKNGFYTVKSGYKLFLN
        R+RIG G  +        PRP TF+PI       +S VAD I +++QW+E  L++ F + D   I  IP      +D + WHY+K G Y+VKSGY+L L 
Subjt:  RYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPF-CRFLDDNLFWHYEKNGFYTVKSGYKLFLN

Query:  LKILEASSSDDVESRC------------------------------RWIRKYIEE------------VSKKLENTSSGGVLCLNSDVSKP----------
         K  +++S  +   +                                W RK +EE            +S  L    +   + L S  S P          
Subjt:  LKILEASSSDDVESRC------------------------------RWIRKYIEE------------VSKKLENTSSGGVLCLNSDVSKP----------

Query:  ------------------------------PC-------------------------------------------KWFPPEFGFLKLNVDASWSGSLPAI
                                       C                                           +W PP     K+NVDA+++    + 
Subjt:  ------------------------------PC-------------------------------------------KWFPPEFGFLKLNVDASWSGSLPAI

Query:  GWSTIVR-KLWRLVVVAANHLKCSSEGTLVELFAILNGLRLAKRCRASHILVESDCLEAINLINRVSSYLNE
        G   ++R    ++V    N        +L E  A+L GL+LA+    S +++ESDCLE + L+N      +E
Subjt:  GWSTIVR-KLWRLVVVAANHLKCSSEGTLVELFAILNGLRLAKRCRASHILVESDCLEAINLINRVSSYLNE

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]1.1e-4846.76Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        + +RFWWG S +   IHW +W  +C  KV GG+ FR+   FNQAL+AKQ WR+L  P SL A VLKS Y+A G  L A  G+ PS+ W+S++WG++LLLK
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKAL-RTLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF
        GLR+RIG+G+ V  + +   P PS FKP+       +  VADFI  + QW+   L++ FT  D D I  IP   F  +D L WHY   GFYTVKSGYKL 
Subjt:  GLRYRIGAGSSVKAL-RTLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF

Query:  LNLKILEASSSDDVES
         ++   + +SS   E+
Subjt:  LNLKILEASSSDDVES

TrEMBL top hitse value%identityAlignment
A0A803PTB0 Uncharacterized protein9.9e-5142.75Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        M ARFWWGSS   KK HW +WSK+CLPK  GGL F+DLE FN+AL+AKQVWR++  P SL   VLKSSY++   IL A+ GS  S +WR LLWGR+++  
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGF-ENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGI-PFCRFLDDNLFWHYEKNGFYTVKSGYKL
        G R+R+G+G ++  +     PRP  F+PI I P   E + V D  L+N  W+   +KE+F EDD ++I  I       +D L WH+ KNG Y V+SGY  
Subjt:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGF-ENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGI-PFCRFLDDNLFWHYEKNGFYTVKSGYKL

Query:  FLNLKILEASSSDDVESRCRW------------IRKYIEEVSKKLENTSSGGVLCLNSDVSK
         LN ++ E +S  D E+  +W            +++++ +VS +   T+S  + C   +V K
Subjt:  FLNLKILEASSSDDVESRCRW------------IRKYIEEVSKKLENTSSGGVLCLNSDVSK

A0A803Q0L5 Uncharacterized protein6.4e-5031.28Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        M ARFWWGSS   KKIHWC W+ +C PK  GGL FRDL  FNQA++AKQVWR +   ++L + VLK+SY+    IL A+ G+  S +WRSL+WG+ +++ 
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF
        G R+R+G G +V+ L     PRP TFK     P   N  VAD    +  W+   ++EVF  DD ++I  +P   + L+D + WHY KNG YTVKSGYK+ 
Subjt:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF

Query:  LNLKILEASSSDDV---------------------------------------ESRCRWIRKYIEEVSKKLENTS-------------------------
         +L   +  S+D V                                        S+  W     ++  K+L++                           
Subjt:  LNLKILEASSSDDV---------------------------------------ESRCRWIRKYIEEVSKKLENTS-------------------------

Query:  --------------------------------SGGVLCLNSDVSKPPCKWFPPEFGFLKLNVDASWSGSLPAIGWSTIVRKLWRLVVVAANHLKCSSEGT
                                         GG     S V +   KW  PE G +K+NVDA         G   ++R      + A++ +       
Subjt:  --------------------------------SGGVLCLNSDVSKPPCKWFPPEFGFLKLNVDASWSGSLPAIGWSTIVRKLWRLVVVAANHLKCSSEGT

Query:  L-VELFAILNGLRLAKRCRASHILVESDCLEAINLINR
        L +EL AIL GL++    +     +ESDCL+A+ LI R
Subjt:  L-VELFAILNGLRLAKRCRASHILVESDCLEAINLINR

A0A803Q185 Uncharacterized protein3.8e-5046.64Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        M ARFWWGSS   KKIHWC W  +C PK  GG+ FRDL  FNQAL+AKQ+WR +  P++L   VLK+SY+    +L A+ G+  S +WRSL+WG+ L+LK
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF
        G R+R+G GS+++ L     PRP TFK     P  E+  V D  L +  W++  +  VF +DD ++I  +P   + LDD + WHY KNG Y+VKSGY++ 
Subjt:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF

Query:  LNLKILEASSSDDVESRCRWIRK
          LK  E   SDD     +W RK
Subjt:  LNLKILEASSSDDVESRCRWIRK

A0A803Q6Z2 Uncharacterized protein5.4e-4946.76Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        + +RFWWG S +   IHW +W  +C  KV GG+ FR+   FNQAL+AKQ WR+L  P SL A VLKS Y+A G  L A  G+ PS+ W+S++WG++LLLK
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKAL-RTLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF
        GLR+RIG+G+ V  + +   P PS FKP+       +  VADFI  + QW+   L++ FT  D D I  IP   F  +D L WHY   GFYTVKSGYKL 
Subjt:  GLRYRIGAGSSVKAL-RTLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKLF

Query:  LNLKILEASSSDDVES
         ++   + +SS   E+
Subjt:  LNLKILEASSSDDVES

A0A803QJV0 Uncharacterized protein1.1e-4942.74Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        M ARFWWGS+   KKIHWC W  +C PK  GGL FRDLE FNQAL+AKQ+WR L +P SL   VLK+SY+    +LAA+ G+  S +WRSL+WG++++LK
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCD-IIRGIPFCRFLDDNLFWHYEKNGFYTVKSGYKLF
        G R+R+G G  V+ L     PRP++FK     P  +   V D    +  W+ES ++  F  +D + I+R  P    L+D + WHY +NG YTV+SGY++ 
Subjt:  GLRYRIGAGSSVKALR-TLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCD-IIRGIPFCRFLDDNLFWHYEKNGFYTVKSGYKLF

Query:  LNLKILEASSSDDVESRCRWIRKYIEEVSKKLEN
          ++  EA+S   +  +  W + +  ++S K+++
Subjt:  LNLKILEASSSDDVESRCRWIRKYIEEVSKKLEN

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003104.6e-2144.55Show/hide
Query:  FWWGSSLHHKKIHWCSWSKMCLPKV-CGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLR
        FWW S  + +KI W +W K+C  K   GGL FRDL  FNQAL+AKQ +R++ +P +L + +L+S Y+    ++   VG+ PS  WRS++ GR+LL +GL 
Subjt:  FWWGSSLHHKKIHWCSWSKMCLPKV-CGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLR

Query:  YRIGAGSSVK
          IG G   K
Subjt:  YRIGAGSSVK

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-0931.47Show/hide
Query:  LKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLRYRIGAGSSVK-----ALRTLAPRP----STFKPIGICPGFENSSVADFILSNDQWNESLLK
        +K+ Y+    IL A+V    S  W SLL G  LL KG R+ IG G +++      + +  PRP     T+K + I   FE      F      W++S + 
Subjt:  LKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLRYRIGAGSSVK-----ALRTLAPRP----STFKPIGICPGFENSSVADFILSNDQWNESLLK

Query:  EVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKL
        +   + D   I  I   +    D + W+Y   G YTV+SGY L
Subjt:  EVFTEDDCDIIRGIPFCRF-LDDNLFWHYEKNGFYTVKSGYKL

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-3034.53Show/hide
Query:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK
        + A FWW +    K +HW +W  +   K  GG+ F+D+E FN AL+ KQ+WR+L RP+SL A V KS Y+ K D L A +GS PS +W+S+   +++L +
Subjt:  MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLK

Query:  GLRYRIGAGSSVKALR--TLAPRP-STFKPIGICPGFENSSVADFILSND-------QWNESLLKEVFTEDDCDIIRGI-PFCRFLDDNLFWHYEKNGFY
        G R  +G G  +   R   L  +P S    +   P  E +SV+  +  +D       +W + +++ +F E +  +I  + P  R + D+  W Y  +G Y
Subjt:  GLRYRIGAGSSVKALR--TLAPRP-STFKPIGICPGFENSSVADFILSND-------QWNESLLKEVFTEDDCDIIRGI-PFCRFLDDNLFWHYEKNGFY

Query:  TVKSGYKLFLNLKILEASSSDDV
        TVKSGY +   + I + SS  +V
Subjt:  TVKSGYKLFLNLKILEASSSDDV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.3e-2244.55Show/hide
Query:  FWWGSSLHHKKIHWCSWSKMCLPKV-CGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLR
        FWW S  + +KI W +W K+C  K   GGL FRDL  FNQAL+AKQ +R++ +P +L + +L+S Y+    ++   VG+ PS  WRS++ GR+LL +GL 
Subjt:  FWWGSSLHHKKIHWCSWSKMCLPKV-CGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLR

Query:  YRIGAGSSVK
          IG G   K
Subjt:  YRIGAGSSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCTCGCTTTTGGTGGGGATCCTCTCTTCATCACAAGAAAATTCATTGGTGTAGTTGGAGTAAAATGTGTCTTCCAAAGGTGTGTGGTGGACTGAATTTTAGAGA
TCTGGAAGGTTTCAACCAAGCTTTGATTGCAAAGCAGGTTTGGAGACTCCTAATAAGACCTGATTCTTTAGCTGCACTGGTTTTGAAAAGCTCCTATTATGCAAAAGGGG
ATATTTTGGCTGCTGAAGTTGGCTCTAGTCCTTCTGTCATCTGGAGGAGTTTGTTGTGGGGTAGAGATCTTTTGTTAAAGGGTCTCAGATATCGAATAGGTGCTGGCTCT
TCTGTCAAAGCTTTAAGGACCCTTGCTCCTCGCCCGAGCACTTTTAAGCCCATTGGAATTTGCCCTGGTTTTGAGAATTCCAGTGTTGCTGATTTTATTCTCTCAAATGA
TCAATGGAATGAATCTCTTCTAAAGGAAGTCTTCACTGAAGATGATTGTGATATTATTAGGGGTATCCCTTTCTGTAGATTTTTGGATGACAACTTATTTTGGCACTATG
AAAAGAATGGTTTTTACACTGTGAAAAGTGGTTATAAGCTCTTCTTAAATTTAAAGATCTTAGAAGCCTCTTCTAGTGATGATGTGGAGTCTCGGTGTAGATGGATTCGA
AAGTATATTGAAGAAGTTTCAAAAAAACTGGAAAACACTTCATCTGGTGGTGTTTTGTGTCTTAATTCTGATGTTTCCAAGCCCCCATGCAAGTGGTTTCCTCCTGAGTT
TGGCTTTTTGAAGCTAAATGTGGATGCATCTTGGTCTGGGTCTCTTCCTGCCATTGGTTGGAGTACAATAGTGAGAAAACTTTGGAGATTGGTTGTTGTTGCTGCAAACC
ACTTGAAGTGTTCTTCTGAGGGCACGCTTGTTGAGCTTTTTGCCATTCTAAATGGTCTTCGATTGGCTAAAAGATGTCGAGCCTCTCATATTCTAGTAGAATCTGATTGC
CTTGAAGCCATTAATCTGATCAATAGAGTTTCTTCCTATCTCAATGAAGGTGTATGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCTCGCTTTTGGTGGGGATCCTCTCTTCATCACAAGAAAATTCATTGGTGTAGTTGGAGTAAAATGTGTCTTCCAAAGGTGTGTGGTGGACTGAATTTTAGAGA
TCTGGAAGGTTTCAACCAAGCTTTGATTGCAAAGCAGGTTTGGAGACTCCTAATAAGACCTGATTCTTTAGCTGCACTGGTTTTGAAAAGCTCCTATTATGCAAAAGGGG
ATATTTTGGCTGCTGAAGTTGGCTCTAGTCCTTCTGTCATCTGGAGGAGTTTGTTGTGGGGTAGAGATCTTTTGTTAAAGGGTCTCAGATATCGAATAGGTGCTGGCTCT
TCTGTCAAAGCTTTAAGGACCCTTGCTCCTCGCCCGAGCACTTTTAAGCCCATTGGAATTTGCCCTGGTTTTGAGAATTCCAGTGTTGCTGATTTTATTCTCTCAAATGA
TCAATGGAATGAATCTCTTCTAAAGGAAGTCTTCACTGAAGATGATTGTGATATTATTAGGGGTATCCCTTTCTGTAGATTTTTGGATGACAACTTATTTTGGCACTATG
AAAAGAATGGTTTTTACACTGTGAAAAGTGGTTATAAGCTCTTCTTAAATTTAAAGATCTTAGAAGCCTCTTCTAGTGATGATGTGGAGTCTCGGTGTAGATGGATTCGA
AAGTATATTGAAGAAGTTTCAAAAAAACTGGAAAACACTTCATCTGGTGGTGTTTTGTGTCTTAATTCTGATGTTTCCAAGCCCCCATGCAAGTGGTTTCCTCCTGAGTT
TGGCTTTTTGAAGCTAAATGTGGATGCATCTTGGTCTGGGTCTCTTCCTGCCATTGGTTGGAGTACAATAGTGAGAAAACTTTGGAGATTGGTTGTTGTTGCTGCAAACC
ACTTGAAGTGTTCTTCTGAGGGCACGCTTGTTGAGCTTTTTGCCATTCTAAATGGTCTTCGATTGGCTAAAAGATGTCGAGCCTCTCATATTCTAGTAGAATCTGATTGC
CTTGAAGCCATTAATCTGATCAATAGAGTTTCTTCCTATCTCAATGAAGGTGTATGCTAG
Protein sequenceShow/hide protein sequence
MFARFWWGSSLHHKKIHWCSWSKMCLPKVCGGLNFRDLEGFNQALIAKQVWRLLIRPDSLAALVLKSSYYAKGDILAAEVGSSPSVIWRSLLWGRDLLLKGLRYRIGAGS
SVKALRTLAPRPSTFKPIGICPGFENSSVADFILSNDQWNESLLKEVFTEDDCDIIRGIPFCRFLDDNLFWHYEKNGFYTVKSGYKLFLNLKILEASSSDDVESRCRWIR
KYIEEVSKKLENTSSGGVLCLNSDVSKPPCKWFPPEFGFLKLNVDASWSGSLPAIGWSTIVRKLWRLVVVAANHLKCSSEGTLVELFAILNGLRLAKRCRASHILVESDC
LEAINLINRVSSYLNEGVC