CuGenDBv2

MS023576 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS023576
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold1258:309453..311060
RNA-Seq Expression: MS023576
Synteny: MS023576
Gene Ontology terms:
  GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
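
The genome location field above uses a `scaffold:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state explicitly), the span of the gene can be derived as shown here. `parse_location` and `span_length` are hypothetical helper names used for illustration only; they are not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold1258:309453..311060")
print(scaffold, start, end, span_length(start, end))
```

Under the stated assumption, the interval for this record spans 311060 - 309453 + 1 = 1608 bp.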


Homology
GenBank top hits (e value, % identity, alignment)
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | e value: 2.1e-91 | % identity: 36.6
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG
        M+ FWW    + R +HWV W+ +C  K  GGLGFRDLE+FNQALLAKQ WR+L    SL+AR+ +++Y  +  FL A  G+N S IWRSL  G++LL+ G
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG

Query:  LRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPL-GVSLLDKLIWHYDKRGQYNVKSGYRLA-
        LRWRVGNG SI+VY+D W+P      I SP  L + +LVCDL   SGQW+V  ++  F D+E  + L IPL  ++  D LIWHY++ G Y+VKSGYRLA 
Subjt:  LRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPL-GVSLLDKLIWHYDKRGQYNVKSGYRLA-

Query:  -QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLW
         ++  + G PS   ++   ++WK  W L++P+KIK F WR   D LP  + L  + +     C  C +K ES  H  WLC+  + +W+ S + +      
Subjt:  -QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLW

Query:  VPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGG---SLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAI--WRPPAAPLLK
        V    ++ H +++ +   S +  G    L W +WN RN     F   G   + + L+       Q +  A   S +       P+  +  WRPP A + K
Subjt:  VPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGG---SLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAI--WRPPAAPLLK

Query:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFN--LLTTDCVDDSEVGVLCSVIKLFL
        +NVD A +    V GVGV++R++ G      +R +  +      E  A  EG+   ++ GF    +E D+    N  L T +C  +   G+L   +  +L
Subjt:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFN--LLTTDCVDDSEVGVLCSVIKLFL

Query:  SSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWP
          N   V   +T R+GN  AH LAQ          W+EE P
Subjt:  SSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWP

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | e value: 1.2e-110 | % identity: 39.89
Query:  SHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGL
        + FWWG   ED+K+HWV+W  + LPKC GG+GFRDLE FN+ALLAKQ WR+L+  +S+L+RVLK +YFK   F+ A+   N S IWRS+L GRDLL  GL
Subjt:  SHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGL

Query:  RWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLL-LPSGQWDVEKIRQHFSDEEASSILNIPLGVSL-LDKLIWHYDKRGQYNVKSGYRLA-
        RWR+GNG+S+ +Y DNW+P   +L I S   L + S V  L+    G W  + +R  F+ +EA  IL+IP+G     D+LIW+Y+K G Y+V+SGY++A 
Subjt:  RWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLL-LPSGQWDVEKIRQHFSDEEASSILNIPLGVSL-LDKLIWHYDKRGQYNVKSGYRLA-

Query:  QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWV
          +     PS SS E +  WW GFWK+ +P+KIK+F WRLCLDRLPT  NL  +GV++ NCC FCG+ GE + H+FW+CK   ++W  SKF      L  
Subjt:  QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWV

Query:  PVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARN------QTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLK
                 +R   + +S      + V++W +WN RN       T   F +G    +LV W+  Y   ++ A+ +  +  V        +W+PP   + K
Subjt:  PVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARN------QTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLK

Query:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSS
        +N D +F      AG+G+II +  G V   A + L     VD  E  A  EG+ L  E G                +     D SE G +    K F + 
Subjt:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSS

Query:  NTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS
        +    SF+F  R GN AAH+LA+  L      IW+E+WP E+ S
Subjt:  NTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia] | e value: 4.7e-139 | % identity: 96.89
Query:  MRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPR
        MRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSS SALVVCRPPR
Subjt:  MRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPR

Query:  VAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEV
        VAIWRPPAAPLLKVNVD AFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILL VEAGFIRFQIETDSLRIFNLLTTDCVDDSEV
Subjt:  VAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEV

Query:  GVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS
        GVLCSVIKLFLSS+ E VSFSFTHRNGNA AHLLAQL LTSPHLQIWVEEWPDEISS
Subjt:  GVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS

XP_023872411.1 uncharacterized protein LOC111985024 [Quercus suber] | e value: 1.2e-89 | % identity: 36.33
Query:  HFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLR
        +FWWG   E+RK+ W+SW+++C PK  GG+ F+ L+ FN A+LAKQGWRL +  +SLL RV KSKYF T +F+ A  G N S  WRS++  + L+  G R
Subjt:  HFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLR

Query:  WRVGNGESIRVYSDNWIPMDGSLSIRSPI-SLSVDSLVCDLL-LPSGQWDVEKIRQHFSDEEASSILNIPLGVSL-LDKLIWHYDKRGQYNVKSGYRLAQ
        W+VGNG SI ++ + W+P   +  + SP+ ++  DS V DL+    G W  + +R  F   EA  I  I L  ++  DK +W     G ++V+S Y+LA 
Subjt:  WRVGNGESIRVYSDNWIPMDGSLSIRSPI-SLSVDSLVCDLL-LPSGQWDVEKIRQHFSDEEASSILNIPLGVSL-LDKLIWHYDKRGQYNVKSGYRLAQ

Query:  --RSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCS---------K
          RS  HG  S S    L ++W+  W   +P KI+ F WR C D LPT+ENL+ + V   +CC  C    E++ H+FW C++ R IW  S          
Subjt:  --RSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCS---------K

Query:  FSHFLHLLWVPVGSDMGHFMRVWTDLVSWQH--IGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPP
        F  F+ +LW  V             +  W+H  +  ++V+ WA+W+ RN+  ++  +  S   L+  +  YL+ YQA      S     +P   A W PP
Subjt:  FSHFLHLLWVPVGSDMGHFMRVWTDLVSWQH--IGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPP

Query:  AAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVI
        +    KVNVD A  KE  +AGVG++IRD+ G +     R L         E  AV  G+L   +     F +E+DSL + N L       S V  L    
Subjt:  AAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVI

Query:  KLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWP
         + +S     V FS   R+GN  AHLLA+  L    L +WVEE P
Subjt:  KLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWP

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] | e value: 1.6e-91 | % identity: 38.43
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG
        M+ FWWG   + + +HW  W+R+   K  GG+GFRDL SFNQAL+AKQGWR++   SSL+ARVLK++YFK   F++A  GS  S +WRS++ GR +LH G
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG

Query:  LRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGV-SLLDKLIWHYDKRGQYNVKSGYRLAQ
         RWR+GNG+++ VY +NWIP   +    S  S+  D+ V +L+    QW  + I QHF  E+A +I+ IPL      D+LIWHYDK+G Y+VKSGY++A 
Subjt:  LRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGV-SLLDKLIWHYDKRGQYNVKSGYRLAQ

Query:  RSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVP
        R      PS S+ +  L  W+  WKL +P K+KIF WR   D LPT ENL  K V     C+ C    E+ +H    C + R IW+ S  +  L  ++  
Subjt:  RSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVP

Query:  VGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVDVAF
           D+   ++ W    +      +  LLWAIW ARN+          L  +V+ +E  ++ ++  ++            R   W PP     KVNVD A 
Subjt:  VGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVDVAF

Query:  RKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQI-ETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEWVS
          E+ +AG+GV++RDS G     AI+ L     V   E  A+  G L V E   I F I E+DSL + +L+       +E+G L S I+  L  N +   
Subjt:  RKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQI-ETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEWVS

Query:  FSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEI
           + R+ N AAH LA+L L      IW++E P EI
Subjt:  FSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEI
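
The % identity values listed with each hit are reported by the database. As a rough sketch of how such a figure relates to the alignment rows, the unlabeled middle row of each block can be read under the usual BLAST-style convention (assumed here, not stated on this page): a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. `percent_identity` is a hypothetical helper for a single block; a full hit would need the counts summed over all of its blocks.

```python
def percent_identity(query_row, midline_row):
    """Percent identity of one aligned block, computed from its match line.

    Assumes letters in the match line mark identical residues and that the
    aligned length equals the length of the query row (gaps included).
    """
    aligned_length = len(query_row)
    # Trailing spaces are often stripped from the match line; pad it back
    # so that mismatches at the end of the block are still counted.
    midline = midline_row.ljust(aligned_length)[:aligned_length]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy block: 4 identical positions out of 8 aligned columns.
print(percent_identity("MSHFWWGD", "M+ FWW"))
```

The rows passed in should have the `Query:` label and its leading padding removed first, so that both strings start at the same alignment column.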

TrEMBL top hits (e value, % identity, alignment)
A0A1S8ACU2 Ribonuclease H-like superfamily protein | e value: 2.2e-89 | % identity: 35.06
Query:  FWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLRW
        FWWG   + +  HW SW+++C  K  GGLGFRD+ SFNQAL+AKQ WR++    SL+A++L++KYFK  DFL A+ GS  S +WRS++ GR ++  G+RW
Subjt:  FWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLRW

Query:  RVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGVS-LLDKLIWHYDKRGQYNVKSGYRLAQRSI
        R+G G+ +++Y  +WIP   +    S  +L +D  V +L+  + +W    I+QHF+ E+A  I  I L +S   D+++WHYDK+G Y+VKSGY++A R  
Subjt:  RVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGVS-LLDKLIWHYDKRGQYNVKSGYRLAQRSI

Query:  VHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVPVGS
        +     PSS  S L  W   W L+LP KIKIF W+   + LPT ENL  + +     C  C  K E   H   +CK  + +W+ + F   + LL      
Subjt:  VHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVPVGS

Query:  DMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARN--------QTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVN
        D+   ++   +  S   +  I+ L W  W+ RN        Q PQ      S++   +  E+Y +V     +++A A+    P + A W PP     K N
Subjt:  DMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARN--------QTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVN

Query:  VDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNT
        VD A  KE    G+GV+IRD +G + + A+       DV   E  AV  G+ +V+EA      +ETD   + + +  +    +E+    S I+  L S  
Subjt:  VDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNT

Query:  EWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS
          V      R  NA AH LA++ L +    +W  E P  I S
Subjt:  EWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS

A0A5E4FZN9 PREDICTED: retrotransposon | e value: 1.0e-91 | % identity: 36.6
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG
        M+ FWW    + R +HWV W+ +C  K  GGLGFRDLE+FNQALLAKQ WR+L    SL+AR+ +++Y  +  FL A  G+N S IWRSL  G++LL+ G
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG

Query:  LRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPL-GVSLLDKLIWHYDKRGQYNVKSGYRLA-
        LRWRVGNG SI+VY+D W+P      I SP  L + +LVCDL   SGQW+V  ++  F D+E  + L IPL  ++  D LIWHY++ G Y+VKSGYRLA 
Subjt:  LRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPL-GVSLLDKLIWHYDKRGQYNVKSGYRLA-

Query:  -QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLW
         ++  + G PS   ++   ++WK  W L++P+KIK F WR   D LP  + L  + +     C  C +K ES  H  WLC+  + +W+ S + +      
Subjt:  -QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLW

Query:  VPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGG---SLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAI--WRPPAAPLLK
        V    ++ H +++ +   S +  G    L W +WN RN     F   G   + + L+       Q +  A   S +       P+  +  WRPP A + K
Subjt:  VPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGG---SLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAI--WRPPAAPLLK

Query:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFN--LLTTDCVDDSEVGVLCSVIKLFL
        +NVD A +    V GVGV++R++ G      +R +  +      E  A  EG+   ++ GF    +E D+    N  L T +C  +   G+L   +  +L
Subjt:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFN--LLTTDCVDDSEVGVLCSVIKLFL

Query:  SSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWP
          N   V   +T R+GN  AH LAQ          W+EE P
Subjt:  SSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWP

A0A6J1DAR4 uncharacterized protein LOC111018954 | e value: 5.9e-111 | % identity: 39.89
Query:  SHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGL
        + FWWG   ED+K+HWV+W  + LPKC GG+GFRDLE FN+ALLAKQ WR+L+  +S+L+RVLK +YFK   F+ A+   N S IWRS+L GRDLL  GL
Subjt:  SHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGL

Query:  RWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLL-LPSGQWDVEKIRQHFSDEEASSILNIPLGVSL-LDKLIWHYDKRGQYNVKSGYRLA-
        RWR+GNG+S+ +Y DNW+P   +L I S   L + S V  L+    G W  + +R  F+ +EA  IL+IP+G     D+LIW+Y+K G Y+V+SGY++A 
Subjt:  RWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLL-LPSGQWDVEKIRQHFSDEEASSILNIPLGVSL-LDKLIWHYDKRGQYNVKSGYRLA-

Query:  QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWV
          +     PS SS E +  WW GFWK+ +P+KIK+F WRLCLDRLPT  NL  +GV++ NCC FCG+ GE + H+FW+CK   ++W  SKF      L  
Subjt:  QRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWV

Query:  PVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARN------QTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLK
                 +R   + +S      + V++W +WN RN       T   F +G    +LV W+  Y   ++ A+ +  +  V        +W+PP   + K
Subjt:  PVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARN------QTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLK

Query:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSS
        +N D +F      AG+G+II +  G V   A + L     VD  E  A  EG+ L  E G                +     D SE G +    K F + 
Subjt:  VNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSS

Query:  NTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS
        +    SF+F  R GN AAH+LA+  L      IW+E+WP E+ S
Subjt:  NTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS

A0A6J1DBJ7 uncharacterized protein LOC111018973 | e value: 2.3e-139 | % identity: 96.89
Query:  MRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPR
        MRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSS SALVVCRPPR
Subjt:  MRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPR

Query:  VAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEV
        VAIWRPPAAPLLKVNVD AFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILL VEAGFIRFQIETDSLRIFNLLTTDCVDDSEV
Subjt:  VAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEV

Query:  GVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS
        GVLCSVIKLFLSS+ E VSFSFTHRNGNA AHLLAQL LTSPHLQIWVEEWPDEISS
Subjt:  GVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS

A0A803QQT2 Uncharacterized protein | e value: 3.1e-88 | % identity: 35.4
Query:  SHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGL
        S FWWG LD+++K+HW  W+ +C PK  GGLGFRDL  FNQALLAKQ WR L     L +RVLK+ YF     L A  G+NAS +WRSL+ G+ L+  G 
Subjt:  SHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGL

Query:  RWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLG-VSLLDKLIWHYDKRGQYNVKSGYRLAQR
        RWRVGNGES+RV  D W+P   +  +    SL  +  V DL L  GQWD   IR  F+  +   IL IP       DK++WHY K G+Y+VKSGYR+A  
Subjt:  RWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLG-VSLLDKLIWHYDKRGQYNVKSGYRLAQR

Query:  SIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGV-DVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVP
                 S+  S++QWWK  W+L++P K+K F W++  + LP   NL  +G+   + C R      ES AH  W CK  +  W+ S     L  +   
Subjt:  SIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGV-DVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVP

Query:  VGSD-MGHFMRVWTDLVSW--QHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVD
        +G D +   MR+      W  + +   +++ W IWN RN T  H        +++ W  N+L  ++       S     R    + W PPA   + +NVD
Subjt:  VGSD-MGHFMRVWTDLVSW--QHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVD

Query:  VAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEW
           ++   ++G+G ++RD+ G+V   A  +L +      +E  A+ +GI + ++    RF +ETD L+  +L+        ++  L + I+  +S ++ +
Subjt:  VAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEW

Query:  VSFSFTHRNGNAAAHLLAQLTLTSPHLQIWV
        V  SF  R  N  AH LA   L      +W+
Subjt:  VSFSFTHRNGNAAAHLLAQLTLTSPHLQIWV

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 1.4e-45 | % identity: 27.51
Query:  FWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSAR---AGSNASCIWRSL-LGGRDLLHT
        F WG   E +K H V W ++C PK  GGLG R  +S N+AL++K GWRLL + +SL   VL+ KY   G+   +R      + S  WRS+ +G RD++  
Subjt:  FWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSAR---AGSNASCIWRSL-LGGRDLLHT

Query:  GLRWRVGNGESIRVYSDNWIPMDGSLSI-RSPISLSVDSLVC-DLLLPSGQWDVEKIRQHFSD----EEASSILNIPLGVSLLDKLIWHYDKRGQYNVKS
        G+ W  G+G+ IR ++D W+     L +         D++V  DL +P   WD  KI  + ++    E  + +L++  G    D+L W + + GQ++V+S
Subjt:  GLRWRVGNGESIRVYSDNWIPMDGSLSI-RSPISLSVDSLVC-DLLLPSGQWDVEKIRQHFSD----EEASSILNIPLGVSLLDKLIWHYDKRGQYNVKS

Query:  GYRLAQRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIW-----QCSK
         Y +     V  +P P    ++  ++   WK+++P ++K F W +    + T E    + +   N C+ C    ES  HV   C     IW     Q  +
Subjt:  GYRLAQRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIW-----QCSK

Query:  FSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAA
           F   L+  +  ++G   R   + + W  I A+++     W   N   ++      +  +  W+   ++VY+A   +    +   R  R+  W  P  
Subjt:  FSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAA

Query:  PLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKL
          +KVN D A R    +A  G ++RD TG  +     L          E + VY G+    E    R ++E DS  I   L T   D   +  L  +   
Subjt:  PLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKL

Query:  FLSSNTEW-VSFSFTHRNGNAAAHLLA
        FL    +W V     +R  N  A  LA
Subjt:  FLSSNTEW-VSFSFTHRNGNAAAHLLA

P93295 Uncharacterized mitochondrial protein AtMg00310 | e value: 1.1e-24 | % identity: 44.17
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPK-CMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHT
        M+ FWW   +  RK+ WV+W+++C  K   GGLGFRDL  FNQALLAKQ +R++    +LL+R+L+S+YF     +    G+  S  WRS++ GR+LL  
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPK-CMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHT

Query:  GLRWRVGNGESIRVYSDNWI
        GL   +G+G   +V+ D WI
Subjt:  GLRWRVGNGESIRVYSDNWI

Arabidopsis top hits (e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 6.3e-33 | % identity: 25.35
Query:  LKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQ---WDVEKIRQHFSD
        +K++YFK    L A+     S  W SLL G  LL  G R  +G+G++IR+  DN +       + +  +   +  + +L    G    WD  KI Q    
Subjt:  LKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLRWRVGNGESIRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQ---WDVEKIRQHFSD

Query:  EEASSILNIPLGVSLL-DKLIWHYDKRGQYNVKSGYRLAQRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNC
         +   I  I L  S   DK+IW+Y+  G+Y V+SGY L        +P+ +     +      W L +  K+K F WR     L T E L  +G+ +   
Subjt:  EEASSILNIPLGVSLL-DKLIWHYDKRGQYNVKSGYRLAQRSIVHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNC

Query:  CRFCGQKGESAAHVFWLCKKMRSIWQCS----------------KFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTP-QHFS
        C  C ++ ES  H  + C      W+ S                  S+ L+ +     SD    + VW              L+W IW ARN      F 
Subjt:  CRFCGQKGESAAHVFWLCKKMRSIWQCS----------------KFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAIWNARNQTP-QHFS

Query:  LGGSLSDLVSWSE--NYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFA
           S + L + +E  ++L   Q+ +++ +    +        WR P A  +K N D  F  +   A  G IIR+  G         LA  S+    E  A
Subjt:  LGGSLSDLVSWSE--NYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFA

Query:  VYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLT-------SPHLQIWVEEW
        +   +      G+ +  +E D   + NL+       S    L  +   F ++    + F F  R GN  AH+LA+   T       S  L IW++ +
Subjt:  VYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLT-------SPHLQIWVEEW

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.1e-08 | % identity: 34
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFK-TGD-----FLSARAGSNASCIWRSLLGGR
        MS FW  D D    L  V     CLPK  GGLG R    +N  L  K  WRL S   SL     +  + +  GD     F +++   + S  W+ LL  R
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFK-TGD-----FLSARAGSNASCIWRSLLGGR

Query:  DLLHTGLRWRVGNGESIRVYSDNWIPMDGSL---------SIRSPISLSV
         L    LR  +GNG + R ++DNW P    L          +R PI  SV
Subjt:  DLLHTGLRWRVGNGESIRVYSDNWIPMDGSL---------SIRSPISLSV

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 6.5e-54 | % identity: 29.16
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG
        ++ FWW +  E + +HW +W  +   K  GG+GF+D+E+FN ALL KQ WR+LS   SL+A+V KS+YF   D L+A  GS  S +W+S+   +++L  G
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTG

Query:  LRWRVGNGESIRVYSDNWI---PMDGSLSI-RSP----ISLSVDSLVCDLLLPSG-QWDVEKIRQHFSDEEASSILNI-PLGVSLLDKLIWHYDKRGQYN
         R  VGNGE I ++   W+   P   +L + R P     S+S    V DL+  SG +W  + I   F + E   I  + P G  +LD   W Y   G Y 
Subjt:  LRWRVGNGESIRVYSDNWI---PMDGSLSI-RSP----ISLSVDSLVCDLLLPSG-QWDVEKIRQHFSDEEASSILNI-PLGVSLLDKLIWHYDKRGQYN

Query:  VKSGYRLAQRSI-VHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKF
        VKSGY +  + I     P   S  SL   ++  WK Q   KI+ F W+   + LP    L  + +   + C  C    E+  H+ + C   R  W  S  
Subjt:  VKSGYRLAQRSI-VHGLPSPSSLESLLQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKF

Query:  ----------SHFLHLLWVPVGSDMGHFMRVW---TDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQ-AAQRSSASALVVC
                  S +++L WV    ++G+    W   + LV W        LLW +W  RN+         +  +++  +E+ L+ ++   +  S       
Subjt:  ----------SHFLHLLWVPVGSDMGHFMRVW---TDLVSWQHIGAIVVLLWAIWNARNQTPQHFSLGGSLSDLVSWSENYLQVYQ-AAQRSSASALVVC

Query:  RPPRVAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVD
               WRPP    +K N D  + +++   G+G ++R+  G V     R L +   V   E  A+   +L +    +     E+DS  +  +L  D + 
Subjt:  RPPRVAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVD

Query:  DSEVGVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLT----SPHLQIWVEEW
         S    +  + +L LS  TE V F F  R GN  A  +A+ +L+     P L   V  W
Subjt:  DSEVGVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLT----SPHLQIWVEEW

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.1e-07 | % identity: 24.26
Query:  LRWRVGNGESIRVYSDNWIPMDGSLSI---RSPISLSV--DSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGVSLLDKLIWHYDKRGQYNVKSGY
        +R  +GNGES   + D W      L+      P  L +  D+ V +    +G W +   R   S    +++   P+           ++ RGQ +    +
Subjt:  LRWRVGNGESIRVYSDNWIPMDGSLSI---RSPISLSV--DSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGVSLLDKLIWHYDKRGQYNVKSGY

Query:  RLAQRSIVHGLPSPSSLESL------LQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQ--CS
        R A  S +    S  + E +      + W K  W  +   +  +  W   L+RLPTR+ L   G+++ +    C    E+ AH+F+ C    +IW+   S
Subjt:  RLAQRSIVHGLPSPSSLESL------LQWWKGFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQ--CS

Query:  KF
        KF
Subjt:  KF

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 7.5e-26 | % identity: 44.17
Query:  MSHFWWGDLDEDRKLHWVSWKRMCLPK-CMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHT
        M+ FWW   +  RK+ WV+W+++C  K   GGLGFRDL  FNQALLAKQ +R++    +LL+R+L+S+YF     +    G+  S  WRS++ GR+LL  
Subjt:  MSHFWWGDLDEDRKLHWVSWKRMCLPK-CMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHT

Query:  GLRWRVGNGESIRVYSDNWI
        GL   +G+G   +V+ D WI
Subjt:  GLRWRVGNGESIRVYSDNWI


Sequences
CDS sequence
ATGAGCCATTTCTGGTGGGGTGACCTTGATGAGGATCGTAAGTTGCACTGGGTTAGTTGGAAGCGCATGTGTTTGCCTAAATGTATGGGAGGCTTGGGGTTTCGTGATCT
GGAATCGTTCAATCAGGCTTTGTTAGCTAAACAAGGGTGGCGGCTTCTCTCTGACCTGTCTTCCCTCCTTGCCCGAGTTCTGAAATCCAAATATTTTAAGACTGGTGATT
TTCTGAGTGCTCGGGCAGGATCCAATGCTTCTTGCATATGGCGGAGTCTCCTTGGGGGTCGAGATCTTCTTCATACTGGTCTTCGGTGGCGAGTGGGCAACGGTGAGTCC
ATTCGAGTTTATTCTGATAATTGGATTCCTATGGATGGGTCTTTGTCCATTCGCTCACCGATTTCATTATCTGTGGATAGCTTAGTTTGTGATCTCCTATTACCTTCAGG
TCAGTGGGATGTGGAGAAAATTCGACAGCACTTTAGCGATGAGGAGGCTTCCTCAATTCTGAATATTCCGCTTGGTGTTAGTCTGCTGGATAAACTCATTTGGCATTATG
ACAAAAGGGGACAATACAATGTTAAAAGTGGGTACCGTCTTGCACAACGCAGCATCGTTCATGGTTTGCCTTCTCCTTCTTCCCTCGAGTCTCTGTTACAGTGGTGGAAG
GGGTTTTGGAAACTCCAACTCCCTAGTAAAATTAAGATTTTTGGATGGAGGTTATGCCTTGACCGTTTACCGACGAGGGAGAACCTCTTAGCCAAAGGAGTTGATGTGCT
GAATTGCTGCAGGTTTTGTGGCCAAAAGGGGGAAAGTGCAGCCCATGTGTTTTGGTTATGTAAAAAAATGCGATCTATATGGCAGTGTTCTAAATTCTCCCATTTTCTCC
ATCTTCTGTGGGTGCCGGTGGGATCTGATATGGGTCATTTTATGCGGGTGTGGACGGATCTGGTTTCTTGGCAGCATATTGGAGCTATTGTGGTGTTGTTGTGGGCTATT
TGGAACGCGCGTAATCAAACTCCCCAACATTTCTCATTAGGTGGCTCTTTGTCGGACTTGGTTTCTTGGTCGGAAAATTACCTCCAGGTTTATCAAGCGGCTCAACGGAG
TTCTGCGTCTGCTCTAGTGGTTTGTCGTCCTCCTCGGGTGGCTATCTGGCGTCCTCCTGCAGCGCCTCTCCTTAAGGTGAATGTGGATGTGGCTTTCAGAAAGGAATCTT
TTGTAGCAGGGGTGGGTGTAATTATTCGGGATTCGACCGGTCTTGTGTATCTTACAGCCATTCGTCTGCTTGCTCGTGCTAGTGATGTTGATTGGGTGGAAGGCTTTGCG
GTTTATGAGGGTATTCTCCTTGTTGTGGAAGCAGGTTTTATTCGGTTCCAGATCGAAACAGATTCACTCCGGATTTTTAATTTATTGACGACAGATTGTGTGGATGATTC
TGAAGTTGGGGTTCTCTGCTCTGTCATCAAGCTTTTTTTGTCTTCCAATACTGAGTGGGTTTCTTTTAGTTTTACACATAGGAATGGTAACGCCGCAGCCCATCTATTAG
CTCAACTGACATTGACTTCACCACACCTTCAAATTTGGGTGGAGGAATGGCCTGATGAGATCTCTTCG
mRNA sequence
ATGAGCCATTTCTGGTGGGGTGACCTTGATGAGGATCGTAAGTTGCACTGGGTTAGTTGGAAGCGCATGTGTTTGCCTAAATGTATGGGAGGCTTGGGGTTTCGTGATCT
GGAATCGTTCAATCAGGCTTTGTTAGCTAAACAAGGGTGGCGGCTTCTCTCTGACCTGTCTTCCCTCCTTGCCCGAGTTCTGAAATCCAAATATTTTAAGACTGGTGATT
TTCTGAGTGCTCGGGCAGGATCCAATGCTTCTTGCATATGGCGGAGTCTCCTTGGGGGTCGAGATCTTCTTCATACTGGTCTTCGGTGGCGAGTGGGCAACGGTGAGTCC
ATTCGAGTTTATTCTGATAATTGGATTCCTATGGATGGGTCTTTGTCCATTCGCTCACCGATTTCATTATCTGTGGATAGCTTAGTTTGTGATCTCCTATTACCTTCAGG
TCAGTGGGATGTGGAGAAAATTCGACAGCACTTTAGCGATGAGGAGGCTTCCTCAATTCTGAATATTCCGCTTGGTGTTAGTCTGCTGGATAAACTCATTTGGCATTATG
ACAAAAGGGGACAATACAATGTTAAAAGTGGGTACCGTCTTGCACAACGCAGCATCGTTCATGGTTTGCCTTCTCCTTCTTCCCTCGAGTCTCTGTTACAGTGGTGGAAG
GGGTTTTGGAAACTCCAACTCCCTAGTAAAATTAAGATTTTTGGATGGAGGTTATGCCTTGACCGTTTACCGACGAGGGAGAACCTCTTAGCCAAAGGAGTTGATGTGCT
GAATTGCTGCAGGTTTTGTGGCCAAAAGGGGGAAAGTGCAGCCCATGTGTTTTGGTTATGTAAAAAAATGCGATCTATATGGCAGTGTTCTAAATTCTCCCATTTTCTCC
ATCTTCTGTGGGTGCCGGTGGGATCTGATATGGGTCATTTTATGCGGGTGTGGACGGATCTGGTTTCTTGGCAGCATATTGGAGCTATTGTGGTGTTGTTGTGGGCTATT
TGGAACGCGCGTAATCAAACTCCCCAACATTTCTCATTAGGTGGCTCTTTGTCGGACTTGGTTTCTTGGTCGGAAAATTACCTCCAGGTTTATCAAGCGGCTCAACGGAG
TTCTGCGTCTGCTCTAGTGGTTTGTCGTCCTCCTCGGGTGGCTATCTGGCGTCCTCCTGCAGCGCCTCTCCTTAAGGTGAATGTGGATGTGGCTTTCAGAAAGGAATCTT
TTGTAGCAGGGGTGGGTGTAATTATTCGGGATTCGACCGGTCTTGTGTATCTTACAGCCATTCGTCTGCTTGCTCGTGCTAGTGATGTTGATTGGGTGGAAGGCTTTGCG
GTTTATGAGGGTATTCTCCTTGTTGTGGAAGCAGGTTTTATTCGGTTCCAGATCGAAACAGATTCACTCCGGATTTTTAATTTATTGACGACAGATTGTGTGGATGATTC
TGAAGTTGGGGTTCTCTGCTCTGTCATCAAGCTTTTTTTGTCTTCCAATACTGAGTGGGTTTCTTTTAGTTTTACACATAGGAATGGTAACGCCGCAGCCCATCTATTAG
CTCAACTGACATTGACTTCACCACACCTTCAAATTTGGGTGGAGGAATGGCCTGATGAGATCTCTTCG
Protein sequence
MSHFWWGDLDEDRKLHWVSWKRMCLPKCMGGLGFRDLESFNQALLAKQGWRLLSDLSSLLARVLKSKYFKTGDFLSARAGSNASCIWRSLLGGRDLLHTGLRWRVGNGES
IRVYSDNWIPMDGSLSIRSPISLSVDSLVCDLLLPSGQWDVEKIRQHFSDEEASSILNIPLGVSLLDKLIWHYDKRGQYNVKSGYRLAQRSIVHGLPSPSSLESLLQWWK
GFWKLQLPSKIKIFGWRLCLDRLPTRENLLAKGVDVLNCCRFCGQKGESAAHVFWLCKKMRSIWQCSKFSHFLHLLWVPVGSDMGHFMRVWTDLVSWQHIGAIVVLLWAI
WNARNQTPQHFSLGGSLSDLVSWSENYLQVYQAAQRSSASALVVCRPPRVAIWRPPAAPLLKVNVDVAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFA
VYEGILLVVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSNTEWVSFSFTHRNGNAAAHLLAQLTLTSPHLQIWVEEWPDEISS
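
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch for checking this is given here, assuming the standard nuclear genetic code and an unambiguous, in-frame CDS. `translate` and `CODON_TABLE` are illustrative names, not something provided by the database.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing ambiguous bases (e.g. N) raise KeyError in this sketch.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons of the CDS listed above.
print(translate("ATGAGCCATTTCTGGTGGGGT"))  # prints MSHFWWG
```

Passing the full CDS text (line breaks included) and comparing the result with the protein sequence, joined into one string, gives a quick consistency check between the two listings.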