; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G12870 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G12870
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationClcChr05:10869235..10876729
RNA-Seq ExpressionClc05G12870
SyntenyClc05G12870
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]4.6e-7222.87Show/hide
Query:  KLRKWKSRASSGRKTF-WALKIKESKNPHYLPAELSQT--------SNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWER-
        K+  W  +     +TF  A K+ +   P  L   LS+T        + RR L+  +    ++  +G  G L +    +V L V SY ++HID+ I  E  
Subjt:  KLRKWKSRASSGRKTF-WALKIKESKNPHYLPAELSQT--------SNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWER-

Query:  KSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------NNCG
         SWR + +YG PE  +K  TW LLR L   +  PWL   D NEI +  EK GGN +    +  FR+A+ D  L D+  +G  FTW++RR+      N   
Subjt:  KSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------NNCG

Query:  SENL--------------EVKLKGIESRQKRAYNEFLEVDF-QKIHVIEFELDNLLEEEEIYWK-------------------------KQKDEIKGDKD
         E+L              + +L+ ++++ K   + F   D   ++   E ++DN+L++EEI+WK                         ++K+ I G  D
Subjt:  SENL--------------EVKLKGIESRQKRAYNEFLEVDF-QKIHVIEFELDNLLEEEEIYWK-------------------------KQKDEIKGDKD

Query:  FKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEV--------------------------------------
         +G+W ED   +E  F  +F  LFS++ P  + MD        KV+  MN +L AP+ + E+                                      
Subjt:  FKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEV--------------------------------------

Query:  --------------------------------------------------------------------------------------EKEANSRGALHG--
                                                                                               K    +G  +G  
Subjt:  --------------------------------------------------------------------------------------EKEANSRGALHG--

Query:  -----------------------------------IECAPT------------------------CP---------------------------------
                                           + C  T                        CP                                 
Subjt:  -----------------------------------IECAPT------------------------CP---------------------------------

Query:  --KISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV---
           ISHLLF DDSL+F RA + +   LK I  ++  AS    N+ KS+++ S NV+  +   +  I  + +      YLGLPS   R K      I    
Subjt:  --KISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV---

Query:  ---------------DKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQVLD
                        +EVLIK++ QA+P YAMS+F+L   +C++I ++  RFWWGS+ DR+ IHW  W+ LC  K  GG+ FR+   FNQALIAKQ   
Subjt:  ---------------DKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQVLD

Query:  SIYFPDASILEVPMGINASYLWKSLLWEFK---------NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINSNIE-DKWVWHFDKKSIYS-----
         I  P++ +  V         W      FK            +A  I  +  W  +M+ + F +ED  +I+R+ +    + D+++WH+DK   YS     
Subjt:  SIYFPDASILEVPMGINASYLWKSLLWEFK---------NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINSNIE-DKWVWHFDKKSIYS-----

Query:  ---------------------------SGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKILSRVSGAQNFN------NCFCNRWLSLSSSCSDE---
                                   + +P K+K F WRA+  ++PT  NL +  +++ G  + R  G +N +      +C   + +  ++ C++E   
Subjt:  ---------------------------SGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKILSRVSGAQNFN------NCFCNRWLSLSSSCSDE---

Query:  --------------------DLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLK-WSNPPKGFIKLNV
                            +L L  + CW +W  +N      +  D  +         D F+R    I K  S + E  +RN+ + W+ PP G+ K+NV
Subjt:  --------------------DLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLK-WSNPPKGFIKLNV

Query:  DAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLS
        DAA        GL VI+++ NG+++  + +           E   V  G+K A       +++E++  +V++L  ++   +    + + +I+        
Subjt:  DAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLS

Query:  IEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYPSWISGMPFPLF
            Y  R  N     +AK A     +V W   +P  I  + FP F
Subjt:  IEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYPSWISGMPFPLF

XP_012851712.1 PREDICTED: uncharacterized protein LOC105971405 [Erythranthe guttata]2.0e-8324.39Show/hide
Query:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY
        + E++  H L   + +  N+ +L   S +      +G  G L +    ++D+R+ISY  NHID+E++       WR +  YG+PE +RK  +W+L+R L 
Subjt:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY

Query:  NHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL
        +H   PWL+G D NEILS +EK+GGN++   +I +FR  + D +L D+   GS FTW++ RH                              N  GS++L
Subjt:  NHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL

Query:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ
         +K      K +++ R+KR +                                                                  E  E D      +
Subjt:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ

Query:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL
        +I  +   L+   E+ ++YW +Q+ +I    +GD                      KD +G W +    IE     +F+ +F S+NP +D +D+VL ++ 
Subjt:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL

Query:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK
         +V  S ++ L  P+T  E                              +  +A  RG   G+  APT P +S+L F DD+LIF  A ES    LK IL 
Subjt:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK

Query:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA
         +   S   INF KS ++ SP+ +++    +  +LG ++ E    YLG+P+   ++K +    + D                  KEVLIK++ QA+P+Y 
Subjt:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA

Query:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINA
        MS F L   L  EI     RFWWGS  +R  + W SWK LC  K  GGL FR++  FN AL+ KQ              ++ + YFP   +L   +G   
Subjt:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINA

Query:  SYLWKSLLW------------EFKNAKLASFISSDW--------------------------------KWNTDMLGRAFNQEDINLILRMPINSNI-EDK
        S  W+S +W               N    S  S  W                                +W+  +L   F   D+  IL +P+ S + +D+
Subjt:  SYLWKSLLW------------EFKNAKLASFISSDW--------------------------------KWNTDMLGRAFNQEDINLILRMPINSNI-EDK

Query:  WVWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKK-------------EKEVWGKI
        W+WHF K                                       K I+   +P K++ F WR  + IIPTN  L ++             EKE    +
Subjt:  WVWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKK-------------EKEVWGKI

Query:  LSRVSG--------------AQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQE
        + +  G              + N+++ + N    L  S  +E+ N+  +  W  W++RN     +++       +W   +   ++   A+I  +      
Subjt:  LSRVSG--------------AQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQE

Query:  MSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENE
        M   +   W  PP   +KLN DAA+ +  +   + ++ +D  G  LG     +  +      E +   + +  A S     ++LE +CLQVIN L +   
Subjt:  MSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENE

Query:  ELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK
        E      ++EE   +   F S  FN+  R  N L   +A+
Subjt:  ELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK

XP_012857846.1 PREDICTED: uncharacterized protein LOC105977118 [Erythranthe guttata]8.3e-8224.04Show/hide
Query:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY
        + E++  H L   + +  N+ +L   S +      +G  G L +    ++D+R+ISY  NHID+E++       WR +  YG+PE +RK  +W+L+R L 
Subjt:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY

Query:  NHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL
        +H   PWL+G D NEILS +EK+GGN++   +I +FR  + D +L D+   GS FTW++ R                               N  GS++L
Subjt:  NHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL

Query:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ
         +K      K +++ R+KR +                                                                  E  E D      +
Subjt:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ

Query:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL
        +I  +   L+   E+ ++YW +Q+ +I    +GD                      KD +G W +    IE     +F+ +F S+NP +D +D+VL ++ 
Subjt:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL

Query:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK
         +V  S ++ L  P+T  E                              +  +A  RG   G+  APT P +S+L F DD+LIF  A ES    LK IL 
Subjt:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK

Query:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA
         +   S   INF KS ++ SP+ +++    +  +LG ++ E    YLG+P+   ++K +    + D                  KEVLIK++ QA+P+Y 
Subjt:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA

Query:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINA
        MS F L   L  EI     RFWWGS  +R  + W SWK LC  K  GGL FR++  FN AL+ KQ              ++ + YFP   +L   +G   
Subjt:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINA

Query:  SYLWKSLLW------------EFKNAKLASFISSDW--------------------------------KWNTDMLGRAFNQEDINLILRMPINSNI-EDK
        S  W+S +W               N    S  S  W                                +W+  +L   F   D+  IL +P+ S + +D+
Subjt:  SYLWKSLLW------------EFKNAKLASFISSDW--------------------------------KWNTDMLGRAFNQEDINLILRMPINSNI-EDK

Query:  WVWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKE--------------------
        W+WHF K                                       K I+   +P K++ F WR  + IIPTN  L +++                    
Subjt:  WVWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKE--------------------

Query:  -------KEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQE
               ++VW         + N+++ + N    L  S  +E+ N+  +  W  W++RN     +++       +W   +   ++   A+I  +      
Subjt:  -------KEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQE

Query:  MSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENE
        M   +   W  PP   +KLN D A+ +  +   + ++ +D  G  LG     +  +      E +   + +  A S     ++LE +CLQVIN L +   
Subjt:  MSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENE

Query:  ELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK
        E      ++EE   +   F S  FN+  R  N L   +A+
Subjt:  ELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.4e-8924.77Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK
        MA RLG+ALG F + DCD  N  W  +LR+++++DI+KPLRRGI ++L G     WI  +YERLPDFCY CG    +    K ++   W R     + + 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK

Query:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM
        P        K P E  +DK  N S     + S+  V   S+ V+         +P E                                S V  T ++  
Subjt:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM

Query:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR
           ++ KS      G +    + +KE  N   L   L   S   S++ S ++S  +  +  PG                               S RF+ 
Subjt:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR

Query:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------
         YG P  +++  TW+LLR + N + +PWL+G D+N IL   E S  +S  +  IE+FR  +   +L D+  +G  FTW + R                  
Subjt:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------

Query:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------
                        HN   S++++                   ++K  ++    AYN+ L +DF  IH +E +L  LLE EEI+WK++          
Subjt:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------

Query:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------
                                        K+EI+            G   F   + +       PK +EA             WN            
Subjt:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------

Query:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----
                                             D  S+  P + + D V                          L     +V+W+  E +     
Subjt:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----

Query:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF
                                                 L+PY         S +    N+ G L GI        I+HLLF DDSLIF R+ ES   
Subjt:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF

Query:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC
        AL+ +L  +  AS   INF+KSA++ SPNV+ +R  +L  IL +K+    G YLGLPS F+R +                                    
Subjt:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC

Query:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLDSIYFPDASILEVPMGINASYLWKSLLW--
                        + +++HW  W  +C PK  GGLNFR+L GFNQAL+AK              +VL   YF D S+L+      +SY WK  LW  
Subjt:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLDSIYFPDASILEVPMGINASYLWKSLLW--

Query:  ----------------------------------EFKNAKL----ASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S
                                           F N  L    ASFI++D  W+   +  +F  ED +LIL MPI+S N++D W+WH+DK+  YS  S
Subjt:  ----------------------------------EFKNAKL----ASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S

Query:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS
        G                              VP K+K F WR+ +E IPT  NL                            K+ +++W  +   L+ +S
Subjt:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS

Query:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL
           N +  F   W SL+     +DLNL AIT W +WNDRN++  GKQ+   + +C W+  + D    S A++       Q   +     W       +KL
Subjt:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL

Query:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF
        N DAA +   AST    I++D +  L+  ++  +       LAEI  + EGLKFA + N   + +ES+ L  I L++NE         W+ EI+ L   F
Subjt:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF

Query:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI
          I F++ +R  N     +AK      S    WL  +P+W+
Subjt:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]2.3e-8424.76Show/hide
Query:  GMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW-ERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFR
        G+ G L +  ++EVD+ +ISY  +HID+ I   + K WR S +YG PE  +K  TW LLR L      PWL   D NEIL   EK GG  +    + +FR
Subjt:  GMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW-ERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFR

Query:  EAIGDRNLRDIKGRGSTFTWNSRR----------------------------------------------------------------------------
        E + D NL D+  RG  FTW++RR                                                                            
Subjt:  EAIGDRNLRDIKGRGSTFTWNSRR----------------------------------------------------------------------------

Query:  ------------------------HNNCGS----------ENLEVKLKGIESRQKRAYNEFLE-VDFQKIHVIEFELDNLLEEEEIYWK-----------
                                 NNC +          +  + KLK ++ +     + F +  +  +I   E +++ +L +EE+YWK           
Subjt:  ------------------------HNNCGS----------ENLEVKLKGIESRQKRAYNEFLE-VDFQKIHVIEFELDNLLEEEEIYWK-----------

Query:  --------------KQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK-----------
                      K+K+ I G  D    W +D + +E  F  YF  LF++S+P +  ++  L  + P+V   MN +L +P+T+ EV             
Subjt:  --------------KQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK-----------

Query:  ----------------------------------------EANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINF
                                                +A  +  +HGI       K+SHLLF DDSLIF RA   +   LK + + +  AS    NF
Subjt:  ----------------------------------------EANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINF

Query:  TKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV-------------------DKEVLIKSIGQALPTYAMSLFQLLKKLC
         KS++  S     D+   +S+I  +K+      YLGLPS   R KTK     V                    KEVLIK++ QA+PTYAMS+F++   LC
Subjt:  TKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV-------------------DKEVLIKSIGQALPTYAMSLFQLLKKLC

Query:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLLW--
        ++I ++  RFWWG+  DRK IHW  W+ +   KA GG+ FR+L  FNQAL+AKQ              VL + YF     +   +G   S++W+S++W  
Subjt:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLLW--

Query:  -----------------------------EFK---------NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS--S
                                      FK         +  +A  I    +W  D++ + F  ED   I+++P+     ED+ +WH+DKK  YS  S
Subjt:  -----------------------------EFK---------NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS--S

Query:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQKKE---------------------------KEVW--GKILSRVSG
        G                              +P KVK F WRA ++++PT  NL KK+                           +++W    +   + G
Subjt:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQKKE---------------------------KEVW--GKILSRVSG

Query:  AQNFNNCFCNR-WLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM--SKRNQLKWSNPPKGFI
            +  +  + W    +     ++   A   WA+W  RN      +  +P      ++   +    S  KI + P  + +   +   Q +WS PP G+ 
Subjt:  AQNFNNCFCNR-WLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM--SKRNQLKWSNPPKGFI

Query:  KLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKV
        K+NVDAA        GL V+++D +G     + +++ L   + +AE   +  GLK A   +    + ES+ L+VI+L+  ++  L    + + +I+    
Subjt:  KLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKV

Query:  HFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYP
        +F + +  +  R  N     +AK A      V WL   P
Subjt:  HFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYP

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase5.6e-6823.46Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDC-----------KSYKRLG-
        + R +G++ G   ++D   D   W + LR +  +++TKPLRRG+ +  + +   I I+ +YE+LPDFCY CG + HV  +C           K  K  G 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDC-----------KSYKRLG-

Query:  WFRGLNPRRKS-----------KPSSQRDKKDKSPFESSVDKPKNRSSCE----------RVAK----SSKFV-QTESRAVERKNQVNQKEMPPEAM---
        W R   PR KS           +   + +KKD      SV+K +    C+          + A+    SS F   T+  +   K QV  K    ++    
Subjt:  WFRGLNPRRKS-----------KPSSQRDKKDKSPFESSVDKPKNRSSCE----------RVAK----SSKFV-QTESRAVERKNQVNQKEMPPEAM---

Query:  --KFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFMNKLRKWKSRASSGRK----------TFWALKIKESKNPHYLPAELSQTS--NRRSLR
          K       +  K++V    N   + V             F+ K     S +  G K          T   +    +   H +  +  + S  +    R
Subjt:  --KFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFMNKLRKWKSRASSGRK----------TFWALKIKESKNPHYLPAELSQTS--NRRSLR

Query:  SSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKS--WRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSG
          S E  E  S    G L +    E ++ ++SY  +H D+ +   + S  WRF+  YG P  +R+ ++WDL+R L   +  PW++G D NEI+  +EK G
Subjt:  SSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKS--WRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSG

Query:  GNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENLEVKLKGIESRQK--RAYNEFLEV------------DFQKIHVIEFE--------
        G+ +    ++ FR  I D  L  +K  G   TW   R  N   E L+  L     R +  +  N  L++            D +++ V  FE        
Subjt:  GNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENLEVKLKGIESRQK--RAYNEFLEV------------DFQKIHVIEFE--------

Query:  ------LDNL-----LEEEEIYWKK-QKDEIKGDKD-----------------FKGQW-CEDPKIIEATFWNYFKDLF--------SSSNPPKDVMDKVL
               DN+     ++  E+  K  +  E+K   D                 ++  W     +I+     N FK +         S+  P + + D  L
Subjt:  ------LDNL-----LEEEEIYWKK-QKDEIKGDKD-----------------FKGQW-CEDPKIIEATFWNYFKDLF--------SSSNPPKDVMDKVL

Query:  --------------------------GSVLPKVDWSMNEKL----------------------------------------------LAPYT-------Q
                                       +V+W+  E++                                              ++PY         
Subjt:  --------------------------GSVLPKVDWSMNEKL----------------------------------------------LAPYT-------Q

Query:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL
        S +   A+++G + G+      P+I+HLLF DDSL+F RA  +   A+  +LKQ+E AS   +N  KSA+  S N        +   LG+        YL
Subjt:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL

Query:  GLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG
        G+P    R K    Q I D                  K VLI+++ Q +P YAMS F+  K    ++     RFWW      K I W +W+ L + K  G
Subjt:  GLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG

Query:  GLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLL-----------WEFKNAKLASFISSDW------------
        GL FR+   FN A +AKQ              V  + YFP  S +   +G N SY+W+S+L           W   +      +   W            
Subjt:  GLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLL-----------WEFKNAKLASFISSDW------------

Query:  -------------------KWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS-SGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKI
                            W  D+L   F    +  IL + ++S+ +EDK +W   K  +++     H  +    R   E++   L      K +W   
Subjt:  -------------------KWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS-SGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKI

Query:  LSRVSGAQNFNNCFCNR--------WLS----LSSSCSDEDLNLCA------------ITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQRSVA
         +RVS    F  C+  R        W+      + S  D   NLC             I  W LW +RN+ +  GK I    +    +L    EF+  + 
Subjt:  LSRVSGAQNFNNCFCNR--------WLS----LSSSCSDEDLNLCA------------ITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQRSVA

Query:  KIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGK-LLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNC
         +  D   +Q     +  KW+ P  G +KLN DAA+  +   T L ++L+D  G  +LG  +R  + D ++  AE + +  GL+ A    H  +++ES+ 
Subjt:  KIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGK-LLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNC

Query:  LQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVA
         Q I  ++   E L      + +I      F  I F++  R  N L   +A
Subjt:  LQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVA

A0A2N9IZB6 Reverse transcriptase domain-containing protein7.4e-6822.84Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK
        +A  +G+A+G F++ D  +    W   LR+++ +D+T+PL RG  + +S D +  W+  +YE LP FCY CGR+GH   D    KR+      NPR  S+
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK

Query:  PSSQRDKK-DKSPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPP---------EAMKFSSQ------------------ILFIFIKE
        PS  RD+  D SP E   D+  +        K  K  +    + +  +QV++K + P         +   FS Q                  + F  I E
Subjt:  PSSQRDKK-DKSPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPP---------EAMKFSSQ------------------ILFIFIKE

Query:  EVNKVLNFHPEGVEFL-------------------------------------TSEVIGTSERFMNKLRKWKSRASSGRKTFWALKIKESKNPHYL----
        EV        E    L                                     TS     S     K R W +   + +K     K+   +N  ++    
Subjt:  EVNKVLNFHPEGVEFL-------------------------------------TSEVIGTSERFMNKLRKWKSRASSGRKTFWALKIKESKNPHYL----

Query:  ----PAELSQTS-NRRSLRSSSA--ESHE-------------------------KSSVGMPGKLEIPRH-----------SEVDLRVISYFQNHIDSEIF
            P  ++  S N R L +     + H+                            + M G   + RH           S V + + SY   HID+++F
Subjt:  ----PAELSQTS-NRRSLRSSSA--ESHE-------------------------KSSVGMPGKLEIPRH-----------SEVDLRVISYFQNHIDSEIF

Query:  WE-RKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSE
         E  +SWR +  YG PE + + +TW+LLR L   N+ PW+V  D NE +S  E+ G   ++   + SF+EA+ D  L+D+  +G  FTW++RR    G +
Subjt:  WE-RKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSE

Query:  NLEVKLKG------------------------------IESRQKRAYNEFLEV--------DFQKIHVIEFELDNLLEEEEIYW----------------
         + V+L                                +E R   A  + LE+        D Q ++ +  E+ +LL +EE+ W                
Subjt:  NLEVKLKG------------------------------IESRQKRAYNEFLEV--------DFQKIHVIEFELDNLLEEEEIYW----------------

Query:  ---------KKQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEKE--------------
                 +K+ + I+G  D +  W  DP  +E     YF  LF+SS P    +D V+  V   V   MNE LL P++  EV++               
Subjt:  ---------KKQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEKE--------------

Query:  -----------------------------------------------------------------ANSRGA-----------------------------
                                                                          N RG                              
Subjt:  -----------------------------------------------------------------ANSRGA-----------------------------

Query:  --------------------------------------------------------------------LHGIECAPTCPKISHLLFVDDSLIFCRAGESN
                                                                            + G+      P+ISHL F DDS+IFCRA    
Subjt:  --------------------------------------------------------------------LHGIECAPTCPKISHLLFVDDSLIFCRAGESN

Query:  FFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKT-----------KDLQGIVDK-------EVLIKS
          AL+ IL  +  AS   +N  K+A+  S N        +S   G  +  +   YLGLP    R+K            K LQG  +K       EVLIK+
Subjt:  FFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKT-----------KDLQGIVDK-------EVLIKS

Query:  IGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLDSIYFPDASI
        + QA+PTYAMS FQ    LC EI+    RFWWG  +  ++IHW + K L   K  GG+ FR+L  FN  L+A+              +VL + YFP  S 
Subjt:  IGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLDSIYFPDASI

Query:  LEVPMGINASYLWKS-----------LLWEFKNAKLASFISSDW-------------------------------KWNTDMLGRAFNQEDINLILRMPIN
        L+  +   ASY+W S           L W     +        W                                W  D+L   F   D+ LIL++P++
Subjt:  LEVPMGINASYLWKS-----------LLWEFKNAKLASFISSDW-------------------------------KWNTDMLGRAFNQEDINLILRMPIN

Query:  -SNIEDKWVWHFDKKSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKK-------------EKEVWGKILSRVSGAQN-FNNC-------------FCNRW
             D  +W   K+  +S  VP KVK F W+A   IIPT   L +K             E E    +L     AQ  +N C             F +  
Subjt:  -SNIEDKWVWHFDKKSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKK-------------EKEVWGKILSRVSGAQN-FNNC-------------FCNRW

Query:  LSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPAST
         S +S      L +   T WALW  RN      +    D      +    EF  + +K G     L   S     +W  PP G  KLN+     S     
Subjt:  LSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPAST

Query:  GLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNN
        GL V+++D  G ++ V   ++  + +L       V   +KFA     + +++E +  ++ +L++  +         +++I      F  + F +  +  N
Subjt:  GLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNN

Query:  ELVDKVAKNARLNSLNVDWLSCYPSWIS
        +  + +A  A  + +   WL  +P+ I+
Subjt:  ELVDKVAKNARLNSLNVDWLSCYPSWIS

A0A6J1DX30 uncharacterized protein LOC1110248741.2e-8924.77Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK
        MA RLG+ALG F + DCD  N  W  +LR+++++DI+KPLRRGI ++L G     WI  +YERLPDFCY CG    +    K ++   W R     + + 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK

Query:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM
        P        K P E  +DK  N S     + S+  V   S+ V+         +P E                                S V  T ++  
Subjt:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM

Query:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR
           ++ KS      G +    + +KE  N   L   L   S   S++ S ++S  +  +  PG                               S RF+ 
Subjt:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR

Query:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------
         YG P  +++  TW+LLR + N + +PWL+G D+N IL   E S  +S  +  IE+FR  +   +L D+  +G  FTW + R                  
Subjt:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------

Query:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------
                        HN   S++++                   ++K  ++    AYN+ L +DF  IH +E +L  LLE EEI+WK++          
Subjt:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------

Query:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------
                                        K+EI+            G   F   + +       PK +EA             WN            
Subjt:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------

Query:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----
                                             D  S+  P + + D V                          L     +V+W+  E +     
Subjt:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----

Query:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF
                                                 L+PY         S +    N+ G L GI        I+HLLF DDSLIF R+ ES   
Subjt:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF

Query:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC
        AL+ +L  +  AS   INF+KSA++ SPNV+ +R  +L  IL +K+    G YLGLPS F+R +                                    
Subjt:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC

Query:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLDSIYFPDASILEVPMGINASYLWKSLLW--
                        + +++HW  W  +C PK  GGLNFR+L GFNQAL+AK              +VL   YF D S+L+      +SY WK  LW  
Subjt:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLDSIYFPDASILEVPMGINASYLWKSLLW--

Query:  ----------------------------------EFKNAKL----ASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S
                                           F N  L    ASFI++D  W+   +  +F  ED +LIL MPI+S N++D W+WH+DK+  YS  S
Subjt:  ----------------------------------EFKNAKL----ASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S

Query:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS
        G                              VP K+K F WR+ +E IPT  NL                            K+ +++W  +   L+ +S
Subjt:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS

Query:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL
           N +  F   W SL+     +DLNL AIT W +WNDRN++  GKQ+   + +C W+  + D    S A++       Q   +     W       +KL
Subjt:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL

Query:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF
        N DAA +   AST    I++D +  L+  ++  +       LAEI  + EGLKFA + N   + +ES+ L  I L++NE         W+ EI+ L   F
Subjt:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF

Query:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI
          I F++ +R  N     +AK      S    WL  +P+W+
Subjt:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI

A0A803Q0L5 Uncharacterized protein3.9e-6927.91Show/hide
Query:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL
        S + +     G ++G+        +SHL F DDSLIF  A   +    K++L+ +  AS   +N+ KS +    NV+++    L+  +G++  ++ G YL
Subjt:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL

Query:  GLPSDFSRSKTKDLQGI------------------VDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG
        GLPS   R+K + L  I                    KEVLIK++ QA+PTYAMS F+L KK    I R   RFWWGS++  K+IHW  W  LC PK  G
Subjt:  GLPSDFSRSKTKDLQGI------------------VDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG

Query:  GLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLLWEFK-----------------------------------
        GL FR+L  FNQA++AKQ              VL + YFP  +ILE   G  AS++W+SL+W  K                                   
Subjt:  GLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLLWEFK-----------------------------------

Query:  -----NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDK--------------------------------KSIYSSGVPHKV
             N  +A     D  W++  +   FN +D  LIL MP     +EDK +WH+ K                                K+++   +P K+
Subjt:  -----NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDK--------------------------------KSIYSSGVPHKV

Query:  KFFCWRALNEIIPTNLNLQKK--EKEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDE
        K F W+     +PTN+ L K+   KE W     +    +  +    +  + ++            +  W +WN RN+V  G   P       W  KY  E
Subjt:  KFFCWRALNEIIPTNLNLQKK--EKEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDE

Query:  FQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVM
        F+   A       T     +R   KW  P  G +K+NVDA  KS    +G+  +++D  G+ L  S+  +  +      E+  +  GL+  I +      
Subjt:  FQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVM

Query:  LESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDW----LSCYPSWIS-GMPFPL
        +ES+CLQ + L++ ++E     D  L++I  L ++      ++  R  N +   +A  A +N  +  W    LSC    +S  MP P+
Subjt:  LESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDW----LSCYPSWIS-GMPFPL

A0A803Q1K6 Uncharacterized protein2.4e-7124.29Show/hide
Query:  VGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWER-KSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESF
        VG+ G L +    E+ LRV S    HI + +       W  +  YG P+ +++  +WDLLR +       WL   D NEI+S  EK GG  +++ A+E F
Subjt:  VGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWER-KSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSKTEKSGGNSKASRAIESF

Query:  REAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENL----------------EVKL---------------------KGIESRQKRAYNEFLEV--------
        REAI D  L D     +  TW +   +N   E L                +VK+                      G   + K +   F E         
Subjt:  REAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENL----------------EVKL---------------------KGIESRQKRAYNEFLEV--------

Query:  -----------------DFQK---------------------IHVIEFELDNLLEEEEIYW-------------------------KKQKDEIKGDKDFK
                         DF++                     +  IE +L+ +LE++E+YW                         +++K+ IKG  D  
Subjt:  -----------------DFQK---------------------IHVIEFELDNLLEEEEIYW-------------------------KKQKDEIKGDKDFK

Query:  GQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK--------------------------------------
        G W ++  +++    +YF+ +F +S  P DV D+V+  V  KV   MNE LL  +T  E+ K                                      
Subjt:  GQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK--------------------------------------

Query:  --------------------------EANS--------RGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAII
                                  EA S         G L GI        ISHL F D SLIF  A   +    + +L ++  AS   +N+ KS   
Subjt:  --------------------------EANS--------RGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAII

Query:  ISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQ------------------GIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSF
           NV  +    L+ ILG++  E+ G YLGLPS   R+K + L                    +  KEVLIK+I QA+PTY MS F+L KK    + R  
Subjt:  ISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQ------------------GIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSF

Query:  TRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLLWEFK------
        +RFWWGS+D  K+IHW  W++LC PK  GGL FR+L  FNQAL+AKQ              VL + YFP   +LE   G NAS++ +SL+W  K      
Subjt:  TRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEVPMGINASYLWKSLLWEFK------

Query:  ----------------------------------NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--SG------
                                          N  +     +D +W+   +   FN  D +LIL +P +  + EDK +WH+ K   YS  SG      
Subjt:  ----------------------------------NAKLASFISSDWKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--SG------

Query:  ------------------------VPHKVKFFCWRALNEIIPTNLNLQKK---------------EKEV----W----GKILSRVSGAQN-----FNNCF
                                 P KVK F W+  +  +P N+NL K+               E+ V    W     K   RVSG  +          
Subjt:  ------------------------VPHKVKFFCWRALNEIIPTNLNLQKK---------------EKEV----W----GKILSRVSGAQN-----FNNCF

Query:  CNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSN
         +  + +++    E L    +  W +WN RN V  G   P P+    W   +   F+      G++ S      +R    W  P +              
Subjt:  CNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSN

Query:  PASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCN
                       +   V+A  + L       E + +++G++  I Q  +   ++++CLQ ++L++N+    R  D  L +I  L  +   +  ++  
Subjt:  PASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCN

Query:  RCNNELVDKVAKNARLNSLNVDWLSCYP
        R  N +   +A  A ++  +  W+   P
Subjt:  RCNNELVDKVAKNARLNSLNVDWLSCYP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.8e-0727.13Show/hide
Query:  IISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWK
        ++   +NKD +G + + +  +M               R KT    G   +  L K++  ++P ++MS   L + + + + +    F WGST ++K+ H  
Subjt:  IISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWK

Query:  SWKFLCLPKALGGLNFRELIGFNQALIAK
         W  +C PK  GGL  R     N+ALI+K
Subjt:  SWKFLCLPKALGGLNFRELIGFNQALIAK

P93295 Uncharacterized mitochondrial protein AtMg003101.2e-1943.48Show/hide
Query:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILE
        ALP YAMS F+L K LC ++T + T FWW S +++++I W +W+ LC  K   GGL FR+L  FNQAL+AKQ              +L S YFP +S++E
Subjt:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILE

Query:  VPMGINASYLWKSLL
          +G   SY W+S++
Subjt:  VPMGINASYLWKSLL

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-1624.55Show/hide
Query:  NLCAITCWALWNDRNNVRI-GKQIPDPDVRCTWILKYYDEFQR-SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWN
        NL     W LW  RN +   GK+   P+V    + +  ++F+  S  +  +  ++  ++ +   ++W  PP  ++K N DA W+      G+  IL++ +
Subjt:  NLCAITCWALWNDRNNVRI-GKQIPDPDVRCTWILKYYDEFQR-SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWN

Query:  GKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYW------LEEIERLKVHFLSIEFNYCNRCNNELVD
        G +L + ARA+    ++  AE+  +R  +      N+K ++ ES+   ++NLL ++       D+W      LE+I++L  HF  ++F +  R  N++ D
Subjt:  GKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYW------LEEIERLKVHFLSIEFNYCNRCNNELVD

Query:  KVAKNA-RLNSLNVDWLSCYPSWI
        ++A+ +   ++ +    S  P W+
Subjt:  KVAKNA-RLNSLNVDWLSCYPSWI

AT4G29090.1 Ribonuclease H-like superfamily protein1.6e-2721.78Show/hide
Query:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEV
        ALPTY M+ F L K +C +I      FWW +  + K +HWK+W  L   KA GG+ F+++  FN AL+ KQ              V  S YF  +  L  
Subjt:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILEV

Query:  PMGINASYLWKS----------------------LLWEFK--------------------NAKLASFI-------SSDWKWNTDMLGRAFNQEDINLILR
        P+G   S++WKS                      ++W  K                     A ++S +        S  +W  D++   F + +  LI  
Subjt:  PMGINASYLWKS----------------------LLWEFK--------------------NAKLASFI-------SSDWKWNTDMLGRAFNQEDINLILR

Query:  M-PINSNIEDKWVWHF-------------------DKKS----------------IYSSGVPHKVKFFCWRALNEIIP-----TNLNLQKKE--------
        + P    I D + W +                   +K+S                I+ S    K++ F W+ L+  +P        +L K+         
Subjt:  M-PINSNIEDKWVWHF-------------------DKKS----------------IYSSGVPHKVKFFCWRALNEIIP-----TNLNLQKKE--------

Query:  KEVWGKILSRVS---------------GAQNFNNCFCNRW----LSLSSSCSDEDLNLCAITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQR-
        KE    +L + +               G +  ++ + N +    L   +   ++   L     W LW +RN  V  G++    +V    + +  D+ +  
Subjt:  KEVWGKILSRVS---------------GAQNFNNCFCNRW----LSLSSSCSDEDLNLCAITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQR-

Query:  SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLES
         +    +   T  ++++ +  +W  PP  ++K N DA W  +    G+  +L++  G++  + ARA+     +  AE+  +R  +       +  V+ ES
Subjt:  SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLES

Query:  NCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVD--WLSCYPSW
        +   +I +L N+ E        +++++RL   F  ++F +  R  N L ++VA+ + L+ LN D    S  PSW
Subjt:  NCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVD--WLSCYPSW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.7e-2143.48Show/hide
Query:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILE
        ALP YAMS F+L K LC ++T + T FWW S +++++I W +W+ LC  K   GGL FR+L  FNQAL+AKQ              +L S YFP +S++E
Subjt:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLDSIYFPDASILE

Query:  VPMGINASYLWKSLL
          +G   SY W+S++
Subjt:  VPMGINASYLWKSLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAAGGTTAGGGGATGCTTTGGGGAAGTTCTTAAAGGTCGATTGCGATCAGGATAATTTTTGCTGGTGGGAGAGTCTCAGACTGAAAATTCTTGTGGACATCAC
AAAACCTTTACGAAGAGGAATCTGGGTAAGCCTGAGTGGTGATCAGGAAAGCATTTGGATAAATTCCAAATATGAGAGACTACCTGATTTTTGTTATGGCTGTGGGAGGA
TTGGGCACGTTATTAAGGATTGTAAGTCTTATAAAAGACTCGGATGGTTTAGAGGTTTGAACCCAAGAAGGAAAAGTAAACCCTCATCCCAAAGAGACAAGAAGGACAAG
TCCCCCTTCGAGTCGTCTGTAGACAAACCCAAAAACCGTTCATCTTGTGAAAGGGTGGCTAAATCCTCTAAGTTTGTACAGACAGAAAGTAGAGCTGTGGAGAGAAAGAA
TCAAGTAAATCAGAAGGAGATGCCACCAGAAGCCATGAAGTTTTCAAGCCAGATCCTGTTCATATTCATAAAGGAAGAAGTAAACAAAGTCCTTAATTTTCATCCAGAAG
GAGTAGAGTTTCTAACCTCGGAGGTCATTGGAACCAGCGAACGTTTTATGAACAAGCTAAGGAAGTGGAAAAGTAGAGCGAGTTCCGGGAGGAAGACCTTCTGGGCCCTG
AAAATAAAAGAAAGCAAGAATCCTCATTACCTTCCAGCAGAACTATCTCAAACATCCAATCGGCGGAGCTTGCGCAGCAGCTCTGCCGAGAGCCATGAAAAATCATCTGT
TGGAATGCCCGGGAAATTAGAAATCCCAAGGCATTCTGAGGTGGATCTAAGAGTAATCTCTTACTTTCAGAACCACATTGATTCTGAAATCTTTTGGGAAAGGAAGAGTT
GGAGATTCTCCAGAATGTATGGTTTCCCTGAAGGAAACAGGAAGACCCAAACCTGGGATTTGCTTCGTCTTCTTTATAATCACAATGACAATCCTTGGCTAGTGGGGGAC
GACTTAAACGAAATCTTAAGTAAAACCGAAAAATCCGGTGGTAATAGCAAGGCCAGTAGGGCTATCGAAAGCTTTAGAGAGGCGATTGGTGATCGCAATTTAAGGGATAT
CAAAGGTCGTGGCTCTACTTTTACCTGGAATAGCAGAAGACACAACAATTGCGGCTCTGAGAACCTGGAGGTCAAATTGAAGGGAATTGAATCCCGGCAGAAGCGTGCTT
ATAATGAATTCCTTGAGGTGGACTTCCAAAAGATCCATGTTATCGAGTTTGAGCTTGACAACCTCTTGGAGGAGGAAGAAATTTATTGGAAAAAACAGAAAGATGAGATC
AAAGGGGATAAGGACTTCAAAGGGCAGTGGTGTGAAGACCCAAAAATCATTGAAGCTACCTTCTGGAATTATTTTAAAGATCTCTTTTCTTCATCTAATCCTCCTAAGGA
TGTAATGGATAAAGTGTTGGGCTCTGTCTTGCCAAAGGTGGACTGGAGTATGAATGAGAAATTGCTAGCACCCTACACACAAAGTGAAGTGGAAAAAGAGGCAAACAGTA
GAGGAGCTCTTCATGGGATAGAATGCGCCCCAACTTGTCCAAAAATCTCACATCTTCTATTCGTCGATGATAGCCTCATTTTCTGTAGGGCGGGCGAATCAAACTTCTTT
GCTCTTAAAACAATCTTGAAGCAGTTTGAGTTGGCTTCCGATGGAAGCATCAATTTCACCAAGTCAGCGATAATAATCTCCCCCAATGTAAACAAAGATCGATGGGGCTT
CCTAAGTGACATCCTCGGCATCAAGATGCAAGAGTCCCTTGGCACCTACTTGGGGCTTCCCTCCGACTTTTCTAGAAGCAAAACAAAAGACCTCCAAGGAATTGTAGACA
AAGAAGTCCTTATTAAAAGCATTGGCCAAGCTCTGCCCACGTACGCCATGAGCCTTTTTCAACTTCTAAAGAAGTTGTGTGACGAGATAACCAGGAGTTTCACTAGATTC
TGGTGGGGCTCAACGGATGACAGAAAAAGGATCCATTGGAAAAGCTGGAAGTTCCTTTGCTTACCTAAGGCGCTCGGTGGTCTTAACTTCAGGGAGTTGATCGGGTTTAA
TCAAGCGTTAATTGCAAAACAGGTCCTTGATAGTATTTACTTTCCTGATGCTTCAATTCTGGAGGTCCCTATGGGAATCAATGCCTCTTATCTCTGGAAAAGTTTGTTAT
GGGAGTTCAAGAATGCTAAGCTGGCTTCTTTCATTAGTTCTGATTGGAAGTGGAACACTGATATGTTGGGAAGAGCCTTCAACCAAGAAGACATTAATCTGATTTTGAGA
ATGCCCATTAACAGTAATATTGAAGATAAGTGGGTTTGGCATTTCGATAAGAAGAGCATTTACTCATCAGGAGTTCCCCATAAGGTGAAATTCTTTTGTTGGAGAGCTCT
AAATGAAATAATCCCTACAAATCTAAACCTCCAGAAGAAAGAGAAGGAAGTATGGGGGAAAATCTTGTCAAGAGTATCGGGTGCCCAAAACTTTAATAATTGTTTCTGTA
ATAGATGGTTGTCTTTAAGTTCTAGTTGTTCAGATGAAGATCTAAATCTATGTGCAATTACGTGTTGGGCTTTGTGGAATGATAGAAATAATGTGAGAATTGGGAAGCAA
ATTCCTGACCCTGATGTAAGATGTACTTGGATTTTGAAATACTACGATGAATTCCAGAGATCAGTTGCAAAAATCGGGAAAGATCCCTCAACTTTGCAGGAGATGTCGAA
AAGGAACCAGCTAAAGTGGTCAAATCCACCTAAAGGCTTTATCAAATTGAATGTGGACGCCGCCTGGAAATCGAATCCTGCTTCAACGGGTTTAAACGTGATACTCAAGG
ATTGGAATGGCAAACTTTTGGGAGTATCAGCCCGGGCCATGGATCTTGATTTTGATCTGCCATTGGCAGAAATCATTACTGTTCGTGAAGGCCTTAAGTTTGCCATTTCT
CAAAACCATAAACTTGTCATGCTGGAATCCAACTGTCTCCAAGTGATCAATTTACTGAAGAACGAAAATGAGGAGTTAAGAGGGGCAGATTACTGGTTGGAGGAGATCGA
GAGACTGAAGGTTCATTTCTTGTCAATTGAGTTTAATTACTGTAATCGTTGCAACAATGAGTTGGTAGATAAAGTTGCAAAAAATGCTAGGCTTAATAGCCTCAATGTTG
ATTGGTTGTCCTGTTATCCTTCGTGGATTAGTGGGATGCCTTTCCCTCTCTTTGTTCCCGTTTCCCATTTCATTCCCCGTCCTTGCCAAATTCCCCTTGGGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGAAGGTTAGGGGATGCTTTGGGGAAGTTCTTAAAGGTCGATTGCGATCAGGATAATTTTTGCTGGTGGGAGAGTCTCAGACTGAAAATTCTTGTGGACATCAC
AAAACCTTTACGAAGAGGAATCTGGGTAAGCCTGAGTGGTGATCAGGAAAGCATTTGGATAAATTCCAAATATGAGAGACTACCTGATTTTTGTTATGGCTGTGGGAGGA
TTGGGCACGTTATTAAGGATTGTAAGTCTTATAAAAGACTCGGATGGTTTAGAGGTTTGAACCCAAGAAGGAAAAGTAAACCCTCATCCCAAAGAGACAAGAAGGACAAG
TCCCCCTTCGAGTCGTCTGTAGACAAACCCAAAAACCGTTCATCTTGTGAAAGGGTGGCTAAATCCTCTAAGTTTGTACAGACAGAAAGTAGAGCTGTGGAGAGAAAGAA
TCAAGTAAATCAGAAGGAGATGCCACCAGAAGCCATGAAGTTTTCAAGCCAGATCCTGTTCATATTCATAAAGGAAGAAGTAAACAAAGTCCTTAATTTTCATCCAGAAG
GAGTAGAGTTTCTAACCTCGGAGGTCATTGGAACCAGCGAACGTTTTATGAACAAGCTAAGGAAGTGGAAAAGTAGAGCGAGTTCCGGGAGGAAGACCTTCTGGGCCCTG
AAAATAAAAGAAAGCAAGAATCCTCATTACCTTCCAGCAGAACTATCTCAAACATCCAATCGGCGGAGCTTGCGCAGCAGCTCTGCCGAGAGCCATGAAAAATCATCTGT
TGGAATGCCCGGGAAATTAGAAATCCCAAGGCATTCTGAGGTGGATCTAAGAGTAATCTCTTACTTTCAGAACCACATTGATTCTGAAATCTTTTGGGAAAGGAAGAGTT
GGAGATTCTCCAGAATGTATGGTTTCCCTGAAGGAAACAGGAAGACCCAAACCTGGGATTTGCTTCGTCTTCTTTATAATCACAATGACAATCCTTGGCTAGTGGGGGAC
GACTTAAACGAAATCTTAAGTAAAACCGAAAAATCCGGTGGTAATAGCAAGGCCAGTAGGGCTATCGAAAGCTTTAGAGAGGCGATTGGTGATCGCAATTTAAGGGATAT
CAAAGGTCGTGGCTCTACTTTTACCTGGAATAGCAGAAGACACAACAATTGCGGCTCTGAGAACCTGGAGGTCAAATTGAAGGGAATTGAATCCCGGCAGAAGCGTGCTT
ATAATGAATTCCTTGAGGTGGACTTCCAAAAGATCCATGTTATCGAGTTTGAGCTTGACAACCTCTTGGAGGAGGAAGAAATTTATTGGAAAAAACAGAAAGATGAGATC
AAAGGGGATAAGGACTTCAAAGGGCAGTGGTGTGAAGACCCAAAAATCATTGAAGCTACCTTCTGGAATTATTTTAAAGATCTCTTTTCTTCATCTAATCCTCCTAAGGA
TGTAATGGATAAAGTGTTGGGCTCTGTCTTGCCAAAGGTGGACTGGAGTATGAATGAGAAATTGCTAGCACCCTACACACAAAGTGAAGTGGAAAAAGAGGCAAACAGTA
GAGGAGCTCTTCATGGGATAGAATGCGCCCCAACTTGTCCAAAAATCTCACATCTTCTATTCGTCGATGATAGCCTCATTTTCTGTAGGGCGGGCGAATCAAACTTCTTT
GCTCTTAAAACAATCTTGAAGCAGTTTGAGTTGGCTTCCGATGGAAGCATCAATTTCACCAAGTCAGCGATAATAATCTCCCCCAATGTAAACAAAGATCGATGGGGCTT
CCTAAGTGACATCCTCGGCATCAAGATGCAAGAGTCCCTTGGCACCTACTTGGGGCTTCCCTCCGACTTTTCTAGAAGCAAAACAAAAGACCTCCAAGGAATTGTAGACA
AAGAAGTCCTTATTAAAAGCATTGGCCAAGCTCTGCCCACGTACGCCATGAGCCTTTTTCAACTTCTAAAGAAGTTGTGTGACGAGATAACCAGGAGTTTCACTAGATTC
TGGTGGGGCTCAACGGATGACAGAAAAAGGATCCATTGGAAAAGCTGGAAGTTCCTTTGCTTACCTAAGGCGCTCGGTGGTCTTAACTTCAGGGAGTTGATCGGGTTTAA
TCAAGCGTTAATTGCAAAACAGGTCCTTGATAGTATTTACTTTCCTGATGCTTCAATTCTGGAGGTCCCTATGGGAATCAATGCCTCTTATCTCTGGAAAAGTTTGTTAT
GGGAGTTCAAGAATGCTAAGCTGGCTTCTTTCATTAGTTCTGATTGGAAGTGGAACACTGATATGTTGGGAAGAGCCTTCAACCAAGAAGACATTAATCTGATTTTGAGA
ATGCCCATTAACAGTAATATTGAAGATAAGTGGGTTTGGCATTTCGATAAGAAGAGCATTTACTCATCAGGAGTTCCCCATAAGGTGAAATTCTTTTGTTGGAGAGCTCT
AAATGAAATAATCCCTACAAATCTAAACCTCCAGAAGAAAGAGAAGGAAGTATGGGGGAAAATCTTGTCAAGAGTATCGGGTGCCCAAAACTTTAATAATTGTTTCTGTA
ATAGATGGTTGTCTTTAAGTTCTAGTTGTTCAGATGAAGATCTAAATCTATGTGCAATTACGTGTTGGGCTTTGTGGAATGATAGAAATAATGTGAGAATTGGGAAGCAA
ATTCCTGACCCTGATGTAAGATGTACTTGGATTTTGAAATACTACGATGAATTCCAGAGATCAGTTGCAAAAATCGGGAAAGATCCCTCAACTTTGCAGGAGATGTCGAA
AAGGAACCAGCTAAAGTGGTCAAATCCACCTAAAGGCTTTATCAAATTGAATGTGGACGCCGCCTGGAAATCGAATCCTGCTTCAACGGGTTTAAACGTGATACTCAAGG
ATTGGAATGGCAAACTTTTGGGAGTATCAGCCCGGGCCATGGATCTTGATTTTGATCTGCCATTGGCAGAAATCATTACTGTTCGTGAAGGCCTTAAGTTTGCCATTTCT
CAAAACCATAAACTTGTCATGCTGGAATCCAACTGTCTCCAAGTGATCAATTTACTGAAGAACGAAAATGAGGAGTTAAGAGGGGCAGATTACTGGTTGGAGGAGATCGA
GAGACTGAAGGTTCATTTCTTGTCAATTGAGTTTAATTACTGTAATCGTTGCAACAATGAGTTGGTAGATAAAGTTGCAAAAAATGCTAGGCTTAATAGCCTCAATGTTG
ATTGGTTGTCCTGTTATCCTTCGTGGATTAGTGGGATGCCTTTCCCTCTCTTTGTTCCCGTTTCCCATTTCATTCCCCGTCCTTGCCAAATTCCCCTTGGGGATTGA
Protein sequenceShow/hide protein sequence
MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSKPSSQRDKKDK
SPFESSVDKPKNRSSCERVAKSSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFMNKLRKWKSRASSGRKTFWAL
KIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGD
DLNEILSKTEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENLEVKLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQKDEI
KGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF
ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRF
WWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQVLDSIYFPDASILEVPMGINASYLWKSLLWEFKNAKLASFISSDWKWNTDMLGRAFNQEDINLILR
MPINSNIEDKWVWHFDKKSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQ
IPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAIS
QNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYPSWISGMPFPLFVPVSHFIPRPCQIPLGD