; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G010290 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G010290
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase domain-containing protein
Genome locationCG_Chr05:11393629..11401147
RNA-Seq ExpressionClCG05G010290
SyntenyClCG05G010290
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]1.6e-7222.18Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSY-------KRL---GWF
        + R +G ALG+ L+VD D     W E +R+++L++I+KPL R   + +  +  S W+   YERLPD C+ CG +GH +K+C+ +       +RL    W 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSY-------KRL---GWF

Query:  RG--LNPRRKS--------------------KPSSQRDKK--------------DKSPFESSVDKPKNRSSC-----------ERVA--ESSKFV-----
        R   LN RR                      +  SQ+D                +K+P    VD P N+ S            E VA  ESS  V     
Subjt:  RG--LNPRRKS--------------------KPSSQRDKK--------------DKSPFESSVDKPKNRSSC-----------ERVA--ESSKFV-----

Query:  ---QTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPE---GVEFLTSEVIGTSERFMNK-LRKWKSRASSGRKTFWALKIK------
           +T    V ++N +    M    ++ S     +F+  +     N  P+   G     S    T +R  ++ LRK ++ A  GR      +        
Subjt:  ---QTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPE---GVEFLTSEVIGTSERFMNK-LRKWKSRASSGRKTFWALKIK------

Query:  ------------------ESKNPHYLPAELSQTS----NRRSLRSSSAESHEK----------------SSVGMPGKLEIPRHSEVDLRVISYFQNHIDS
                          E+K  ++LP  L + S     RR L  S A S+E                  +VG  G L +     + L+++SY +NHI +
Subjt:  ------------------ESKNPHYLPAELSQTS----NRRSLRSSSAESHEK----------------SSVGMPGKLEIPRHSEVDLRVISYFQNHIDS

Query:  EI-FWERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR----
         I   +   W  + +YG PE  ++++ W LL+ L +  + PWLV  D NEIL  +EK GGN ++   ++ FR  + D +LRD+   G+ FTW++RR    
Subjt:  EI-FWERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR----

Query:  --------------------------------------------------------------HNNCGS--------ENLEVKLKGIESRQKRAYNEF---
                                                                         C S         N  + L  I  R  R   E    
Subjt:  --------------------------------------------------------------HNNCGS--------ENLEVKLKGIESRQKRAYNEF---

Query:  -----------LEVDFQKIHVIE----------------FELDNLLEEEEIYWK-------------------------KQKDEIKGDKDFKGQWCEDPK
                   L    +K+  +E                 E+   LE +E+ WK                         ++K+ I   +D  G W +  +
Subjt:  -----------LEVDFQKIHVIE----------------FELDNLLEEEEIYWK-------------------------KQKDEIKGDKDFKGQWCEDPK

Query:  IIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVE-----------------------------------------------
         ++A    YF +LF++++  +  M  +L  V  +V   MNE LL PY   EVE                                               
Subjt:  IIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVE-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------KEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFAL
                                                                K    R  + GI      P+I+HLLF DDS+  C+A  S    +
Subjt:  --------------------------------------------------------KEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFAL

Query:  KTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV------------------DKEVLIKSIGQA
        + +L ++E AS   IN  K++++ S NV  D    +  + G    +    YLGLP    RSK +    I                    +EVLIK++  +
Subjt:  KTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV------------------DKEVLIKSIGQA

Query:  LPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVP
        +PTYAMS F   K LC E+     +FWWG      +IHW SW+ LC+ K  GG+ FR+L  FN AL+AKQ              V ++ YFP +S+ E  
Subjt:  LPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVP

Query:  MGINASYLWKSLL-----------WEF---------------------------KNAKLASFISSD-GKWNTDMLGRAFNQEDINLILRMPINSNIEDKW
         G N+SY+WK +L           W                             +N K+ S I ++ G WN  M+   FN   I  IL++ I+ N ED W
Subjt:  MGINASYLWKSLL-----------WEF---------------------------KNAKLASFISSD-GKWNTDMLGRAFNQEDINLILRMPINSNIEDKW

Query:  VWHFDK----------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKE-------------KEVWGKILSRVSG
         W  +K                                  KS++   +P K+K F WRA  E +PT LNL+++               E     L   S 
Subjt:  VWHFDK----------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKE-------------KEVWGKILSRVSG

Query:  AQNFNNCFCNRWLSLSSSCSDEDL--------------NLCAITCWALWNDRNNVRIGKQIPDP-DVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRN
         +N    FC +  +L S  S  DL                 AIT W +W  R N ++ + I  P +V     L    E+++       +P    ++SK  
Subjt:  AQNFNNCFCNRWLSLSSSCSDEDL--------------NLCAITCWALWNDRNNVRIGKQIPDP-DVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRN

Query:  QLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGA
         ++W  PP  F+KLN+D A     +  G+ V+L++  G+++   ++           E + +  GL+  +      +MLE++CL ++N L   +E L   
Subjt:  QLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGA

Query:  DYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYPSWIS
         + L++I RL      ++  + NR  N++   +A++A L      W    PS++S
Subjt:  DYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYPSWIS

XP_012851712.1 PREDICTED: uncharacterized protein LOC105971405 [Erythranthe guttata]2.3e-8424.41Show/hide
Query:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY
        + E++  H L   + +  N+ +L   S +      +G  G L +    ++D+R+ISY  NHID+E++       WR +  YG+PE +RK  +W+L+R L 
Subjt:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY

Query:  NHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL
        +H   PWL+G D NEILS +EK+GGN++   +I +FR  + D +L D+   GS FTW++ RH                              N  GS++L
Subjt:  NHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL

Query:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ
         +K      K +++ R+KR +                                                                  E  E D      +
Subjt:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ

Query:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL
        +I  +   L+   E+ ++YW +Q+ +I    +GD                      KD +G W +    IE     +F+ +F S+NP +D +D+VL ++ 
Subjt:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL

Query:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK
         +V  S ++ L  P+T  E                              +  +A  RG   G+  APT P +S+L F DD+LIF  A ES    LK IL 
Subjt:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK

Query:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA
         +   S   INF KS ++ SP+ +++    +  +LG ++ E    YLG+P+   ++K +    + D                  KEVLIK++ QA+P+Y 
Subjt:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA

Query:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINA
        MS F L   L  EI     RFWWGS  +R  + W SWK LC  K  GGL FR++  FN AL+ KQ              ++ + YFP   +L   +G   
Subjt:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINA

Query:  SYLWKSLL--------------------------WEFKNAKLASFI-----------------SSDGKWNTDMLGRAFNQEDINLILRMPINSNI-EDKW
        S  W+S+                           W     KL  F                     G+W+  +L   F   D+  IL +P+ S + +D+W
Subjt:  SYLWKSLL--------------------------WEFKNAKLASFI-----------------SSDGKWNTDMLGRAFNQEDINLILRMPINSNI-EDKW

Query:  VWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKK-------------EKEVWGKIL
        +WHF K                                       K I+   +P K++ F WR  + IIPTN  L ++             EKE    ++
Subjt:  VWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKK-------------EKEVWGKIL

Query:  SRVSG--------------AQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM
         +  G              + N+++ + N    L  S  +E+ N+  +  W  W++RN     +++       +W   +   ++   A+I  +      M
Subjt:  SRVSG--------------AQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM

Query:  SKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEE
           +   W  PP   +KLN DAA+ +  +   + ++ +D  G  LG     +  +      E +   + +  A S     ++LE +CLQVIN L +   E
Subjt:  SKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEE

Query:  LRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK
              ++EE   +   F S  FN+  R  N L   +A+
Subjt:  LRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK

XP_012857846.1 PREDICTED: uncharacterized protein LOC105977118 [Erythranthe guttata]9.8e-8324.06Show/hide
Query:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY
        + E++  H L   + +  N+ +L   S +      +G  G L +    ++D+R+ISY  NHID+E++       WR +  YG+PE +RK  +W+L+R L 
Subjt:  IKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW--ERKSWRFSRMYGFPEGNRKTQTWDLLRLLY

Query:  NHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL
        +H   PWL+G D NEILS +EK+GGN++   +I +FR  + D +L D+   GS FTW++ R                               N  GS++L
Subjt:  NHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRH------------------------------NNCGSENL

Query:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ
         +K      K +++ R+KR +                                                                  E  E D      +
Subjt:  EVKL-----KGIES-RQKRAY-----------------------------------------------------------------NEFLEVD-----FQ

Query:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL
        +I  +   L+   E+ ++YW +Q+ +I    +GD                      KD +G W +    IE     +F+ +F S+NP +D +D+VL ++ 
Subjt:  KIHVIEFELDNLLEEEEIYWKKQKDEI----KGD----------------------KDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVL

Query:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK
         +V  S ++ L  P+T  E                              +  +A  RG   G+  APT P +S+L F DD+LIF  A ES    LK IL 
Subjt:  PKVDWSMNEKLLAPYTQSE------------------------------VEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILK

Query:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA
         +   S   INF KS ++ SP+ +++    +  +LG ++ E    YLG+P+   ++K +    + D                  KEVLIK++ QA+P+Y 
Subjt:  QFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYA

Query:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINA
        MS F L   L  EI     RFWWGS  +R  + W SWK LC  K  GGL FR++  FN AL+ KQ              ++ + YFP   +L   +G   
Subjt:  MSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINA

Query:  SYLWKSLL--------------------------WEFKNAKLASFI-----------------SSDGKWNTDMLGRAFNQEDINLILRMPINSNI-EDKW
        S  W+S+                           W     KL  F                     G+W+  +L   F   D+  IL +P+ S + +D+W
Subjt:  SYLWKSLL--------------------------WEFKNAKLASFI-----------------SSDGKWNTDMLGRAFNQEDINLILRMPINSNI-EDKW

Query:  VWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKE---------------------
        +WHF K                                       K I+   +P K++ F WR  + IIPTN  L +++                     
Subjt:  VWHFDK---------------------------------------KSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKE---------------------

Query:  ------KEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM
              ++VW         + N+++ + N    L  S  +E+ N+  +  W  W++RN     +++       +W   +   ++   A+I  +      M
Subjt:  ------KEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM

Query:  SKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEE
           +   W  PP   +KLN D A+ +  +   + ++ +D  G  LG     +  +      E +   + +  A S     ++LE +CLQVIN L +   E
Subjt:  SKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEE

Query:  LRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK
              ++EE   +   F S  FN+  R  N L   +A+
Subjt:  LRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAK

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.3e-9024.84Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK
        MA RLG+ALG F + DCD  N  W  +LR+++++DI+KPLRRGI ++L G     WI  +YERLPDFCY CG    +    K ++   W R     + + 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK

Query:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAESSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM
        P        K P E  +DK  N S     + S+  V   S+ V+         +P E                                S V  T ++  
Subjt:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAESSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM

Query:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR
           ++ KS      G +    + +KE  N   L   L   S   S++ S ++S  +  +  PG                               S RF+ 
Subjt:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR

Query:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------
         YG P  +++  TW+LLR + N + +PWL+G D+N IL   E S  +S  +  IE+FR  +   +L D+  +G  FTW + R                  
Subjt:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------

Query:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------
                        HN   S++++                   ++K  ++    AYN+ L +DF  IH +E +L  LLE EEI+WK++          
Subjt:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------

Query:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------
                                        K+EI+            G   F   + +       PK +EA             WN            
Subjt:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------

Query:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----
                                             D  S+  P + + D V                          L     +V+W+  E +     
Subjt:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----

Query:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF
                                                 L+PY         S +    N+ G L GI        I+HLLF DDSLIF R+ ES   
Subjt:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF

Query:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC
        AL+ +L  +  AS   INF+KSA++ SPNV+ +R  +L  IL +K+    G YLGLPS F+R +                                    
Subjt:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC

Query:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLESIYFPDASILEVPMGINASYLWKSLLW--
                        + +++HW  W  +C PK  GGLNFR+L GFNQAL+AK              +VL+  YF D S+L+      +SY WK  LW  
Subjt:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLESIYFPDASILEVPMGINASYLWKSLLW--

Query:  ----------------------------------EFKNAKL----ASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S
                                           F N  L    ASFI++DG W+   +  +F  ED +LIL MPI+S N++D W+WH+DK+  YS  S
Subjt:  ----------------------------------EFKNAKL----ASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S

Query:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS
        G                              VP K+K F WR+ +E IPT  NL                            K+ +++W  +   L+ +S
Subjt:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS

Query:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL
           N +  F   W SL+     +DLNL AIT W +WNDRN++  GKQ+   + +C W+  + D    S A++       Q   +     W       +KL
Subjt:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL

Query:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF
        N DAA +   AST    I++D +  L+  ++  +       LAEI  + EGLKFA + N   + +ES+ L  I L++NE         W+ EI+ L   F
Subjt:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF

Query:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI
          I F++ +R  N     +AK      S    WL  +P+W+
Subjt:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.4e-8424.76Show/hide
Query:  GMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW-ERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFR
        G+ G L +  ++EVD+ +ISY  +HID+ I   + K WR S +YG PE  +K  TW LLR L      PWL   D NEIL   EK GG  +    + +FR
Subjt:  GMPGKLEIPRHSEVDLRVISYFQNHIDSEIFW-ERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFR

Query:  EAIGDRNLRDIKGRGSTFTWNSRR----------------------------------------------------------------------------
        E + D NL D+  RG  FTW++RR                                                                            
Subjt:  EAIGDRNLRDIKGRGSTFTWNSRR----------------------------------------------------------------------------

Query:  ------------------------HNNCGS----------ENLEVKLKGIESRQKRAYNEFLE-VDFQKIHVIEFELDNLLEEEEIYWK-----------
                                 NNC +          +  + KLK ++ +     + F +  +  +I   E +++ +L +EE+YWK           
Subjt:  ------------------------HNNCGS----------ENLEVKLKGIESRQKRAYNEFLE-VDFQKIHVIEFELDNLLEEEEIYWK-----------

Query:  --------------KQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK-----------
                      K+K+ I G  D    W +D + +E  F  YF  LF++S+P +  ++  L  + P+V   MN +L +P+T+ EV             
Subjt:  --------------KQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK-----------

Query:  ----------------------------------------EANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINF
                                                +A  +  +HGI       K+SHLLF DDSLIF RA   +   LK + + +  AS    NF
Subjt:  ----------------------------------------EANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINF

Query:  TKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV-------------------DKEVLIKSIGQALPTYAMSLFQLLKKLC
         KS++  S     D+   +S+I  +K+      YLGLPS   R KTK     V                    KEVLIK++ QA+PTYAMS+F++   LC
Subjt:  TKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIV-------------------DKEVLIKSIGQALPTYAMSLFQLLKKLC

Query:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLW--
        ++I ++  RFWWG+  DRK IHW  W+ +   KA GG+ FR+L  FNQAL+AKQ              VL++ YF     +   +G   S++W+S++W  
Subjt:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLW--

Query:  -----------------------------EFK---------NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS--S
                                      FK         +  +A  I    +W  D++ + F  ED   I+++P+     ED+ +WH+DKK  YS  S
Subjt:  -----------------------------EFK---------NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS--S

Query:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQKKE---------------------------KEVW--GKILSRVSG
        G                              +P KVK F WRA ++++PT  NL KK+                           +++W    +   + G
Subjt:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQKKE---------------------------KEVW--GKILSRVSG

Query:  AQNFNNCFCNR-WLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM--SKRNQLKWSNPPKGFI
            +  +  + W    +     ++   A   WA+W  RN      +  +P      ++   +    S  KI + P  + +   +   Q +WS PP G+ 
Subjt:  AQNFNNCFCNR-WLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM--SKRNQLKWSNPPKGFI

Query:  KLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKV
        K+NVDAA        GL V+++D +G     + +++ L   + +AE   +  GLK A   +    + ES+ L+VI+L+  ++  L    + + +I+    
Subjt:  KLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKV

Query:  HFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYP
        +F + +  +  R  N     +AK A      V WL   P
Subjt:  HFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYP

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248746.2e-9124.84Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK
        MA RLG+ALG F + DCD  N  W  +LR+++++DI+KPLRRGI ++L G     WI  +YERLPDFCY CG    +    K ++   W R     + + 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSK

Query:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAESSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM
        P        K P E  +DK  N S     + S+  V   S+ V+         +P E                                S V  T ++  
Subjt:  PSSQRDKKDKSPFESSVDKPKNRSSCERVAESSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFM

Query:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR
           ++ KS      G +    + +KE  N   L   L   S   S++ S ++S  +  +  PG                               S RF+ 
Subjt:  NKLRKWKSRA--SSGRKTFWALKIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSR

Query:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------
         YG P  +++  TW+LLR + N + +PWL+G D+N IL   E S  +S  +  IE+FR  +   +L D+  +G  FTW + R                  
Subjt:  MYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRR------------------

Query:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------
                        HN   S++++                   ++K  ++    AYN+ L +DF  IH +E +L  LLE EEI+WK++          
Subjt:  ----------------HNNCGSENLEV------------------KLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQ----------

Query:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------
                                        K+EI+            G   F   + +       PK +EA             WN            
Subjt:  --------------------------------KDEIK------------GDKDFKGQWCE------DPKIIEATF-----------WN------------

Query:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----
                                             D  S+  P + + D V                          L     +V+W+  E +     
Subjt:  ----------------------------------YFKDLFSSSNPPKDVMDKV--------------------------LGSVLPKVDWSMNEKL-----

Query:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF
                                                 L+PY         S +    N+ G L GI        I+HLLF DDSLIF R+ ES   
Subjt:  -----------------------------------------LAPYT-------QSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF

Query:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC
        AL+ +L  +  AS   INF+KSA++ SPNV+ +R  +L  IL +K+    G YLGLPS F+R +                                    
Subjt:  ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLC

Query:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLESIYFPDASILEVPMGINASYLWKSLLW--
                        + +++HW  W  +C PK  GGLNFR+L GFNQAL+AK              +VL+  YF D S+L+      +SY WK  LW  
Subjt:  DEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAK--------------QVLESIYFPDASILEVPMGINASYLWKSLLW--

Query:  ----------------------------------EFKNAKL----ASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S
                                           F N  L    ASFI++DG W+   +  +F  ED +LIL MPI+S N++D W+WH+DK+  YS  S
Subjt:  ----------------------------------EFKNAKL----ASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--S

Query:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS
        G                              VP K+K F WR+ +E IPT  NL                            K+ +++W  +   L+ +S
Subjt:  G------------------------------VPHKVKFFCWRALNEIIPTNLNLQ---------------------------KKEKEVWGKI---LSRVS

Query:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL
           N +  F   W SL+     +DLNL AIT W +WNDRN++  GKQ+   + +C W+  + D    S A++       Q   +     W       +KL
Subjt:  GAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKL

Query:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF
        N DAA +   AST    I++D +  L+  ++  +       LAEI  + EGLKFA + N   + +ES+ L  I L++NE         W+ EI+ L   F
Subjt:  NVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHF

Query:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI
          I F++ +R  N     +AK      S    WL  +P+W+
Subjt:  LSIEFNYCNRCNNELVDKVAK-NARLNSLNVDWLSCYPSWI

A0A803PV25 Uncharacterized protein7.4e-6827.25Show/hide
Query:  KEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPS
        + A  +G LHG+        +SHL F DDSL+F  A E      + +L+++ +AS   +NF KS +    +V+      L+  +G+K+ ++ G YLGLPS
Subjt:  KEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPS

Query:  DFSRSKTKDLQGI-----------------VDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFR
           R+K +  + I                   KEVLIK+I QA+PTY MS F+L KK  + I     RFWWGS++   +IHW  W  LC  K  GGL FR
Subjt:  DFSRSKTKDLQGI-----------------VDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFR

Query:  ELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLWEFK----------------------------------------
        +L  FNQAL+AKQ              VL++ Y+P+  +LE   G +AS++W+SL+W  K                                        
Subjt:  ELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLWEFK----------------------------------------

Query:  NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--SG------------------------------VPHKVKFFCW
        N  +      +G+W+ + +   FN  D  LIL+M  +  +IEDK +WH+ K   YS  SG                              +P KVK F W
Subjt:  NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--SG------------------------------VPHKVKFFCW

Query:  RALNEIIPTNLNLQKKEKEVWGKILSRVSGAQNFNNCF---------CNRW---------------------LSLSSSCSDEDLNLCAITCWALWNDRNN
        +  +  IPTN  L  ++ ++        SGA  + N F         C+ W                     + +SSS + ED     +  W LW  RN+
Subjt:  RALNEIIPTNLNLQKKEKEVWGKILSRVSGAQNFNNCF---------CNRW---------------------LSLSSSCSDEDLNLCAITCWALWNDRNN

Query:  VRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM-SKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDL
        V  G   P       W  K+  EF        +D +  Q+  + R   +W  P +G   +NVDA  K       ++ +++D  G++   + R ++ +   
Subjt:  VRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEM-SKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDL

Query:  PLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEI----ERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCY
          AE+  + +G+K  + Q      +E++CLQ +NL+  ++   R  D  + +I    + ++VH +S  +    R  N+    +A  A +N  +  W+   
Subjt:  PLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEI----ERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCY

Query:  P
        P
Subjt:  P

A0A803PV25 Uncharacterized protein4.1e-1021.61Show/hide
Query:  VGMPGKLEIPRHSEVDLRVISYFQNHIDSEIF-WERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESF
        +G+ G L +   +++ ++V+S    HI + +       W F+  YG P+  ++  +W LLR L      PWL   D NEI+S +EK GG  +    ++ F
Subjt:  VGMPGKLEIPRHSEVDLRVISYFQNHIDSEIF-WERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESF

Query:  REAIGDRNLRDIKGRGSTFTW-----NSR-----------------------------------------------------------------------
        +E + D    D        TW     NSR                                                                       
Subjt:  REAIGDRNLRDIKGRGSTFTW-----NSR-----------------------------------------------------------------------

Query:  ---------------------RHNNCG------SENLEVKLKGIESRQKRAYNEFLEVD----FQKIHVIEFELDNLLEEEEIYW---------------
                             + N CG      +   + KL     + K+A +E   +     ++ I  +E +L+ LLE++E YW               
Subjt:  ---------------------RHNNCG------SENLEVKLKGIESRQKRAYNEFLEVD----FQKIHVIEFELDNLLEEEEIYW---------------

Query:  ----------KKQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEV
                  +++K+EIKG +D  G W +D  ++     +Y++ LF+SS+  + V+++VL  V PKV   MN  LLA + + EV
Subjt:  ----------KKQKDEIKGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEV

A0A803PV25 Uncharacterized protein9.6e-6823.69Show/hide
Query:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDC-----------KSYKRLG-
        + R +G++ G   ++D   D   W + LR +  +++TKPLRRG+ +  + +   I I+ +YE+LPDFCY CG + HV  +C           K  K  G 
Subjt:  MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDC-----------KSYKRLG-

Query:  WFRGLNPRRKS-----------KPSSQRDKKDKSPFESSVDKPKNRSSCE-------------RVAESSKFVQ--TESRAVERKNQVNQKEMPPEAM---
        W R   PR KS           +   + +KKD      SV+K +    C+             R  E S   Q  T+  +   K QV  K    ++    
Subjt:  WFRGLNPRRKS-----------KPSSQRDKKDKSPFESSVDKPKNRSSCE-------------RVAESSKFVQ--TESRAVERKNQVNQKEMPPEAM---

Query:  --KFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFMNKLRKWKSRASSGRK----------TFWALKIKESKNPHYLPAELSQTS--NRRSLR
          K       +  K++V    N   + V             F+ K     S +  G K          T   +    +   H +  +  + S  +    R
Subjt:  --KFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFMNKLRKWKSRASSGRK----------TFWALKIKESKNPHYLPAELSQTS--NRRSLR

Query:  SSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKS--WRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSG
          S E  E  S    G L +    E ++ ++SY  +H D+ +   + S  WRF+  YG P  +R+ ++WDL+R L   +  PW++G D NEI+  +EK G
Subjt:  SSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKS--WRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSG

Query:  GNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENLEVKLKGIESRQK--RAYNEFLEV------------DFQKIHVIEFE--------
        G+ +    ++ FR  I D  L  +K  G   TW   R  N   E L+  L     R +  +  N  L++            D +++ V  FE        
Subjt:  GNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENLEVKLKGIESRQK--RAYNEFLEV------------DFQKIHVIEFE--------

Query:  ------LDNL-----LEEEEIYWKK-QKDEIKGDKD-----------------FKGQW-CEDPKIIEATFWNYFKDLF--------SSSNPPKDVMDKVL
               DN+     ++  E+  K  +  E+K   D                 ++  W     +I+     N FK +         S+  P + + D  L
Subjt:  ------LDNL-----LEEEEIYWKK-QKDEIKGDKD-----------------FKGQW-CEDPKIIEATFWNYFKDLF--------SSSNPPKDVMDKVL

Query:  --------------------------GSVLPKVDWSMNEKL----------------------------------------------LAPYT-------Q
                                       +V+W+  E++                                              ++PY         
Subjt:  --------------------------GSVLPKVDWSMNEKL----------------------------------------------LAPYT-------Q

Query:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL
        S +   A+++G + G+      P+I+HLLF DDSL+F RA  +   A+  +LKQ+E AS   +N  KSA+  S N        +   LG+        YL
Subjt:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL

Query:  GLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG
        G+P    R K    Q I D                  K VLI+++ Q +P YAMS F+  K    ++     RFWW      K I W +W+ L + K  G
Subjt:  GLPSDFSRSKTKDLQGIVD------------------KEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG

Query:  GLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLL-----------WEF--------------------KNAKL
        GL FR+   FN A +AKQ              V  + YFP  S +   +G N SY+W+S+L           W                      K + +
Subjt:  GLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLL-----------WEF--------------------KNAKL

Query:  ASFISSDG-----------KWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS-SGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKI
        A  I   G            W  D+L   F    +  IL + ++S+ +EDK +W   K  +++     H  +    R   E++   L      K +W   
Subjt:  ASFISSDG-----------KWNTDMLGRAFNQEDINLILRMPINSN-IEDKWVWHFDKKSIYS-SGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKI

Query:  LSRVSGAQNFNNCFCNR--------WLS----LSSSCSDEDLNLCA------------ITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQRSVA
         +RVS    F  C+  R        W+      + S  D   NLC             I  W LW +RN+ +  GK I    +    +L    EF+  + 
Subjt:  LSRVSGAQNFNNCFCNR--------WLS----LSSSCSDEDLNLCA------------ITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQRSVA

Query:  KIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGK-LLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNC
         +  D   +Q     +  KW+ P  G +KLN DAA+  +   T L ++L+D  G  +LG  +R  + D ++  AE + +  GL+ A    H  +++ES+ 
Subjt:  KIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGK-LLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNC

Query:  LQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVA
         Q I  ++   E L      + +I      F  I F++  R  N L   +A
Subjt:  LQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVA

A0A803Q0L5 Uncharacterized protein2.1e-7028.05Show/hide
Query:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL
        S + +     G ++G+        +SHL F DDSLIF  A   +    K++L+ +  AS   +N+ KS +    NV+++    L+  +G++  ++ G YL
Subjt:  SEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYL

Query:  GLPSDFSRSKTKDLQGI------------------VDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG
        GLPS   R+K + L  I                    KEVLIK++ QA+PTYAMS F+L KK    I R   RFWWGS++  K+IHW  W  LC PK  G
Subjt:  GLPSDFSRSKTKDLQGI------------------VDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALG

Query:  GLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLWEFK-----------------------------------
        GL FR+L  FNQA++AKQ              VL++ YFP  +ILE   G  AS++W+SL+W  K                                   
Subjt:  GLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLWEFK-----------------------------------

Query:  -----NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDK--------------------------------KSIYSSGVPHKV
             N  +A     DG W++  +   FN +D  LIL MP     +EDK +WH+ K                                K+++   +P K+
Subjt:  -----NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDK--------------------------------KSIYSSGVPHKV

Query:  KFFCWRALNEIIPTNLNLQKK--EKEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDE
        K F W+     +PTN+ L K+   KE W     +    +  +    +  + ++            +  W +WN RN+V  G   P       W  KY  E
Subjt:  KFFCWRALNEIIPTNLNLQKK--EKEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDE

Query:  FQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVM
        F+   A       T     +R   KW  P  G +K+NVDA  KS    +G+  +++D  G+ L  S+  +  +      E+  +  GL+  I +      
Subjt:  FQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVM

Query:  LESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDW----LSCYPSWIS-GMPFPL
        +ES+CLQ + L++ ++E     D  L++I  L ++      ++  R  N +   +A  A +N  +  W    LSC    +S  MP P+
Subjt:  LESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDW----LSCYPSWIS-GMPFPL

A0A803Q1K6 Uncharacterized protein1.7e-7224.38Show/hide
Query:  VGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWER-KSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESF
        VG+ G L +    E+ LRV S    HI + +       W  +  YG P+ +++  +WDLLR +       WL   D NEI+S  EK GG  +++ A+E F
Subjt:  VGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWER-KSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGDDLNEILSETEKSGGNSKASRAIESF

Query:  REAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENL----------------EVKL---------------------KGIESRQKRAYNEFLEV--------
        REAI D  L D     +  TW +   +N   E L                +VK+                      G   + K +   F E         
Subjt:  REAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENL----------------EVKL---------------------KGIESRQKRAYNEFLEV--------

Query:  -----------------DFQK---------------------IHVIEFELDNLLEEEEIYW-------------------------KKQKDEIKGDKDFK
                         DF++                     +  IE +L+ +LE++E+YW                         +++K+ IKG  D  
Subjt:  -----------------DFQK---------------------IHVIEFELDNLLEEEEIYW-------------------------KKQKDEIKGDKDFK

Query:  GQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK--------------------------------------
        G W ++  +++    +YF+ +F +S  P DV D+V+  V  KV   MNE LL  +T  E+ K                                      
Subjt:  GQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEK--------------------------------------

Query:  --------------------------EANS--------RGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAII
                                  EA S         G L GI        ISHL F D SLIF  A   +    + +L ++  AS   +N+ KS   
Subjt:  --------------------------EANS--------RGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFFALKTILKQFELASDGSINFTKSAII

Query:  ISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQ------------------GIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSF
           NV  +    L+ ILG++  E+ G YLGLPS   R+K + L                    +  KEVLIK+I QA+PTY MS F+L KK    + R  
Subjt:  ISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQ------------------GIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSF

Query:  TRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLWEFK------
        +RFWWGS+D  K+IHW  W++LC PK  GGL FR+L  FNQAL+AKQ              VL++ YFP   +LE   G NAS++ +SL+W  K      
Subjt:  TRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEVPMGINASYLWKSLLWEFK------

Query:  ----------------------------------NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--SG------
                                          N  +     +DG+W+   +   FN  D +LIL +P +  + EDK +WH+ K   YS  SG      
Subjt:  ----------------------------------NAKLASFISSDGKWNTDMLGRAFNQEDINLILRMPINS-NIEDKWVWHFDKKSIYS--SG------

Query:  ------------------------VPHKVKFFCWRALNEIIPTNLNLQKK---------------EKEV----W----GKILSRVSGAQN-----FNNCF
                                 P KVK F W+  +  +P N+NL K+               E+ V    W     K   RVSG  +          
Subjt:  ------------------------VPHKVKFFCWRALNEIIPTNLNLQKK---------------EKEV----W----GKILSRVSGAQN-----FNNCF

Query:  CNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSN
         +  + +++    E L    +  W +WN RN V  G   P P+    W   +   F+      G++ S      +R    W  P +              
Subjt:  CNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQIPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSN

Query:  PASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCN
                       +   V+A  + L       E + +++G++  I Q  +   ++++CLQ ++L++N+    R  D  L +I  L  +   +  ++  
Subjt:  PASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCN

Query:  RCNNELVDKVAKNARLNSLNVDWLSCYP
        R  N +   +A  A ++  +  W+   P
Subjt:  RCNNELVDKVAKNARLNSLNVDWLSCYP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.8e-0727.13Show/hide
Query:  IISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWK
        ++   +NKD +G + + +  +M               R KT    G   +  L K++  ++P ++MS   L + + + + +    F WGST ++K+ H  
Subjt:  IISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWK

Query:  SWKFLCLPKALGGLNFRELIGFNQALIAK
         W  +C PK  GGL  R     N+ALI+K
Subjt:  SWKFLCLPKALGGLNFRELIGFNQALIAK

P93295 Uncharacterized mitochondrial protein AtMg003107.2e-2043.48Show/hide
Query:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILE
        ALP YAMS F+L K LC ++T + T FWW S +++++I W +W+ LC  K   GGL FR+L  FNQAL+AKQ              +L S YFP +S++E
Subjt:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILE

Query:  VPMGINASYLWKSLL
          +G   SY W+S++
Subjt:  VPMGINASYLWKSLL

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-1624.55Show/hide
Query:  NLCAITCWALWNDRNNVRI-GKQIPDPDVRCTWILKYYDEFQR-SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWN
        NL     W LW  RN +   GK+   P+V    + +  ++F+  S  +  +  ++  ++ +   ++W  PP  ++K N DA W+      G+  IL++ +
Subjt:  NLCAITCWALWNDRNNVRI-GKQIPDPDVRCTWILKYYDEFQR-SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWN

Query:  GKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYW------LEEIERLKVHFLSIEFNYCNRCNNELVD
        G +L + ARA+    ++  AE+  +R  +      N+K ++ ES+   ++NLL ++       D+W      LE+I++L  HF  ++F +  R  N++ D
Subjt:  GKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLESNCLQVINLLKNENEELRGADYW------LEEIERLKVHFLSIEFNYCNRCNNELVD

Query:  KVAKNA-RLNSLNVDWLSCYPSWI
        ++A+ +   ++ +    S  P W+
Subjt:  KVAKNA-RLNSLNVDWLSCYPSWI

AT4G29090.1 Ribonuclease H-like superfamily protein2.5e-2821.78Show/hide
Query:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEV
        ALPTY M+ F L K +C +I      FWW +  + K +HWK+W  L   KA GG+ F+++  FN AL+ KQ              V +S YF  +  L  
Subjt:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILEV

Query:  PMGINASYLWKS----------------------LLWEFK--------------------------NAKLASFISSDGK-WNTDMLGRAFNQEDINLILR
        P+G   S++WKS                      ++W  K                            K++  I   G+ W  D++   F + +  LI  
Subjt:  PMGINASYLWKS----------------------LLWEFK--------------------------NAKLASFISSDGK-WNTDMLGRAFNQEDINLILR

Query:  M-PINSNIEDKWVWHF-------------------DKKS----------------IYSSGVPHKVKFFCWRALNEIIP-----TNLNLQKKE--------
        + P    I D + W +                   +K+S                I+ S    K++ F W+ L+  +P        +L K+         
Subjt:  M-PINSNIEDKWVWHF-------------------DKKS----------------IYSSGVPHKVKFFCWRALNEIIP-----TNLNLQKKE--------

Query:  KEVWGKILSRVS---------------GAQNFNNCFCNRW----LSLSSSCSDEDLNLCAITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQR-
        KE    +L + +               G +  ++ + N +    L   +   ++   L     W LW +RN  V  G++    +V    + +  D+ +  
Subjt:  KEVWGKILSRVS---------------GAQNFNNCFCNRW----LSLSSSCSDEDLNLCAITCWALWNDRNN-VRIGKQIPDPDVRCTWILKYYDEFQR-

Query:  SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLES
         +    +   T  ++++ +  +W  PP  ++K N DA W  +    G+  +L++  G++  + ARA+     +  AE+  +R  +       +  V+ ES
Subjt:  SVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAISQNHKLVMLES

Query:  NCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVD--WLSCYPSW
        +   +I +L N+ E        +++++RL   F  ++F +  R  N L ++VA+ + L+ LN D    S  PSW
Subjt:  NCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVD--WLSCYPSW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.1e-2143.48Show/hide
Query:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILE
        ALP YAMS F+L K LC ++T + T FWW S +++++I W +W+ LC  K   GGL FR+L  FNQAL+AKQ              +L S YFP +S++E
Subjt:  ALPTYAMSLFQLLKKLCDEITRSFTRFWWGSTDDRKRIHWKSWKFLCLPKA-LGGLNFRELIGFNQALIAKQ--------------VLESIYFPDASILE

Query:  VPMGINASYLWKSLL
          +G   SY W+S++
Subjt:  VPMGINASYLWKSLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAAGGTTAGGGGATGCTTTGGGGAAGTTCTTAAAGGTCGATTGCGATCAGGATAATTTTTGCTGGTGGGAGAGTCTCAGACTGAAAATTCTTGTGGACATCAC
AAAACCTTTACGAAGAGGAATCTGGGTAAGCCTGAGTGGTGATCAGGAAAGCATTTGGATAAATTCCAAATATGAGAGACTACCTGATTTTTGCTATGGCTGTGGGAGGA
TTGGGCACGTTATTAAGGATTGTAAGTCTTATAAAAGACTCGGATGGTTTAGAGGTTTGAACCCAAGAAGGAAAAGTAAACCCTCATCCCAAAGAGACAAGAAGGACAAG
TCCCCCTTCGAGTCGTCTGTAGACAAACCCAAAAACCGTTCATCTTGTGAAAGGGTGGCTGAATCCTCTAAGTTTGTACAGACAGAAAGTAGAGCTGTGGAGAGAAAGAA
TCAAGTAAATCAGAAGGAGATGCCACCAGAAGCCATGAAGTTTTCAAGCCAGATCCTGTTCATATTCATAAAGGAAGAAGTAAACAAAGTCCTTAATTTTCATCCAGAAG
GAGTAGAGTTTCTAACCTCGGAGGTCATTGGAACCAGCGAACGTTTTATGAACAAGCTAAGGAAGTGGAAAAGTAGAGCGAGTTCCGGGAGGAAGACCTTCTGGGCCCTG
AAAATAAAAGAAAGCAAGAATCCTCATTACCTTCCAGCAGAACTATCTCAAACATCCAATCGGCGGAGCTTGCGCAGCAGCTCTGCCGAGAGCCATGAAAAATCATCTGT
TGGAATGCCCGGGAAATTAGAAATCCCAAGGCATTCTGAGGTGGATCTAAGAGTAATCTCTTACTTTCAGAACCACATTGATTCTGAAATCTTTTGGGAAAGGAAGAGTT
GGAGATTCTCCAGAATGTATGGCTTCCCTGAAGGAAACAGGAAGACCCAAACCTGGGATTTGCTTCGTCTTCTTTATAATCACAATGACAATCCTTGGCTAGTGGGGGAC
GACTTAAACGAAATCTTAAGTGAAACCGAAAAATCCGGTGGTAATAGCAAGGCCAGTAGGGCTATCGAAAGCTTTAGAGAGGCGATTGGTGATCGCAATTTGAGGGATAT
CAAAGGTCGTGGCTCTACTTTTACCTGGAATAGCAGAAGACACAACAATTGCGGCTCTGAGAACCTGGAGGTCAAATTGAAGGGAATTGAATCCCGGCAGAAGCGTGCTT
ATAATGAATTCCTTGAGGTGGACTTCCAAAAGATCCATGTTATCGAGTTTGAGCTTGACAACCTCTTGGAGGAGGAAGAAATTTATTGGAAAAAACAGAAAGATGAGATC
AAAGGGGATAAGGACTTCAAAGGGCAGTGGTGTGAAGACCCAAAAATCATTGAAGCTACCTTCTGGAATTATTTTAAAGATCTCTTTTCTTCATCTAATCCTCCTAAGGA
TGTAATGGATAAAGTGTTGGGCTCTGTCTTGCCAAAGGTGGACTGGAGTATGAATGAGAAATTGCTAGCACCCTACACACAAAGTGAAGTGGAAAAAGAGGCAAACAGTA
GAGGAGCTCTTCATGGGATAGAATGCGCCCCAACTTGTCCAAAAATCTCACATCTTCTATTCGTCGATGATAGCCTCATTTTCTGTAGGGCGGGCGAATCAAACTTCTTT
GCTCTTAAAACAATCTTGAAGCAGTTTGAGTTGGCTTCCGATGGAAGCATCAATTTCACCAAGTCAGCGATAATAATCTCCCCCAATGTAAACAAAGATCGATGGGGCTT
CCTAAGTGACATCCTCGGCATCAAGATGCAAGAGTCCCTTGGCACCTACTTGGGGCTTCCCTCCGACTTTTCTAGAAGCAAAACAAAAGACCTCCAAGGAATTGTAGACA
AAGAAGTCCTTATTAAAAGCATTGGCCAAGCTCTGCCCACGTACGCCATGAGCCTTTTTCAACTTCTAAAGAAGTTGTGTGACGAGATAACCAGGAGTTTCACTAGATTC
TGGTGGGGCTCAACGGATGACAGAAAAAGGATCCATTGGAAAAGCTGGAAGTTCCTTTGCTTACCTAAGGCGCTCGGTGGTCTTAACTTCAGGGAGTTGATCGGGTTTAA
TCAAGCGTTAATTGCAAAACAGGTCCTTGAGAGTATTTACTTTCCTGATGCTTCAATTCTGGAGGTCCCTATGGGAATCAATGCCTCTTATCTCTGGAAAAGTTTGTTAT
GGGAGTTCAAGAATGCTAAGCTGGCTTCTTTCATTAGTTCTGATGGGAAGTGGAACACTGATATGTTGGGAAGAGCCTTCAACCAAGAAGACATTAATCTGATTTTGAGA
ATGCCCATTAACAGTAATATTGAAGATAAGTGGGTTTGGCATTTCGATAAGAAGAGCATTTACTCATCAGGAGTTCCCCATAAGGTGAAATTCTTTTGTTGGAGAGCTCT
AAATGAAATAATCCCTACAAATCTAAACCTCCAGAAGAAAGAGAAGGAAGTATGGGGGAAAATCTTGTCAAGAGTATCGGGTGCCCAAAACTTTAATAATTGTTTCTGTA
ATAGATGGTTGTCTTTAAGTTCTAGTTGTTCAGATGAAGATCTAAATCTATGTGCAATTACGTGTTGGGCTTTGTGGAATGATAGAAATAATGTGAGAATTGGGAAGCAA
ATTCCTGACCCTGATGTAAGATGTACTTGGATTTTGAAATACTACGATGAATTCCAGAGATCAGTTGCAAAAATCGGGAAAGATCCCTCAACTTTGCAGGAGATGTCGAA
AAGGAACCAGCTAAAGTGGTCAAATCCACCTAAAGGCTTTATCAAATTGAATGTGGACGCCGCCTGGAAATCGAATCCTGCTTCAACGGGTTTAAACGTGATACTCAAAG
ATTGGAATGGCAAACTTTTGGGAGTATCAGCCCGGGCCATGGATCTTGATTTTGATCTGCCATTGGCAGAAATCATTACTGTTCGTGAAGGCCTTAAGTTTGCCATTTCT
CAAAACCATAAACTTGTCATGCTGGAATCCAACTGTCTCCAAGTGATCAATTTACTGAAGAACGAAAATGAGGAGTTAAGAGGGGCAGATTACTGGTTGGAGGAGATCGA
GAGACTGAAGGTTCATTTCTTGTCAATTGAGTTTAATTACTGTAATCGTTGCAACAATGAGTTGGTAGATAAAGTTGCAAAAAATGCTAGGCTTAATAGCCTCAATGTTG
ATTGGTTGTCCTGTTATCCTTCGTGGATTAGTGGGATGCCTTTCCCTCTCTTTGTTCCCGTTTCCCATTTCATTCCCCGTCCTTGCCAAATTCCCCTTGGGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGAAGGTTAGGGGATGCTTTGGGGAAGTTCTTAAAGGTCGATTGCGATCAGGATAATTTTTGCTGGTGGGAGAGTCTCAGACTGAAAATTCTTGTGGACATCAC
AAAACCTTTACGAAGAGGAATCTGGGTAAGCCTGAGTGGTGATCAGGAAAGCATTTGGATAAATTCCAAATATGAGAGACTACCTGATTTTTGCTATGGCTGTGGGAGGA
TTGGGCACGTTATTAAGGATTGTAAGTCTTATAAAAGACTCGGATGGTTTAGAGGTTTGAACCCAAGAAGGAAAAGTAAACCCTCATCCCAAAGAGACAAGAAGGACAAG
TCCCCCTTCGAGTCGTCTGTAGACAAACCCAAAAACCGTTCATCTTGTGAAAGGGTGGCTGAATCCTCTAAGTTTGTACAGACAGAAAGTAGAGCTGTGGAGAGAAAGAA
TCAAGTAAATCAGAAGGAGATGCCACCAGAAGCCATGAAGTTTTCAAGCCAGATCCTGTTCATATTCATAAAGGAAGAAGTAAACAAAGTCCTTAATTTTCATCCAGAAG
GAGTAGAGTTTCTAACCTCGGAGGTCATTGGAACCAGCGAACGTTTTATGAACAAGCTAAGGAAGTGGAAAAGTAGAGCGAGTTCCGGGAGGAAGACCTTCTGGGCCCTG
AAAATAAAAGAAAGCAAGAATCCTCATTACCTTCCAGCAGAACTATCTCAAACATCCAATCGGCGGAGCTTGCGCAGCAGCTCTGCCGAGAGCCATGAAAAATCATCTGT
TGGAATGCCCGGGAAATTAGAAATCCCAAGGCATTCTGAGGTGGATCTAAGAGTAATCTCTTACTTTCAGAACCACATTGATTCTGAAATCTTTTGGGAAAGGAAGAGTT
GGAGATTCTCCAGAATGTATGGCTTCCCTGAAGGAAACAGGAAGACCCAAACCTGGGATTTGCTTCGTCTTCTTTATAATCACAATGACAATCCTTGGCTAGTGGGGGAC
GACTTAAACGAAATCTTAAGTGAAACCGAAAAATCCGGTGGTAATAGCAAGGCCAGTAGGGCTATCGAAAGCTTTAGAGAGGCGATTGGTGATCGCAATTTGAGGGATAT
CAAAGGTCGTGGCTCTACTTTTACCTGGAATAGCAGAAGACACAACAATTGCGGCTCTGAGAACCTGGAGGTCAAATTGAAGGGAATTGAATCCCGGCAGAAGCGTGCTT
ATAATGAATTCCTTGAGGTGGACTTCCAAAAGATCCATGTTATCGAGTTTGAGCTTGACAACCTCTTGGAGGAGGAAGAAATTTATTGGAAAAAACAGAAAGATGAGATC
AAAGGGGATAAGGACTTCAAAGGGCAGTGGTGTGAAGACCCAAAAATCATTGAAGCTACCTTCTGGAATTATTTTAAAGATCTCTTTTCTTCATCTAATCCTCCTAAGGA
TGTAATGGATAAAGTGTTGGGCTCTGTCTTGCCAAAGGTGGACTGGAGTATGAATGAGAAATTGCTAGCACCCTACACACAAAGTGAAGTGGAAAAAGAGGCAAACAGTA
GAGGAGCTCTTCATGGGATAGAATGCGCCCCAACTTGTCCAAAAATCTCACATCTTCTATTCGTCGATGATAGCCTCATTTTCTGTAGGGCGGGCGAATCAAACTTCTTT
GCTCTTAAAACAATCTTGAAGCAGTTTGAGTTGGCTTCCGATGGAAGCATCAATTTCACCAAGTCAGCGATAATAATCTCCCCCAATGTAAACAAAGATCGATGGGGCTT
CCTAAGTGACATCCTCGGCATCAAGATGCAAGAGTCCCTTGGCACCTACTTGGGGCTTCCCTCCGACTTTTCTAGAAGCAAAACAAAAGACCTCCAAGGAATTGTAGACA
AAGAAGTCCTTATTAAAAGCATTGGCCAAGCTCTGCCCACGTACGCCATGAGCCTTTTTCAACTTCTAAAGAAGTTGTGTGACGAGATAACCAGGAGTTTCACTAGATTC
TGGTGGGGCTCAACGGATGACAGAAAAAGGATCCATTGGAAAAGCTGGAAGTTCCTTTGCTTACCTAAGGCGCTCGGTGGTCTTAACTTCAGGGAGTTGATCGGGTTTAA
TCAAGCGTTAATTGCAAAACAGGTCCTTGAGAGTATTTACTTTCCTGATGCTTCAATTCTGGAGGTCCCTATGGGAATCAATGCCTCTTATCTCTGGAAAAGTTTGTTAT
GGGAGTTCAAGAATGCTAAGCTGGCTTCTTTCATTAGTTCTGATGGGAAGTGGAACACTGATATGTTGGGAAGAGCCTTCAACCAAGAAGACATTAATCTGATTTTGAGA
ATGCCCATTAACAGTAATATTGAAGATAAGTGGGTTTGGCATTTCGATAAGAAGAGCATTTACTCATCAGGAGTTCCCCATAAGGTGAAATTCTTTTGTTGGAGAGCTCT
AAATGAAATAATCCCTACAAATCTAAACCTCCAGAAGAAAGAGAAGGAAGTATGGGGGAAAATCTTGTCAAGAGTATCGGGTGCCCAAAACTTTAATAATTGTTTCTGTA
ATAGATGGTTGTCTTTAAGTTCTAGTTGTTCAGATGAAGATCTAAATCTATGTGCAATTACGTGTTGGGCTTTGTGGAATGATAGAAATAATGTGAGAATTGGGAAGCAA
ATTCCTGACCCTGATGTAAGATGTACTTGGATTTTGAAATACTACGATGAATTCCAGAGATCAGTTGCAAAAATCGGGAAAGATCCCTCAACTTTGCAGGAGATGTCGAA
AAGGAACCAGCTAAAGTGGTCAAATCCACCTAAAGGCTTTATCAAATTGAATGTGGACGCCGCCTGGAAATCGAATCCTGCTTCAACGGGTTTAAACGTGATACTCAAAG
ATTGGAATGGCAAACTTTTGGGAGTATCAGCCCGGGCCATGGATCTTGATTTTGATCTGCCATTGGCAGAAATCATTACTGTTCGTGAAGGCCTTAAGTTTGCCATTTCT
CAAAACCATAAACTTGTCATGCTGGAATCCAACTGTCTCCAAGTGATCAATTTACTGAAGAACGAAAATGAGGAGTTAAGAGGGGCAGATTACTGGTTGGAGGAGATCGA
GAGACTGAAGGTTCATTTCTTGTCAATTGAGTTTAATTACTGTAATCGTTGCAACAATGAGTTGGTAGATAAAGTTGCAAAAAATGCTAGGCTTAATAGCCTCAATGTTG
ATTGGTTGTCCTGTTATCCTTCGTGGATTAGTGGGATGCCTTTCCCTCTCTTTGTTCCCGTTTCCCATTTCATTCCCCGTCCTTGCCAAATTCCCCTTGGGGATTGA
Protein sequenceShow/hide protein sequence
MARRLGDALGKFLKVDCDQDNFCWWESLRLKILVDITKPLRRGIWVSLSGDQESIWINSKYERLPDFCYGCGRIGHVIKDCKSYKRLGWFRGLNPRRKSKPSSQRDKKDK
SPFESSVDKPKNRSSCERVAESSKFVQTESRAVERKNQVNQKEMPPEAMKFSSQILFIFIKEEVNKVLNFHPEGVEFLTSEVIGTSERFMNKLRKWKSRASSGRKTFWAL
KIKESKNPHYLPAELSQTSNRRSLRSSSAESHEKSSVGMPGKLEIPRHSEVDLRVISYFQNHIDSEIFWERKSWRFSRMYGFPEGNRKTQTWDLLRLLYNHNDNPWLVGD
DLNEILSETEKSGGNSKASRAIESFREAIGDRNLRDIKGRGSTFTWNSRRHNNCGSENLEVKLKGIESRQKRAYNEFLEVDFQKIHVIEFELDNLLEEEEIYWKKQKDEI
KGDKDFKGQWCEDPKIIEATFWNYFKDLFSSSNPPKDVMDKVLGSVLPKVDWSMNEKLLAPYTQSEVEKEANSRGALHGIECAPTCPKISHLLFVDDSLIFCRAGESNFF
ALKTILKQFELASDGSINFTKSAIIISPNVNKDRWGFLSDILGIKMQESLGTYLGLPSDFSRSKTKDLQGIVDKEVLIKSIGQALPTYAMSLFQLLKKLCDEITRSFTRF
WWGSTDDRKRIHWKSWKFLCLPKALGGLNFRELIGFNQALIAKQVLESIYFPDASILEVPMGINASYLWKSLLWEFKNAKLASFISSDGKWNTDMLGRAFNQEDINLILR
MPINSNIEDKWVWHFDKKSIYSSGVPHKVKFFCWRALNEIIPTNLNLQKKEKEVWGKILSRVSGAQNFNNCFCNRWLSLSSSCSDEDLNLCAITCWALWNDRNNVRIGKQ
IPDPDVRCTWILKYYDEFQRSVAKIGKDPSTLQEMSKRNQLKWSNPPKGFIKLNVDAAWKSNPASTGLNVILKDWNGKLLGVSARAMDLDFDLPLAEIITVREGLKFAIS
QNHKLVMLESNCLQVINLLKNENEELRGADYWLEEIERLKVHFLSIEFNYCNRCNNELVDKVAKNARLNSLNVDWLSCYPSWISGMPFPLFVPVSHFIPRPCQIPLGD