; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0013650 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0013650
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPol protein
Genome locationchr10:13294369..13302930
RNA-Seq ExpressionIVF0013650
SyntenyIVF0013650
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032109.1 pol protein [Cucumis melo var. makuwa]6.31e-9745.52Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYA+VR +DLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFV PFEILE+IG VAYRLALPPSLSTVHDVFHVSMLRKYV DPSHVVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPL-------RCWLERPPPPSPSLLSSAPAGRHPFSPS-------------PSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHP
        LEIDENLSYTEQP+       +    R  P    L  +       + P              P +    V +    ++T P  V   PS     L +  P
Subjt:  LEIDENLSYTEQPL-------RCWLERPPPPSPSLLSSAPAGRHPFSPS-------------PSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHP

Query:  DAAVRPSSNRVR----RRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPK---TEPSHEPSQAEDRAEPRAE
        DA     S RV+    RR   + R+  E +V      +                        +++D  F +  R  PK   TE +        RA+    
Subjt:  DAAVRPSSNRVR----RRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPK---TEPSHEPSQAEDRAEPRAE

Query:  SSQVEPRDLTLKFLGPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKY
          +++ +     F   T +           +V+  L    N     G    R Q     Q+  +  EA R +       + ++G  L   T + ++ ++ 
Subjt:  SSQVEPRDLTLKFLGPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKY

Query:  ---IDYSGMSFKEVFRSLKSSLT---SIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLST
           +D   M   E  ++  +      +I A    + AERA TVVTGTL VLGHYAL+LFDSGS HSFI SAFVLHA LEVEPL+HVL VSTPSGE +LS 
Subjt:  ---IDYSGMSFKEVFRSLKSSLT---SIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLST

Query:  EKVKACQIEIVGHVIEVTLLVLDMYHFDVILVMDWLAANHASIDCSR
        EKVKACQIEI GHV EVTLLVLDM  FDVIL MDWL ANHASIDCSR
Subjt:  EKVKACQIEIVGHVIEVTLLVLDMYHFDVILVMDWLAANHASIDCSR

KAA0036202.1 pol protein [Cucumis melo var. makuwa]9.10e-10971.58Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYADVRR+DLEFDV DKVFLKVA MKGVLRFERRGKLSPRFVG FEILERIG +AYRLALP SLSTVHDVFHVSMLRKYV DPSHVVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLRC-------------------WLE--------------RPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRG
        LEIDENLSY EQP+                     W                RPPPPSPSLLSSA A  HP S    L GGTVSVSVVGNLTTPAFVRR 
Subjt:  LEIDENLSYTEQPLRC-------------------WLE--------------RPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRG

Query:  PSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDR-RRGCPDFKLTRPETCFQFDPRFKLQIRV
        PSI ELPLGLTH DAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRR      RGC  FKLTRPET  +F  R   Q+ +
Subjt:  PSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDR-RRGCPDFKLTRPETCFQFDPRFKLQIRV

KAA0066274.1 pol protein [Cucumis melo var. makuwa]8.54e-9760Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYADVRRRDLEFDVGDKVF KVAPMKGVLRFERRGKLSPRFVGPFEILERIG VAYRLALPPSLSTVHDVFHVSMLRKYV DPS VVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLRCWLERPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVV-----GNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVRRRP
        LEIDENLSYTEQP+             L       R+   P   +      V         ++ +P FVRRGPSIS+LPL LTHPDAAVRPSS+RVRRRP
Subjt:  LEIDENLSYTEQPLRCWLERPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVV-----GNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVRRRP

Query:  VALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTASSFG
        VALRRVEAEPVVGL +    P   + +RRR      +   +T F                                  S     RDLTLKFLGP ASSFG
Subjt:  VALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTASSFG

Query:  NSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEV
        NS LYSGGSVE+    GHNQVS KGF TT P IE 
Subjt:  NSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEV

KAA0067060.1 pol protein [Cucumis melo var. makuwa]3.82e-16765.9Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLRC-------------------WLER--------PPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISEL
        LEIDENLSYTEQP+                     W           PPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISEL
Subjt:  LEIDENLSYTEQPLRC-------------------WLER--------PPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISEL

Query:  PLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAE
        PLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCF                                  
Subjt:  PLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAE

Query:  SSQVEPRDLTLKFLGPTASSFGNSYLYSG-----------GSVERGLDR---------------------------------------GHNQVSDKGFPT
              RDLTLKFLGPTASSFGNSYLYSG           G  E   DR                                       G N++    F  
Subjt:  SSQVEPRDLTLKFLGPTASSFGNSYLYSG-----------GSVERGLDR---------------------------------------GHNQVSDKGFPT

Query:  TR---------PQIEVRVQRGADRREAGRMREGHMDA
        TR           I VRVQRGADRREAGRMREGHMDA
Subjt:  TR---------PQIEVRVQRGADRREAGRMREGHMDA

TYK27130.1 pol protein [Cucumis melo var. makuwa]5.58e-10146.88Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYA+VR +DLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFV PFEILE+IG VAYRLALPPSLSTVHDVFHVSMLRKYV DPSHVVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLRCWLERPPPPSPSLLS--SAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVR----RR
        LEIDENLSYTEQP+        P S   +   +  +G+   +  P +    V +    ++T P  V   PS     L +  PDA     S RV+    RR
Subjt:  LEIDENLSYTEQPLRCWLERPPPPSPSLLS--SAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVR----RR

Query:  PVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPK---TEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTA
           + R+  E +V      +                        +++D  F +  R  PK   TE +        RA+      +++ +     F   T 
Subjt:  PVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPK---TEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTA

Query:  SSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKY---IDYSGMSFKEVFRSL
        +           +V+  L    N     G    R Q     Q+  +  EA R +       + ++G  L   T + ++ ++    +D   M   E  ++ 
Subjt:  SSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKY---IDYSGMSFKEVFRSL

Query:  KSSLT---SIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLSTEKVKACQIEIVGHVIEVT
         +      +I A    + AERA TVVTGTL VLGHYAL+LFDSGS HSFI SAFVLHA LEVEPL+HVL VSTPSGE +LS EKVKACQIEI GHV EVT
Subjt:  KSSLT---SIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLSTEKVKACQIEIVGHVIEVT

Query:  LLVLDMYHFDVILVMDWLAANHASIDCSR
        LLVLDM  FDVIL MDWL ANHASIDCSR
Subjt:  LLVLDMYHFDVILVMDWLAANHASIDCSR

TrEMBL top hitse value%identityAlignment
A0A5A7SSB1 Pol protein3.7e-8646.42Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYA+VR +DLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFV PFEILE+IG VAYRLALPPSLSTVHDVFHVSMLRKYV DPSHVVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQP-------LRCWLERPPPPSPSLLSSAPAGRHPFSP-------------SPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHP
        LEIDENLSYTEQP       ++    R  P    L  +       + P              P +    V +    ++T P  V   PS     L +  P
Subjt:  LEIDENLSYTEQP-------LRCWLERPPPPSPSLLSSAPAGRHPFSP-------------SPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHP

Query:  DA--AVRPSSNRVRRRPVALRRV-EAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIR-----VEPKTEPSHEPSQAEDRAEPRA
        DA  +VR      RR    +R V EA    G L   K      +   R  P    T      +F    +L I+      +P T         +   + RA
Subjt:  DA--AVRPSSNRVRRRPVALRRV-EAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIR-----VEPKTEPSHEPSQAEDRAEPRA

Query:  ESSQV---EPRDLTLKFLGPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYR
         SS+V   E R    K      ++ G S   + G    G      ++  K                  R+E   +    M   E       G P      
Subjt:  ESSQV---EPRDLTLKFLGPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYR

Query:  KKKYIDYSGMSFKEVFRSLKSSLTSIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLSTEK
              + G     +F + K+             AERA TVVTGTL VLGHYAL+LFDSGS HSFI SAFVLHA LEVEPL+HVL VSTPSGE +LS EK
Subjt:  KKKYIDYSGMSFKEVFRSLKSSLTSIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLSTEK

Query:  VKACQIEIVGHVIEVTLLVLDMYHFDVILVMDWLAANHASIDCSR
        VKACQIEI GHV EVTLLVLDM  FDVIL MDWL ANHASIDCSR
Subjt:  VKACQIEIVGHVIEVTLLVLDMYHFDVILVMDWLAANHASIDCSR

A0A5A7SYA6 Pol protein4.8e-9473.99Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYADVRR+DLEFDV DKVFLKVA MKGVLRFERRGKLSPRFVG FEILERIG +AYRLALP SLSTVHDVFHVSMLRKYV DPSHVVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLR-------------------CWLE--------------RPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRG
        LEIDENLSY EQP+                     W                RPPPPSPSLLSSA A  HP     SL GGTVSVSVVGNLTTPAFVRR 
Subjt:  LEIDENLSYTEQPLR-------------------CWLE--------------RPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRG

Query:  PSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRI-EDRRRGCPDFKLTRPETCF
        PSI ELPLGLTH DAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRR  +   RGC  FKLTRPET F
Subjt:  PSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRI-EDRRRGCPDFKLTRPETCF

A0A5A7UPE4 Ty3-gypsy retrotransposon protein1.1e-8564.55Show/hide
Query:  QSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKPLEID
        +SRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIG VAYRLALP SLSTVHDVFHVSMLRKYV DPSHVVDY+PLEID
Subjt:  QSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKPLEID

Query:  ENLSYTEQPLRCWLERPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAE
        ENLSYTEQP+              ++             + +     +  +       FV  GPSISE PLGLTH DAAVRPSS+ VRRRPVALRRVEAE
Subjt:  ENLSYTEQPLRCWLERPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAE

Query:  PVVGLLQAAKRPRRRIEDR-RRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTASSFGNSYLYSG
        PVVGLLQAAKR RR+      RGCP FKLTRPET FQ + RFKL                                RDLTLKFLGPTASSFGNS LYSG
Subjt:  PVVGLLQAAKRPRRRIEDR-RRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTASSFGNSYLYSG

A0A5A7VIM5 Pol protein9.5e-13565.9Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLR-------------------CWLE--------RPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISEL
        LEIDENLSYTEQP+                     W           PPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISEL
Subjt:  LEIDENLSYTEQPLR-------------------CWLE--------RPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISEL

Query:  PLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAE
        PLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCF                                  
Subjt:  PLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAE

Query:  SSQVEPRDLTLKFLGPTASSFGNSYLYSG-----------GSVERGLDR---------------------------------------GHNQVSDKGFPT
              RDLTLKFLGPTASSFGNSYLYSG           G  E   DR                                       G N++    F  
Subjt:  SSQVEPRDLTLKFLGPTASSFGNSYLYSG-----------GSVERGLDR---------------------------------------GHNQVSDKGFPT

Query:  TR---------PQIEVRVQRGADRREAGRMREGHMDA
        TR           I VRVQRGADRREAGRMREGHMDA
Subjt:  TR---------PQIEVRVQRGADRREAGRMREGHMDA

A0A5D3DV03 Pol protein2.7e-8947.82Show/hide
Query:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP
        M T QSRQKSYA+VR +DLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFV PFEILE+IG VAYRLALPPSLSTVHDVFHVSMLRKYV DPSHVVDY+P
Subjt:  MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKP

Query:  LEIDENLSYTEQPLRCWLERPPPPSPSLLS--SAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDA--AVRPSSNRVRRRPV
        LEIDENLSYTEQP+        P S   +   +  +G+   +  P +    V +    ++T P  V   PS     L +  PDA  +VR      RR   
Subjt:  LEIDENLSYTEQPLRCWLERPPPPSPSLLS--SAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDA--AVRPSSNRVRRRPV

Query:  ALRRV-EAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIR-----VEPKTEPSHEPSQAEDRAEPRAESSQV---EPRDLTLKFL
         +R V EA    G L   K      +   R  P    T      +F    +L I+      +P T         +   + RA SS+V   E R    K  
Subjt:  ALRRV-EAEPVVGLLQAAKRPRRRIEDRRRGCPDFKLTRPETCFQFDPRFKLQIR-----VEPKTEPSHEPSQAEDRAEPRAESSQV---EPRDLTLKFL

Query:  GPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKYIDYSGMSFKEVFRS
            ++ G S   + G    G      ++  K                  R+E   +    M   E       G P            + G     +F +
Subjt:  GPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKYIDYSGMSFKEVFRS

Query:  LKSSLTSIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLSTEKVKACQIEIVGHVIEVTLL
         K+             AERA TVVTGTL VLGHYAL+LFDSGS HSFI SAFVLHA LEVEPL+HVL VSTPSGE +LS EKVKACQIEI GHV EVTLL
Subjt:  LKSSLTSIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFVLHARLEVEPLYHVLLVSTPSGESLLSTEKVKACQIEIVGHVIEVTLL

Query:  VLDMYHFDVILVMDWLAANHASIDCSR
        VLDM  FDVIL MDWL ANHASIDCSR
Subjt:  VLDMYHFDVILVMDWLAANHASIDCSR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.0e-0536.67Show/hide
Query:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY
        ++T   + K Y D++ +++ EF  GD V +K     G L   +  KL+P F GPF +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein4.0e-0536.67Show/hide
Query:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY
        ++T   + K Y D++ +++ EF  GD V +K     G L   +  KL+P F GPF +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein4.0e-0536.67Show/hide
Query:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY
        ++T   + K Y D++ +++ EF  GD V +K     G L   +  KL+P F GPF +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY

Q1W2L8 Glutamate--cysteine ligase, chloroplastic2.3e-0540.28Show/hide
Query:  NQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDA--FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE
        N    +G P     +   +    D   AG +     D+  FE+YV YAL VP Y +YRKKKYID +GMSF++
Subjt:  NQVSDKGFPTTRPQIEVRVQRGADRREAGRMREGHMDA--FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE

Q9UR07 Transposon Tf2-11 polyprotein4.0e-0536.67Show/hide
Query:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY
        ++T   + K Y D++ +++ EF  GD V +K     G L   +  KL+P F GPF +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  MHTTQSRQKSYADVRRRDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTV-HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
AT4G23100.1 glutamate-cysteine ligase4.1e-0562.5Show/hide
Query:  FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE
        FE+YV YAL VP Y  YRK KYID +GM+F++
Subjt:  FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE

AT4G23100.2 glutamate-cysteine ligase4.1e-0562.5Show/hide
Query:  FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE
        FE+YV YAL VP Y  YRK KYID +GM+F++
Subjt:  FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE

AT4G23100.3 glutamate-cysteine ligase4.1e-0562.5Show/hide
Query:  FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE
        FE+YV YAL VP Y  YRK KYID +GM+F++
Subjt:  FEKYVGYALGVPTYLIYRKKKYIDYSGMSFKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATACCACGCAGAGTAGGCAGAAGAGTTATGCGGATGTGAGACGGAGGGATCTTGAGTTTGATGTGGGGGACAAGGTGTTCTTGAAGGTAGCACCTATGAAAGGTGT
CTTACGATTTGAAAGGAGAGGAAAACTGAGTCCCCGTTTTGTTGGGCCGTTTGAGATTCTAGAGCGGATTGGCCTTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCT
CGACAGTTCATGATGTGTTCCATGTTTCTATGTTGAGGAAGTACGTGGCAGATCCATCCCATGTAGTTGATTACAAGCCACTAGAGATTGATGAAAACTTGAGCTATACT
GAACAACCGTTGAGGTGCTGGCTAGAGAGGCCGCCGCCGCCGTCCCCCAGCCTTCTCTCTTCCGCACCCGCCGGTCGCCATCCCTTCTCTCCGTCTCCCTCTCTCTATGG
TGGTACCGTTTCCGTCTCCGTCGTCGGCAATCTCACCACTCCAGCTTTCGTCCGCCGCGGCCCATCCATTTCCGAACTCCCTCTCGGTCTCACGCACCCGGACGCTGCCG
TGCGTCCTTCGTCCAACCGTGTCCGCCGACGCCCAGTCGCTCTCCGCCGCGTTGAAGCGGAGCCGGTCGTCGGACTCCTCCAAGCCGCGAAGCGTCCCCGTCGTCGAATC
GAAGACCGTCGTCGTGGTTGCCCCGATTTCAAGCTAACCCGACCCGAGACCTGCTTCCAGTTCGACCCGCGATTCAAGCTGCAAATCCGAGTTGAGCCGAAGACTGAGCC
GAGCCACGAGCCAAGTCAAGCCGAAGATCGAGCCGAGCCGCGAGCCGAGTCAAGCCAAGTTGAGCCTAGGGATTTGACGCTCAAGTTCTTGGGTCCCACGGCAAGTTCAT
TTGGGAATAGCTACCTTTACTCGGGTGGATCTGTTGAGCGTGGACTCGATCGAGGGCATAACCAAGTATCAGATAAGGGTTTTCCTACTACTAGACCTCAGATCGAGGTA
AGGGTACAGCGAGGGGCAGACCGACGAGAGGCAGGAAGGATGCGTGAAGGCCATATGGACGCATTTGAGAAGTATGTCGGCTATGCTCTTGGTGTTCCAACGTATTTGAT
ATATCGGAAAAAGAAATATATTGACTATTCTGGAATGTCCTTCAAGGAAGTTTTCCGTTCACTGAAGTCATCATTGACTAGTATTGAAGCACTTCAGTATGCTGAATATG
CTGAGAGAGCAGACACTGTGGTGACAGGTACGCTTCTAGTGTTGGGGCATTATGCCTTAATTCTGTTTGATTCTGGTTCATTACATTCCTTTATATCTTCTGCATTTGTG
TTGCATGCTCGCTTAGAGGTAGAGCCTCTATACCATGTTTTATTAGTATCTACTCCTTCTGGGGAGAGTTTATTGTCGACAGAAAAGGTGAAAGCATGCCAGATTGAGAT
AGTAGGTCATGTGATAGAAGTAACGTTGTTAGTTCTTGATATGTACCACTTTGATGTAATTCTGGTTATGGATTGGCTAGCTGCTAACCATGCTAGCATAGATTGTTCCC
GTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATACCACGCAGAGTAGGCAGAAGAGTTATGCGGATGTGAGACGGAGGGATCTTGAGTTTGATGTGGGGGACAAGGTGTTCTTGAAGGTAGCACCTATGAAAGGTGT
CTTACGATTTGAAAGGAGAGGAAAACTGAGTCCCCGTTTTGTTGGGCCGTTTGAGATTCTAGAGCGGATTGGCCTTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCT
CGACAGTTCATGATGTGTTCCATGTTTCTATGTTGAGGAAGTACGTGGCAGATCCATCCCATGTAGTTGATTACAAGCCACTAGAGATTGATGAAAACTTGAGCTATACT
GAACAACCGTTGAGGTGCTGGCTAGAGAGGCCGCCGCCGCCGTCCCCCAGCCTTCTCTCTTCCGCACCCGCCGGTCGCCATCCCTTCTCTCCGTCTCCCTCTCTCTATGG
TGGTACCGTTTCCGTCTCCGTCGTCGGCAATCTCACCACTCCAGCTTTCGTCCGCCGCGGCCCATCCATTTCCGAACTCCCTCTCGGTCTCACGCACCCGGACGCTGCCG
TGCGTCCTTCGTCCAACCGTGTCCGCCGACGCCCAGTCGCTCTCCGCCGCGTTGAAGCGGAGCCGGTCGTCGGACTCCTCCAAGCCGCGAAGCGTCCCCGTCGTCGAATC
GAAGACCGTCGTCGTGGTTGCCCCGATTTCAAGCTAACCCGACCCGAGACCTGCTTCCAGTTCGACCCGCGATTCAAGCTGCAAATCCGAGTTGAGCCGAAGACTGAGCC
GAGCCACGAGCCAAGTCAAGCCGAAGATCGAGCCGAGCCGCGAGCCGAGTCAAGCCAAGTTGAGCCTAGGGATTTGACGCTCAAGTTCTTGGGTCCCACGGCAAGTTCAT
TTGGGAATAGCTACCTTTACTCGGGTGGATCTGTTGAGCGTGGACTCGATCGAGGGCATAACCAAGTATCAGATAAGGGTTTTCCTACTACTAGACCTCAGATCGAGGTA
AGGGTACAGCGAGGGGCAGACCGACGAGAGGCAGGAAGGATGCGTGAAGGCCATATGGACGCATTTGAGAAGTATGTCGGCTATGCTCTTGGTGTTCCAACGTATTTGAT
ATATCGGAAAAAGAAATATATTGACTATTCTGGAATGTCCTTCAAGGAAGTTTTCCGTTCACTGAAGTCATCATTGACTAGTATTGAAGCACTTCAGTATGCTGAATATG
CTGAGAGAGCAGACACTGTGGTGACAGGTACGCTTCTAGTGTTGGGGCATTATGCCTTAATTCTGTTTGATTCTGGTTCATTACATTCCTTTATATCTTCTGCATTTGTG
TTGCATGCTCGCTTAGAGGTAGAGCCTCTATACCATGTTTTATTAGTATCTACTCCTTCTGGGGAGAGTTTATTGTCGACAGAAAAGGTGAAAGCATGCCAGATTGAGAT
AGTAGGTCATGTGATAGAAGTAACGTTGTTAGTTCTTGATATGTACCACTTTGATGTAATTCTGGTTATGGATTGGCTAGCTGCTAACCATGCTAGCATAGATTGTTCCC
GTTAG
Protein sequenceShow/hide protein sequence
MHTTQSRQKSYADVRRRDLEFDVGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGLVAYRLALPPSLSTVHDVFHVSMLRKYVADPSHVVDYKPLEIDENLSYT
EQPLRCWLERPPPPSPSLLSSAPAGRHPFSPSPSLYGGTVSVSVVGNLTTPAFVRRGPSISELPLGLTHPDAAVRPSSNRVRRRPVALRRVEAEPVVGLLQAAKRPRRRI
EDRRRGCPDFKLTRPETCFQFDPRFKLQIRVEPKTEPSHEPSQAEDRAEPRAESSQVEPRDLTLKFLGPTASSFGNSYLYSGGSVERGLDRGHNQVSDKGFPTTRPQIEV
RVQRGADRREAGRMREGHMDAFEKYVGYALGVPTYLIYRKKKYIDYSGMSFKEVFRSLKSSLTSIEALQYAEYAERADTVVTGTLLVLGHYALILFDSGSLHSFISSAFV
LHARLEVEPLYHVLLVSTPSGESLLSTEKVKACQIEIVGHVIEVTLLVLDMYHFDVILVMDWLAANHASIDCSR