; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0161601 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0161601
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr06:10031259..10032278
RNA-Seq ExpressionCmc06g0161601
SyntenyCmc06g0161601
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035996.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.0e-12870.15Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMARI+IHAK+ PL+FWAEA+N ACHIH R+TTRSG+ VT YELWKGRKPNVKYFHIF  TCYIL DREYH KWD KS+QG+FLG SQNSRAY+VFN 
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDET-HVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVL-----VPSAHVKKSHPSSSTIRDPSAGI
        ++ TVMETIN++VND E    + + E+DET  VT   TSTP D      S+ D   TNSN+  +  + E V+     +PS+HV K+HPSSS I DPSAGI
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDET-HVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVL-----VPSAHVKKSHPSSSTIRDPSAGI

Query:  TTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGV
        TTR+K+K+DY KMI DLCY SAIEPTSVE ALKDEYWIN+MQEEL+QFKRNNVWTLVPKP+G NIIGTKW+FKNKTDES  V RN A LVAQGYAQ+EGV
Subjt:  TTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGV

Query:  DFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        DFDE F P+ RLEAIRLLL ISC  KFKL+QMDVK
Subjt:  DFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.8e-12774.77Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMAR+MIHA N PL F AEAVNT CHI  +                         H  + TCYIL DREYH KWDVKSDQGIFLGYS NSRAYRVFNI
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE
        KS TVME INV+VNDFESN NQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNE VLVPSAHVKK+H SSS + DPSAGITT+ KE
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE

Query:  KVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETF
        K                      NALKDEYWIN MQEELLQFKRNN+WTLVPKPD ANIIGTKWIFKNKTDESESVIRN ARLVAQGYAQV+GVDF++TF
Subjt:  KVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETF

Query:  APIARLEAIRLLLSISCFRKFKLFQMDVK
        AP+ARLEAIRLLLSISCFRKFKLFQMDVK
Subjt:  APIARLEAIRLLLSISCFRKFKLFQMDVK

KAA0045248.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.8e-12188Show/hide
Query:  VTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVLVNDFESNFNQFNIEDDETHV
        VTT+S  T T YELWKG+KPNVKYFHIF STCYIL DREYH KWDVKSDQ IFLGYSQNSRAYRVFNIKS TVME INV+VNDFESN NQFNIEDDETHV
Subjt:  VTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVLVNDFESNFNQFNIEDDETHV

Query:  TPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWI
        TP+VT TPLDEMPKGD QPDSAKTNSNITDEVINNE VL+PSAHVKK+HPSSS I DPS GITTRRKEKVDY KMIADLCYVSAIEPTSVENALKDEYWI
Subjt:  TPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWI

Query:  NSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMAR
        N MQEELLQFK N+VWTLVPKPDGANIIGTKWIFKNKTDE  SVIRN AR
Subjt:  NSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMAR

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]8.1e-15376.8Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMAR+MIHAKN PL FWAEAVNTACHIHNRVTTRSG TVT YEL KGRKPNVKYFHIF STCYIL DREYH KWD KS QGIFLGYSQNSRAYRVFNI
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE
        KS TVMETINV+VNDFESN NQFNIEDDETHVTPEVTSTPLDEMPKG+SQ  SAKT+S+ITDEVINNE +LVPSAHVKK+HPSSS I DPSAGITTRRKE
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE

Query:  KVD------------------------------------YRKM----------IADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKP
         V                                     +RK             DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKP
Subjt:  KVD------------------------------------YRKM----------IADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKP

Query:  DGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        DGANIIGTKWIFKNKTDES S+IRN ARLVAQGY QVEGVD DETFAP+ARLEAIRLLLSISCF+KFKLFQMDVK
Subjt:  DGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.0e-15682.18Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMAR+MIHAKN PL FWAEAVNTACHIHNRVTTRSG TVT YEL KGRKPNVKYFHIF STCYIL DREYH KWD KS QGIFLGYSQNSRAYRVFNI
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE
        KS TVMETINV+VNDFESN NQFNIEDDETHVTPEVTSTPLDEMPKG+SQ  SAKT+S+ITDEVINNE +LVPSAHVKK+HPSSS I DPSAGITTRRKE
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE

Query:  KVDYRKMIA-------------------DLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMA
         ++    I+                   DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDES S+IRN A
Subjt:  KVDYRKMIA-------------------DLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMA

Query:  RLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        RLVAQGY QVEGVD DETFAP+ARLEAIRLLLSISCF+KFKLFQMDVK
Subjt:  RLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

TrEMBL top hitse value%identityAlignment
A0A5A7T197 Gag-pol polyprotein4.4e-12870.15Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMARI+IHAK+ PL+FWAEA+N ACHIH R+TTRSG+ VT YELWKGRKPNVKYFHIF  TCYIL DREYH KWD KS+QG+FLG SQNSRAY+VFN 
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDET-HVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVL-----VPSAHVKKSHPSSSTIRDPSAGI
        ++ TVMETIN++VND E    + + E+DET  VT   TSTP D      S+ D   TNSN+  +  + E V+     +PS+HV K+HPSSS I DPSAGI
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDET-HVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVL-----VPSAHVKKSHPSSSTIRDPSAGI

Query:  TTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGV
        TTR+K+K+DY KMI DLCY SAIEPTSVE ALKDEYWIN+MQEEL+QFKRNNVWTLVPKP+G NIIGTKW+FKNKTDES  V RN A LVAQGYAQ+EGV
Subjt:  TTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGV

Query:  DFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        DFDE F P+ RLEAIRLLL ISC  KFKL+QMDVK
Subjt:  DFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

A0A5A7TQU8 Gag-pol polyprotein2.3e-12188Show/hide
Query:  VTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVLVNDFESNFNQFNIEDDETHV
        VTT+S  T T YELWKG+KPNVKYFHIF STCYIL DREYH KWDVKSDQ IFLGYSQNSRAYRVFNIKS TVME INV+VNDFESN NQFNIEDDETHV
Subjt:  VTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETINVLVNDFESNFNQFNIEDDETHV

Query:  TPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWI
        TP+VT TPLDEMPKGD QPDSAKTNSNITDEVINNE VL+PSAHVKK+HPSSS I DPS GITTRRKEKVDY KMIADLCYVSAIEPTSVENALKDEYWI
Subjt:  TPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWI

Query:  NSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMAR
        N MQEELLQFK N+VWTLVPKPDGANIIGTKWIFKNKTDE  SVIRN AR
Subjt:  NSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMAR

A0A5A7U931 Gag-pol polyprotein3.9e-15376.8Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMAR+MIHAKN PL FWAEAVNTACHIHNRVTTRSG TVT YEL KGRKPNVKYFHIF STCYIL DREYH KWD KS QGIFLGYSQNSRAYRVFNI
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE
        KS TVMETINV+VNDFESN NQFNIEDDETHVTPEVTSTPLDEMPKG+SQ  SAKT+S+ITDEVINNE +LVPSAHVKK+HPSSS I DPSAGITTRRKE
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE

Query:  KVD------------------------------------YRKM----------IADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKP
         V                                     +RK             DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKP
Subjt:  KVD------------------------------------YRKM----------IADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKP

Query:  DGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        DGANIIGTKWIFKNKTDES S+IRN ARLVAQGY QVEGVD DETFAP+ARLEAIRLLLSISCF+KFKLFQMDVK
Subjt:  DGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

A0A5D3DCZ8 Gag-pol polyprotein2.9e-15682.18Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMAR+MIHAKN PL FWAEAVNTACHIHNRVTTRSG TVT YEL KGRKPNVKYFHIF STCYIL DREYH KWD KS QGIFLGYSQNSRAYRVFNI
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE
        KS TVMETINV+VNDFESN NQFNIEDDETHVTPEVTSTPLDEMPKG+SQ  SAKT+S+ITDEVINNE +LVPSAHVKK+HPSSS I DPSAGITTRRKE
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE

Query:  KVDYRKMIA-------------------DLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMA
         ++    I+                   DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDES S+IRN A
Subjt:  KVDYRKMIA-------------------DLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMA

Query:  RLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        RLVAQGY QVEGVD DETFAP+ARLEAIRLLLSISCF+KFKLFQMDVK
Subjt:  RLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

A0A5D3DSN1 Gag-pol polyprotein2.8e-12774.77Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI
        LQEMAR+MIHA N PL F AEAVNT CHI  +                         H  + TCYIL DREYH KWDVKSDQGIFLGYS NSRAYRVFNI
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNI

Query:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE
        KS TVME INV+VNDFESN NQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNE VLVPSAHVKK+H SSS + DPSAGITT+ KE
Subjt:  KSRTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKE

Query:  KVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETF
        K                      NALKDEYWIN MQEELLQFKRNN+WTLVPKPD ANIIGTKWIFKNKTDESESVIRN ARLVAQGYAQV+GVDF++TF
Subjt:  KVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETF

Query:  APIARLEAIRLLLSISCFRKFKLFQMDVK
        AP+ARLEAIRLLLSISCFRKFKLFQMDVK
Subjt:  APIARLEAIRLLLSISCFRKFKLFQMDVK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-2426.24Show/hide
Query:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRS--GTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNS-RAYRV
        + E AR M+        FW EAV TA ++ NR+ +R+   ++ T YE+W  +KP +K+  +F +T Y+ + +    K+D KS + IF+GY  N  + +  
Subjt:  LQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRS--GTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNS-RAYRV

Query:  FN---IKSRTVMETINVLVND----FESNFNQFNIEDDETHVTPEVTSTPLDEMPK-----------------------GDSQ-------PDSAKTNSNI
         N   I +R V+     +VN     FE+ F + + E +  +   +       E P                         DS+       P+ +K   NI
Subjt:  FN---IKSRTVMETINVLVND----FESNFNQFNIEDDETHVTPEVTSTPLDEMPK-----------------------GDSQ-------PDSAKTNSNI

Query:  ---TDEVINNEIVLVPSAHVKK----------------------SHPSSSTIRDPSAG----ITTRRKEKV-------------DYRKMIADLCYVSAIE
            D   +N+  L  S   K+                       H     I +P+      I  RR E++                K++ +   +    
Subjt:  ---TDEVINNEIVLVPSAHVKK----------------------SHPSSSTIRDPSAG----ITTRRKEKV-------------DYRKMIADLCYVSAIE

Query:  PTSV-ENALKDE--YWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSI
        P S  E   +D+   W  ++  EL   K NN WT+  +P+  NI+ ++W+F  K +E  + IR  ARLVA+G+ Q   +D++ETFAP+AR+ + R +LS+
Subjt:  PTSV-ENALKDE--YWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSI

Query:  SCFRKFKLFQMDVKKCLPKWILK
              K+ QMDVK       LK
Subjt:  SCFRKFKLFQMDVKKCLPKWILK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-2726.32Show/hide
Query:  EMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKS
        E  R M+     P  FW EAV TAC++ NR  +          +W  ++ +  +  +F    +  V +E  +K D KS   IF+GY      YR+++   
Subjt:  EMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKS

Query:  RTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTP------------LDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDP
        + V+ + +V+  + E        E  +  + P   + P             DE+ +   QP          DE +  E V  P+   ++  P     R  
Subjt:  RTVMETINVLVNDFESNFNQFNIEDDETHVTPEVTSTP------------LDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDP

Query:  SAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENAL---KDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQG
           + +RR    +Y  +  D       EP S++  L   +    + +MQEE+   ++N  + LV  P G   +  KW+FK K D    ++R  ARLV +G
Subjt:  SAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENAL---KDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQG

Query:  YAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
        + Q +G+DFDE F+P+ ++ +IR +LS++     ++ Q+DVK
Subjt:  YAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK

P92520 Uncharacterized mitochondrial protein AtMg008203.2e-1947.47Show/hide
Query:  EPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSIS
        EP SV  ALKD  W  +MQEEL    RN  W LVP P   NI+G KW+FK K     ++ R  ARLVA+G+ Q EG+ F ET++P+ R   IR +L+++
Subjt:  EPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-1741.07Show/hide
Query:  EPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDG-ANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSIS
        EP +   ALKDE W N+M  E+     N+ W LVP P     I+G +WIF  K +   S+ R  ARLVA+GY Q  G+D+ ETF+P+ +  +IR++L ++
Subjt:  EPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDG-ANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSIS

Query:  CFRKFKLFQMDV
          R + + Q+DV
Subjt:  CFRKFKLFQMDV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-1635Show/hide
Query:  ITTRRKEKVDYRKMIADLCYVSAI----EPTSVENALKDEYWINSMQEELLQFKRNNVWTLV-PKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGY
        + TR K+ +  RK      Y +++    EP +   A+KD+ W  +M  E+     N+ W LV P P    I+G +WIF  K +   S+ R  ARLVA+GY
Subjt:  ITTRRKEKVDYRKMIADLCYVSAI----EPTSVENALKDEYWINSMQEELLQFKRNNVWTLV-PKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGY

Query:  AQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDV
         Q  G+D+ ETF+P+ +  +IR++L ++  R + + Q+DV
Subjt:  AQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.3e-1932.19Show/hide
Query:  SSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMAR
        +S TI D S  ++  +   + +  ++   C   A EP++   A +   W  +M +E+   +  + W +   P     IG KW++K K +   ++ R  AR
Subjt:  SSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMAR

Query:  LVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDV
        LVA+GY Q EG+DF ETF+P+ +L +++L+L+IS    F L Q+D+
Subjt:  LVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.3e-2047.47Show/hide
Query:  EPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSIS
        EP SV  ALKD  W  +MQEEL    RN  W LVP P   NI+G KW+FK K     ++ R  ARLVA+G+ Q EG+ F ET++P+ R   IR +L+++
Subjt:  EPTSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACAAGAAATGGCTCGAATTATGATACATGCCAAAAATTTTCCTTTGTATTTTTGGGCAGAAGCTGTAAACACAGCATGTCATATTCACAACAGAGTCACTACACG
ATCTGGTACGACAGTTACATCGTATGAATTATGGAAGGGACGGAAACCAAATGTTAAGTATTTTCATATTTTTTTAAGTACTTGTTATATTTTGGTCGATCGAGAGTATC
ATTCCAAGTGGGATGTGAAATCTGATCAAGGGATCTTTCTTGGTTATTCTCAGAATAGTCGAGCGTATAGAGTCTTTAATATTAAATCCAGAACAGTCATGGAAACAATC
AATGTTTTGGTTAATGATTTTGAGTCTAATTTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTTACACCTGAAGTTACTTCTACGCCTCTTGATGAAATGCCTAA
AGGAGATTCGCAGCCAGACAGTGCTAAGACCAATTCAAACATAACTGATGAGGTCATAAACAATGAAATTGTGCTTGTCCCCTCTGCACATGTGAAAAAGAGTCATCCAT
CAAGTTCCACAATAAGGGATCCGTCAGCTGGAATTACTACCAGAAGAAAAGAAAAGGTAGATTACAGGAAAATGATTGCTGATTTATGCTATGTATCAGCAATAGAACCC
ACATCTGTTGAGAATGCTCTCAAGGATGAATACTGGATAAATTCCATGCAAGAAGAGTTATTACAGTTCAAGCGTAACAACGTTTGGACTTTGGTTCCTAAACCTGATGG
AGCGAACATCATAGGAACTAAGTGGATCTTTAAAAATAAAACTGATGAATCTGAGAGTGTAATAAGGAACATGGCCCGTTTGGTGGCTCAAGGTTATGCACAGGTAGAAG
GTGTTGATTTTGATGAAACTTTTGCACCTATCGCTAGACTTGAAGCTATTCGCCTCTTGCTCAGTATATCATGTTTCCGAAAATTTAAATTGTTTCAAATGGACGTTAAA
AAGTGCCTTCCTAAATGGATACTTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTACAAGAAATGGCTCGAATTATGATACATGCCAAAAATTTTCCTTTGTATTTTTGGGCAGAAGCTGTAAACACAGCATGTCATATTCACAACAGAGTCACTACACG
ATCTGGTACGACAGTTACATCGTATGAATTATGGAAGGGACGGAAACCAAATGTTAAGTATTTTCATATTTTTTTAAGTACTTGTTATATTTTGGTCGATCGAGAGTATC
ATTCCAAGTGGGATGTGAAATCTGATCAAGGGATCTTTCTTGGTTATTCTCAGAATAGTCGAGCGTATAGAGTCTTTAATATTAAATCCAGAACAGTCATGGAAACAATC
AATGTTTTGGTTAATGATTTTGAGTCTAATTTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTTACACCTGAAGTTACTTCTACGCCTCTTGATGAAATGCCTAA
AGGAGATTCGCAGCCAGACAGTGCTAAGACCAATTCAAACATAACTGATGAGGTCATAAACAATGAAATTGTGCTTGTCCCCTCTGCACATGTGAAAAAGAGTCATCCAT
CAAGTTCCACAATAAGGGATCCGTCAGCTGGAATTACTACCAGAAGAAAAGAAAAGGTAGATTACAGGAAAATGATTGCTGATTTATGCTATGTATCAGCAATAGAACCC
ACATCTGTTGAGAATGCTCTCAAGGATGAATACTGGATAAATTCCATGCAAGAAGAGTTATTACAGTTCAAGCGTAACAACGTTTGGACTTTGGTTCCTAAACCTGATGG
AGCGAACATCATAGGAACTAAGTGGATCTTTAAAAATAAAACTGATGAATCTGAGAGTGTAATAAGGAACATGGCCCGTTTGGTGGCTCAAGGTTATGCACAGGTAGAAG
GTGTTGATTTTGATGAAACTTTTGCACCTATCGCTAGACTTGAAGCTATTCGCCTCTTGCTCAGTATATCATGTTTCCGAAAATTTAAATTGTTTCAAATGGACGTTAAA
AAGTGCCTTCCTAAATGGATACTTAAATGA
Protein sequenceShow/hide protein sequence
MLQEMARIMIHAKNFPLYFWAEAVNTACHIHNRVTTRSGTTVTSYELWKGRKPNVKYFHIFLSTCYILVDREYHSKWDVKSDQGIFLGYSQNSRAYRVFNIKSRTVMETI
NVLVNDFESNFNQFNIEDDETHVTPEVTSTPLDEMPKGDSQPDSAKTNSNITDEVINNEIVLVPSAHVKKSHPSSSTIRDPSAGITTRRKEKVDYRKMIADLCYVSAIEP
TSVENALKDEYWINSMQEELLQFKRNNVWTLVPKPDGANIIGTKWIFKNKTDESESVIRNMARLVAQGYAQVEGVDFDETFAPIARLEAIRLLLSISCFRKFKLFQMDVK
KCLPKWILK