; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0165481 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0165481
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr06:17330043..17330813
RNA-Seq ExpressionCmc06g0165481
SyntenyCmc06g0165481
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.5e-12487.5Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        ++ ELLQFRRNNVWTL+SKPEGVNVI TKW+FKNK DE GC+TKNK RLVAQGYTQVEG+DFDETFAP+ARL+AIRLLL ISCIQKFKLYQMDVKSA LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
         YLNEE+YVAQPKGFVD EHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GY + EIDKT FIHRKSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
        SEFEMS VGELSCFL LQIKQKND IFISQEKY +NMVKKF LEQARNK T AATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.4e-12889.84Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEELLQFRRNNVWTL+SKPEGVNVI TKW+FKNKTDE GC+TKNK RLVAQGYTQVEG+DFDETFAP+ARLEAIRLLL ISCIQKFKLYQ+DVKS  LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
        GYLNEE+YVAQPKGFVD EHPKHVYKLNKALYGLKQA RAWYDRLTVYLRG+GYS+ EIDK LFIHRKSDQLLVAQIYVDDIIFGGFP DL+NNFINIMQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
        SEFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVKKFGL+QARNK TPAATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.3e-12287.89Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEELLQFR+NNVWTL+SKPEGVNVI TKWVFKNKTDE GC+TKNK +LVAQGYTQVEGIDFDETFA +ARLEAIRLLL ISCIQKFKLYQMDVKSA L+
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
        GYLNEE+YVAQPKGFVD EHPKH+YKLNKALYGLKQA RAWYD+LTVYLRGKGYS+ EIDKTLFI RKSDQLLVAQIYVDDIIF GFP DLVNNFI    
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
         EFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVKKFGLEQARNK TPAATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.1e-12089.75Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEELLQF+RNNVWTL+ KPEGVNVI TKWVFKNKTDE GC+TKNK RLVAQGYTQVEGIDFDETF+P+ARLEAIRLLL ISCIQKFKLYQMDVKSA LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
        GYLNEE+YVAQPK FVD EH KHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYS+ EIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQ LVNNFI +MQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLE
        SEFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVKKFG +
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLE

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.0e-12387.11Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        ++ ELLQFRRNNVWTL+SKPEGVNVI TKW+FKNK DE GC+TKNK RLVAQGYTQVEG+DFDETFAP+ARL+AIRLLL ISCIQKFKLYQMDVKSA LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
         YLNEE+YVAQPKGFVD EHPKHVYKLNKALYGLKQAPRAWYDRLT YLRG+GY + EIDKT FIHRKSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
        SEFEMS VGELSCFL LQIKQKND IFISQEKY +NMVKKF LEQARNK T AATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

TrEMBL top hitse value%identityAlignment
A0A5A7SN07 Gag-pol polyprotein1.7e-12487.5Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        ++ ELLQFRRNNVWTL+SKPEGVNVI TKW+FKNK DE GC+TKNK RLVAQGYTQVEG+DFDETFAP+ARL+AIRLLL ISCIQKFKLYQMDVKSA LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
         YLNEE+YVAQPKGFVD EHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GY + EIDKT FIHRKSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
        SEFEMS VGELSCFL LQIKQKND IFISQEKY +NMVKKF LEQARNK T AATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

A0A5D3BJA9 Gag-pol polyprotein5.1e-12189.75Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEELLQF+RNNVWTL+ KPEGVNVI TKWVFKNKTDE GC+TKNK RLVAQGYTQVEGIDFDETF+P+ARLEAIRLLL ISCIQKFKLYQMDVKSA LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
        GYLNEE+YVAQPK FVD EH KHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYS+ EIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQ LVNNFI +MQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLE
        SEFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVKKFG +
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLE

A0A5D3BPB3 Gag-pol polyprotein6.6e-12989.84Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEELLQFRRNNVWTL+SKPEGVNVI TKW+FKNKTDE GC+TKNK RLVAQGYTQVEG+DFDETFAP+ARLEAIRLLL ISCIQKFKLYQ+DVKS  LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
        GYLNEE+YVAQPKGFVD EHPKHVYKLNKALYGLKQA RAWYDRLTVYLRG+GYS+ EIDK LFIHRKSDQLLVAQIYVDDIIFGGFP DL+NNFINIMQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
        SEFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVKKFGL+QARNK TPAATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

A0A5D3CJ17 Gag-pol polyprotein4.9e-12487.11Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        ++ ELLQFRRNNVWTL+SKPEGVNVI TKW+FKNK DE GC+TKNK RLVAQGYTQVEG+DFDETFAP+ARL+AIRLLL ISCIQKFKLYQMDVKSA LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
         YLNEE+YVAQPKGFVD EHPKHVYKLNKALYGLKQAPRAWYDRLT YLRG+GY + EIDKT FIHRKSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQ
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
        SEFEMS VGELSCFL LQIKQKND IFISQEKY +NMVKKF LEQARNK T AATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

A0A5D3CXU0 Gag-pol polyprotein2.1e-12287.89Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEELLQFR+NNVWTL+SKPEGVNVI TKWVFKNKTDE GC+TKNK +LVAQGYTQVEGIDFDETFA +ARLEAIRLLL ISCIQKFKLYQMDVKSA L+
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
        GYLNEE+YVAQPKGFVD EHPKH+YKLNKALYGLKQA RAWYD+LTVYLRGKGYS+ EIDKTLFI RKSDQLLVAQIYVDDIIF GFP DLVNNFI    
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ

Query:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
         EFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVKKFGLEQARNK TPAATH
Subjt:  SEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-4537.05Show/hide
Query:  ELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYL
        EL   + NN WT+  +PE  N++D++WVF  K +E+G   + K RLVA+G+TQ   ID++ETFAP+AR+ + R +LS+      K++QMDVK+A LNG L
Subjt:  ELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYL

Query:  NEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKS--DQLLVAQIYVDDIIFGGFPQDLVNNFINIMQS
         EE+Y+  P+G     +  +V KLNKA+YGLKQA R W++     L+   +    +D+ ++I  K   ++ +   +YVDD++        +NNF   +  
Subjt:  NEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKS--DQLLVAQIYVDDIIFGGFPQDLVNNFINIMQS

Query:  EFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTP
        +F M+ + E+  F+ ++I+ + D I++SQ  YV+ ++ KF +E      TP
Subjt:  EFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTP

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein1.2e-1527.16Show/hide
Query:  VIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIA----RLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNEELYVAQPKGFVDFEH
        ++ T  +F  K + +      K R+V +G TQ       +T++ I         I++ L I+  +   +  +D+  A L   L EE+Y+  P        
Subjt:  VIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIA----RLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNEELYVAQPKGFVDFEH

Query:  PKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGEL--------S
         + V KLNKALYGLKQ+P+ W D L  YL G G   ++   T  +++  D+ L+  +YVDD +     +  ++ FIN ++S FE+ + G L         
Subjt:  PKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGEL--------S

Query:  CFLELQIKQKNDVIFISQEKYVRNMVKKFGLE
          ++L   ++   I ++ + ++  M KK+  E
Subjt:  CFLELQIKQKNDVIFISQEKYVRNMVKKFGLE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-4235.14Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        MQEE+   ++N  + L+  P+G   +  KWVFK K D    + + K RLV +G+ Q +GIDFDE F+P+ ++ +IR +LS++     ++ Q+DVK+A L+
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSD-QLLVAQIYVDDIIFGGFPQDLVNNFINIM
        G L EE+Y+ QP+GF        V KLNK+LYGLKQAPR WY +   +++ + Y K   D  ++  R S+   ++  +YVDD++  G  + L+      +
Subjt:  GYLNEELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSD-QLLVAQIYVDDIIFGGFPQDLVNNFINIM

Query:  QSEFEMSMVGELSCFLELQI--KQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH
           F+M  +G     L ++I  ++ +  +++SQEKY+  ++++F ++ A+   TP A H
Subjt:  QSEFEMSMVGELSCFLELQI--KQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-4234.69Show/hide
Query:  NNVWTLMSKPEG-VNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNEELYV
        N+ W L+  P   V ++  +W+F  K +  G + + K RLVA+GY Q  G+D+ ETF+P+ +  +IR++L ++  + + + Q+DV +A L G L +++Y+
Subjt:  NNVWTLMSKPEG-VNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNEELYV

Query:  AQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVG
        +QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L++N ++ +   F +    
Subjt:  AQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAA
        EL  FL ++ K+    + +SQ +Y+ +++ +  +  A+   TP A
Subjt:  ELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-4134.15Show/hide
Query:  NNVWTLM-SKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNEELYV
        N+ W L+   P  V ++  +W+F  K +  G + + K RLVA+GY Q  G+D+ ETF+P+ +  +IR++L ++  + + + Q+DV +A L G L +E+Y+
Subjt:  NNVWTLM-SKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNEELYV

Query:  AQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVG
        +QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L+ + ++ +   F +    
Subjt:  AQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVG

Query:  ELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAAT
        +L  FL ++ K+    + +SQ +Y  +++ +  +  A+   TP AT
Subjt:  ELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAAT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-4037.25Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN
        M +E+      + W + + P     I  KWV+K K +  G + + K RLVA+GYTQ EGIDF ETF+P+ +L +++L+L+IS I  F L+Q+D+ +A LN
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLN

Query:  GYLNEELYVAQPKGFV----DFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFI
        G L+EE+Y+  P G+     D   P  V  L K++YGLKQA R W+ + +V L G G+ +   D T F+   +   L   +YVDDII        V+   
Subjt:  GYLNEELYVAQPKGFV----DFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFI

Query:  NIMQSEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGL
        + ++S F++  +G L  FL L+I +    I I Q KY  +++ + GL
Subjt:  NIMQSEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGL

ATMG00810.1 DNA/RNA polymerases superfamily protein3.5e-0534.21Show/hide
Query:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTP
        +YVDDI+  G    L+N  I  + S F M  +G +  FL +QIK     +F+SQ KY   ++   G+   +   TP
Subjt:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.4e-1443.9Show/hide
Query:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSIS
        MQEEL    RN  W L+  P   N++  KWVFK K    G + + K RLVA+G+ Q EGI F ET++P+ R   IR +L+++
Subjt:  MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATGTCTGGACATTAATGTCAAAACCAGAAGGTGTAAACGTTATCGATACTAAATGGGTGTTCAAAAATAAA
ACTGATGAAGTTGGATGTATGACGAAAAATAAAGTCAGATTAGTGGCTCAAGGGTATACTCAAGTTGAAGGTATTGATTTTGATGAAACGTTTGCTCCTATTGCT
CGTCTTGAAGCCATTCGATTATTACTTAGTATATCATGCATACAGAAATTTAAATTGTATCAGATGGATGTAAAGAGTGCCCTCTTAAATGGATATTTGAATGAG
GAGCTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTTCGAGCACCCGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCT
TGGTATGATCGGCTAACTGTGTACTTGAGAGGTAAAGGATATTCCAAAAGAGAAATTGACAAGACCTTGTTCATACACAGGAAATCTGATCAACTTTTGGTTGCT
CAAATTTATGTTGATGACATCATTTTTGGAGGGTTTCCTCAAGATCTAGTAAATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGCATGGTTGGAGAA
CTTTCATGTTTTCTGGAACTTCAAATTAAGCAAAAGAATGATGTCATCTTCATATCTCAGGAAAAGTATGTCAGGAATATGGTCAAAAAGTTTGGTTTGGAACAG
GCTCGAAATAAGTGGACTCCAGCTGCGACACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATGTCTGGACATTAATGTCAAAACCAGAAGGTGTAAACGTTATCGATACTAAATGGGTGTTCAAAAATAAA
ACTGATGAAGTTGGATGTATGACGAAAAATAAAGTCAGATTAGTGGCTCAAGGGTATACTCAAGTTGAAGGTATTGATTTTGATGAAACGTTTGCTCCTATTGCT
CGTCTTGAAGCCATTCGATTATTACTTAGTATATCATGCATACAGAAATTTAAATTGTATCAGATGGATGTAAAGAGTGCCCTCTTAAATGGATATTTGAATGAG
GAGCTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTTCGAGCACCCGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCT
TGGTATGATCGGCTAACTGTGTACTTGAGAGGTAAAGGATATTCCAAAAGAGAAATTGACAAGACCTTGTTCATACACAGGAAATCTGATCAACTTTTGGTTGCT
CAAATTTATGTTGATGACATCATTTTTGGAGGGTTTCCTCAAGATCTAGTAAATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGCATGGTTGGAGAA
CTTTCATGTTTTCTGGAACTTCAAATTAAGCAAAAGAATGATGTCATCTTCATATCTCAGGAAAAGTATGTCAGGAATATGGTCAAAAAGTTTGGTTTGGAACAG
GCTCGAAATAAGTGGACTCCAGCTGCGACACATTAG
Protein sequenceShow/hide protein sequence
MQEELLQFRRNNVWTLMSKPEGVNVIDTKWVFKNKTDEVGCMTKNKVRLVAQGYTQVEGIDFDETFAPIARLEAIRLLLSISCIQKFKLYQMDVKSALLNGYLNE
ELYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSKREIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGE
LSCFLELQIKQKNDVIFISQEKYVRNMVKKFGLEQARNKWTPAATH