; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0169901 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0169901
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr06:24644647..24645698
RNA-Seq ExpressionCmc06g0169901
SyntenyCmc06g0169901
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.8e-14683.02Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL
        MQTRRK+KIDY+KM +  F        +   +GVNVIGTKW+FKNKIDE  CVTKNK RLVAQGYTQVEG+DFDETFAPVA+L+AIRLLLGISCIQKFKL
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL

Query:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ
        YQMDVKSAFLN YLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT+YLR +GY   + DKT FIHRKSDQLLVAQIYVDDIIFGGF  
Subjt:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ

Query:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD
        DLVNNFINIMQSEFEMS VGELSCF GLQIKQKNDDIFISQEKYAKNMVKKF LEQARNKRT AATHVKLT DT+G EVDHKLYRSIVG+LLYLT SRPD
Subjt:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD

Query:  IAYAVEICARYQVDPRISHLEAVK
        IAY V ICA YQ DPRI+HLEAVK
Subjt:  IAYAVEICARYQVDPRISHLEAVK

KAA0033021.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.4e-13078.15Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK
        MQTRRKEKIDYMKMV DL   I T +   V        ++      DE+    +N V  LV++     V GIDFDETFAPVA+LEAIRLLLGISCIQKFK
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK

Query:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS
        LYQMDVKSAFLNGYLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT+YLR KGYS G+ DKTLFIHRKSDQLLVAQIYVDDIIFG F 
Subjt:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS

Query:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP
        QDL+NNFINIMQSE EMSMVGELSCF GLQIKQKNDDI ISQEKYA+NMVKKFGLEQARNKRTPAATHVKLT +T+G EVDHKLY+SIVG+LLYLTASRP
Subjt:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP

Query:  DIAYAVEICARYQVDPRISHLEAVK
        DIAY V I ARYQ DPRI+HLE VK
Subjt:  DIAYAVEICARYQVDPRISHLEAVK

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.9e-14387.37Show/hide
Query:  KGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEH
        +GVNVIGTKW+FKNK DE  CVTKNK RLVAQGYTQVEG+DFDETFAPVA+LEAIRLLLGISCIQKFKLYQ+DVKS FLNGYLNEEVYVAQ KGFVDSEH
Subjt:  KGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEH

Query:  PKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIK
        PKHV+KLNK LYGLKQA RAWYDRLT+YLR +GYS G+ DK LFIHRKSDQLLVAQIYVDDIIFGGF  DL+NNFINIMQSEFEMSMVGELSCF GLQIK
Subjt:  PKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIK

Query:  QKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK
        QKND IFISQEKYA+NMVKKFGL+QARNKRTPAATHVKLT DT+G EVDHKLYRSIVGSLLYLTASRPDIAY V ICARYQ DPRI+ LE VK
Subjt:  QKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK

TYK03438.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.4e-13078.15Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK
        MQTRRKEKIDYMKMV DL   I T +   V        ++      DE+    +N V  LV++     V GIDFDETFAPVA+LEAIRLLLGISCIQKFK
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK

Query:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS
        LYQMDVKSAFLNGYLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT+YLR KGYS G+ DKTLFIHRKSDQLLVAQIYVDDIIFG F 
Subjt:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS

Query:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP
        QDL+NNFINIMQSE EMSMVGELSCF GLQIKQKNDDI ISQEKYA+NMVKKFGLEQARNKRTPAATHVKLT +T+G EVDHKLY+SIVG+LLYLTASRP
Subjt:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP

Query:  DIAYAVEICARYQVDPRISHLEAVK
        DIAY V I ARYQ DPRI+HLE VK
Subjt:  DIAYAVEICARYQVDPRISHLEAVK

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.4e-14683.02Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL
        MQTRRK+KIDY+KM +  F        +   +GVNVIGTKW+FKNKIDE  CVTKNK RLVAQGYTQVEG+DFDETFAPVA+L+AIRLLLGISCIQKFKL
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL

Query:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ
        YQMDVKSAFLN YLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT YLR +GY   + DKT FIHRKSDQLLVAQIYVDDIIFGGF  
Subjt:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ

Query:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD
        DLVNNFINIMQSEFEMS VGELSCF GLQIKQKNDDIFISQEKYAKNMVKKF LEQARNKRT AATHVKLT DT+G EVDHKLYRSIVG+LLYLT SRPD
Subjt:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD

Query:  IAYAVEICARYQVDPRISHLEAVK
        IAY V ICA YQ DPRI+HLEAVK
Subjt:  IAYAVEICARYQVDPRISHLEAVK

TrEMBL top hitse value%identityAlignment
A0A5A7SN07 Gag-pol polyprotein8.9e-14783.02Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL
        MQTRRK+KIDY+KM +  F        +   +GVNVIGTKW+FKNKIDE  CVTKNK RLVAQGYTQVEG+DFDETFAPVA+L+AIRLLLGISCIQKFKL
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL

Query:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ
        YQMDVKSAFLN YLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT+YLR +GY   + DKT FIHRKSDQLLVAQIYVDDIIFGGF  
Subjt:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ

Query:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD
        DLVNNFINIMQSEFEMS VGELSCF GLQIKQKNDDIFISQEKYAKNMVKKF LEQARNKRT AATHVKLT DT+G EVDHKLYRSIVG+LLYLT SRPD
Subjt:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD

Query:  IAYAVEICARYQVDPRISHLEAVK
        IAY V ICA YQ DPRI+HLEAVK
Subjt:  IAYAVEICARYQVDPRISHLEAVK

A0A5A7SU23 Gag-pol polyprotein1.2e-13078.15Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK
        MQTRRKEKIDYMKMV DL   I T +   V        ++      DE+    +N V  LV++     V GIDFDETFAPVA+LEAIRLLLGISCIQKFK
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK

Query:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS
        LYQMDVKSAFLNGYLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT+YLR KGYS G+ DKTLFIHRKSDQLLVAQIYVDDIIFG F 
Subjt:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS

Query:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP
        QDL+NNFINIMQSE EMSMVGELSCF GLQIKQKNDDI ISQEKYA+NMVKKFGLEQARNKRTPAATHVKLT +T+G EVDHKLY+SIVG+LLYLTASRP
Subjt:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP

Query:  DIAYAVEICARYQVDPRISHLEAVK
        DIAY V I ARYQ DPRI+HLE VK
Subjt:  DIAYAVEICARYQVDPRISHLEAVK

A0A5D3BPB3 Gag-pol polyprotein9.2e-14487.37Show/hide
Query:  KGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEH
        +GVNVIGTKW+FKNK DE  CVTKNK RLVAQGYTQVEG+DFDETFAPVA+LEAIRLLLGISCIQKFKLYQ+DVKS FLNGYLNEEVYVAQ KGFVDSEH
Subjt:  KGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEH

Query:  PKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIK
        PKHV+KLNK LYGLKQA RAWYDRLT+YLR +GYS G+ DK LFIHRKSDQLLVAQIYVDDIIFGGF  DL+NNFINIMQSEFEMSMVGELSCF GLQIK
Subjt:  PKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIK

Query:  QKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK
        QKND IFISQEKYA+NMVKKFGL+QARNKRTPAATHVKLT DT+G EVDHKLYRSIVGSLLYLTASRPDIAY V ICARYQ DPRI+ LE VK
Subjt:  QKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK

A0A5D3BWU5 Gag-pol polyprotein1.2e-13078.15Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK
        MQTRRKEKIDYMKMV DL   I T +   V        ++      DE+    +N V  LV++     V GIDFDETFAPVA+LEAIRLLLGISCIQKFK
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKV-RLVAQGY-TQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFK

Query:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS
        LYQMDVKSAFLNGYLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT+YLR KGYS G+ DKTLFIHRKSDQLLVAQIYVDDIIFG F 
Subjt:  LYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFS

Query:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP
        QDL+NNFINIMQSE EMSMVGELSCF GLQIKQKNDDI ISQEKYA+NMVKKFGLEQARNKRTPAATHVKLT +T+G EVDHKLY+SIVG+LLYLTASRP
Subjt:  QDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRP

Query:  DIAYAVEICARYQVDPRISHLEAVK
        DIAY V I ARYQ DPRI+HLE VK
Subjt:  DIAYAVEICARYQVDPRISHLEAVK

A0A5D3CJ17 Gag-pol polyprotein2.6e-14683.02Show/hide
Query:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL
        MQTRRK+KIDY+KM +  F        +   +GVNVIGTKW+FKNKIDE  CVTKNK RLVAQGYTQVEG+DFDETFAPVA+L+AIRLLLGISCIQKFKL
Subjt:  MQTRRKEKIDYMKMVVDLFTPIQTKQCLD-VKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKL

Query:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ
        YQMDVKSAFLN YLNEEVYVAQ KGFVDSEHPKHV+KLNK LYGLKQAPRAWYDRLT YLR +GY   + DKT FIHRKSDQLLVAQIYVDDIIFGGF  
Subjt:  YQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQ

Query:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD
        DLVNNFINIMQSEFEMS VGELSCF GLQIKQKNDDIFISQEKYAKNMVKKF LEQARNKRT AATHVKLT DT+G EVDHKLYRSIVG+LLYLT SRPD
Subjt:  DLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPD

Query:  IAYAVEICARYQVDPRISHLEAVK
        IAY V ICA YQ DPRI+HLEAVK
Subjt:  IAYAVEICARYQVDPRISHLEAVK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-4334.98Show/hide
Query:  NVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKH
        N++ ++WVF  K +E+    + K RLVA+G+TQ   ID++ETFAPVA++ + R +L +      K++QMDVK+AFLNG L EE+Y+   +G   S +  +
Subjt:  NVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKH

Query:  VHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKS--DQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQ
        V KLNK +YGLKQA R W++     L++  +     D+ ++I  K   ++ +   +YVDD++        +NNF   +  +F M+ + E+  F G++I+ 
Subjt:  VHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKS--DQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQ

Query:  KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVK---LTTDTDGVEVDHKLYRSIVGSLLY-LTASRPDIAYAVEICARY
        + D I++SQ  Y K ++ KF +E      TP  + +    L +D D     +   RS++G L+Y +  +RPD+  AV I +RY
Subjt:  KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVK---LTTDTDGVEVDHKLYRSIVGSLLY-LTASRPDIAYAVEICARY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-4735.97Show/hide
Query:  KGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEH
        KG   +  KWVFK K D    + + K RLV +G+ Q +GIDFDE F+PV ++ +IR +L ++     ++ Q+DVK+AFL+G L EE+Y+ Q +GF  +  
Subjt:  KGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEH

Query:  PKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSD-QLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQI
           V KLNK+LYGLKQAPR WY +   +++ + Y     D  ++  R S+   ++  +YVDD++  G  + L+      +   F+M  +G      G++I
Subjt:  PKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSD-QLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQI

Query:  --KQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHK------LYRSIVGSLLY-LTASRPDIAYAVEICARYQVDPRISHLE
          ++ +  +++SQEKY + ++++F ++ A+   TP A H+KL+       V+ K       Y S VGSL+Y +  +RPDIA+AV + +R+  +P   H E
Subjt:  --KQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHK------LYRSIVGSLLY-LTASRPDIAYAVEICARYQVDPRISHLE

Query:  AVK
        AVK
Subjt:  AVK

P25600 Putative transposon Ty5-1 protein YCL074W4.9e-2531.7Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDL
        MDV +AFLN  ++E +YV Q  GFV+  +P +V +L   +YGLKQAP  W + +   L+  G+   + +  L+    SD  +   +YVDD++    S  +
Subjt:  MDVKSAFLNGYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDL

Query:  VNNFINIMQSEFEMSMVGELSCFWGLQIKQ-KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLY-LTASRPD
         +     +   + M  +G++  F GL I Q  N DI +S + Y      +  +   +  +TP      L   T     D   Y+SIVG LL+     RPD
Subjt:  VNNFINIMQSEFEMSMVGELSCFWGLQIKQ-KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLY-LTASRPD

Query:  IAYAVEICARYQVDPRISHLEAVK
        I+Y V + +R+  +PR  HLE+ +
Subjt:  IAYAVEICARYQVDPRISHLEAVK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-5236.77Show/hide
Query:  VNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPK
        V ++G +W+F  K +    + + K RLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV +AFL G L ++VY++Q  GF+D + P 
Subjt:  VNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPK

Query:  HVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQK
        +V KL K LYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L++N ++ +   F +    EL  F G++ K+ 
Subjt:  HVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQK

Query:  NDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK
           + +SQ +Y  +++ +  +  A+   TP A   KL+  +     D   YR IVGSL YL  +RPDI+YAV   +++   P   HL+A+K
Subjt:  NDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-5236.43Show/hide
Query:  VNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPK
        V ++G +W+F  K +    + + K RLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV +AFL G L +EVY++Q  GFVD + P 
Subjt:  VNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFVDSEHPK

Query:  HVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQK
        +V +L K +YGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L+ + ++ +   F +    +L  F G++ K+ 
Subjt:  HVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQK

Query:  NDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK
           + +SQ +Y  +++ +  +  A+   TP AT  KLT  +     D   YR IVGSL YL  +RPD++YAV   ++Y   P   H  A+K
Subjt:  NDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-5038.14Show/hide
Query:  IGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFV----DSEHP
        IG KWV+K K +    + + K RLVA+GYTQ EGIDF ETF+PV +L +++L+L IS I  F L+Q+D+ +AFLNG L+EE+Y+    G+     DS  P
Subjt:  IGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQLKGFV----DSEHP

Query:  KHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQ
          V  L K++YGLKQA R W+ + ++ L   G+     D T F+   +   L   +YVDDII    +   V+   + ++S F++  +G L  F GL+I +
Subjt:  KHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQ

Query:  KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAV
            I I Q KYA +++ + GL   +    P    V  +  + G  VD K YR ++G L+YL  +R DI++AV   +++   PR++H +AV
Subjt:  KNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-1635.77Show/hide
Query:  IYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSI
        +YVDDI+  G S  L+N  I  + S F M  +G +  F G+QIK     +F+SQ KYA+ ++   G+   +   TP    +  +  T     D   +RSI
Subjt:  IYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGELSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSI

Query:  VGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK
        VG+L YLT +RPDI+YAV I  +   +P ++  + +K
Subjt:  VGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-0844.07Show/hide
Query:  NVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGIS
        N++G KWVFK K+     + + K RLVA+G+ Q EGI F ET++PV +   IR +L ++
Subjt:  NVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGTTGATTTATTTACTCCAATTCAGACGAAACAATGTCTGGACGTTAAAGGTGTAAACGTTATTGG
CACCAAATGGGTGTTTAAAAATAAGATTGATGAAGTTAGATGTGTGACAAAAAATAAAGTCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTATTGACTTTGATG
AAACGTTTGCTCCTGTTGCTCAACTTGAAGCCATTCGATTATTACTTGGTATATCATGCATACAGAAATTTAAATTGTATCAAATGGATGTAAAGAGTGCCTTCTTAAAT
GGATATTTGAATGAGGAGGTTTATGTTGCTCAACTAAAAGGTTTTGTTGATTCTGAGCACCCGAAGCATGTGCATAAGCTCAACAAAACTTTATATGGGCTAAAGCAAGC
TCCGAGAGCTTGGTATGATCGGCTAACTATTTACTTGAGAGATAAAGGATATTCTGGAGGAAAATTTGACAAGACCTTGTTTATACACAGAAAATCTGATCAACTTTTGG
TTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTTCTCAAGATCTAGTAAATAATTTCATTAACATTATGCAGTCAGAATTTGAAATGAGCATGGTTGGAGAA
CTTTCATGCTTTTGGGGACTTCAAATTAAGCAAAAGAATGACGACATCTTCATATCTCAAGAAAAGTATGCCAAGAATATGGTTAAAAAGTTTGGTTTGGAACAGGCTCG
AAATAAGCGGACTCCAGCTGCGACACATGTTAAACTTACAACAGACACCGATGGTGTTGAAGTTGATCACAAACTCTACAGGAGTATAGTAGGCAGCTTATTATACTTAA
CAGCAAGTCGACCTGACATAGCTTATGCTGTGGAAATATGTGCTCGTTATCAGGTGGATCCCCGCATCTCTCACCTAGAAGCTGTTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGTTGATTTATTTACTCCAATTCAGACGAAACAATGTCTGGACGTTAAAGGTGTAAACGTTATTGG
CACCAAATGGGTGTTTAAAAATAAGATTGATGAAGTTAGATGTGTGACAAAAAATAAAGTCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTATTGACTTTGATG
AAACGTTTGCTCCTGTTGCTCAACTTGAAGCCATTCGATTATTACTTGGTATATCATGCATACAGAAATTTAAATTGTATCAAATGGATGTAAAGAGTGCCTTCTTAAAT
GGATATTTGAATGAGGAGGTTTATGTTGCTCAACTAAAAGGTTTTGTTGATTCTGAGCACCCGAAGCATGTGCATAAGCTCAACAAAACTTTATATGGGCTAAAGCAAGC
TCCGAGAGCTTGGTATGATCGGCTAACTATTTACTTGAGAGATAAAGGATATTCTGGAGGAAAATTTGACAAGACCTTGTTTATACACAGAAAATCTGATCAACTTTTGG
TTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTTCTCAAGATCTAGTAAATAATTTCATTAACATTATGCAGTCAGAATTTGAAATGAGCATGGTTGGAGAA
CTTTCATGCTTTTGGGGACTTCAAATTAAGCAAAAGAATGACGACATCTTCATATCTCAAGAAAAGTATGCCAAGAATATGGTTAAAAAGTTTGGTTTGGAACAGGCTCG
AAATAAGCGGACTCCAGCTGCGACACATGTTAAACTTACAACAGACACCGATGGTGTTGAAGTTGATCACAAACTCTACAGGAGTATAGTAGGCAGCTTATTATACTTAA
CAGCAAGTCGACCTGACATAGCTTATGCTGTGGAAATATGTGCTCGTTATCAGGTGGATCCCCGCATCTCTCACCTAGAAGCTGTTAAATGA
Protein sequenceShow/hide protein sequence
MQTRRKEKIDYMKMVVDLFTPIQTKQCLDVKGVNVIGTKWVFKNKIDEVRCVTKNKVRLVAQGYTQVEGIDFDETFAPVAQLEAIRLLLGISCIQKFKLYQMDVKSAFLN
GYLNEEVYVAQLKGFVDSEHPKHVHKLNKTLYGLKQAPRAWYDRLTIYLRDKGYSGGKFDKTLFIHRKSDQLLVAQIYVDDIIFGGFSQDLVNNFINIMQSEFEMSMVGE
LSCFWGLQIKQKNDDIFISQEKYAKNMVKKFGLEQARNKRTPAATHVKLTTDTDGVEVDHKLYRSIVGSLLYLTASRPDIAYAVEICARYQVDPRISHLEAVK