; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0029691 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0029691
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:30984843..30986404
RNA-Seq ExpressionCmc01g0029691
SyntenyCmc01g0029691
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]1.4e-18062.62Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEEV Y  QPKGF D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ +Y KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
          LYRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + +  +LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IR+LV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDAN FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

AAO73523.1 gag-pol polyprotein [Glycine max]3.6e-18162.81Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEEV Y  QPKGF D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ +Y KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
         K YRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + ++S+LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IR+LV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDAN FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

AAO73527.1 gag-pol polyprotein [Glycine max]1.4e-18062.62Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEEV Y  QPKGF D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ +Y KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
          LYRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + +  +LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IR+LV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDAN FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

AAO73529.1 gag-pol polyprotein [Glycine max]4.7e-18162.81Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEE  Y  QPKGF+D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ KY KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
          LYRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + + S+LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IRELV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDA  FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

MCH79363.1 gag-pol polyprotein [Trifolium medium]4.4e-17961.51Show/hide
Query:  IADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRL
        I++ C+   IEP      L DE+WI A MQEEL QF+R+ VW LVP+P    +IGTKW+ +NK+DE   VT+NKARLVAQGY+QVEG+DFDETFAPV RL
Subjt:  IADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRL

Query:  EAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMN
        E+I LL+ ++CI RFKLYQMDV  AFLN YL+EEV Y  QPKGFID  YP HVYKL K LYGL+QAPRAWYERL I+L  +GY +    K     E+  N
Subjt:  EAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMN

Query:  SLLHKSML---------------MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHK
         ++ +  +                +  ++SEFEMS+VGEL+ FLGLQ+KQ  ++IF+SQ KY KNIVKKFG+E + +KRTPAAT +K+T+D  G   D  
Subjt:  SLLHKSML---------------MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHK

Query:  LYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFS
        +Y+S+I SLL+LTASRPDI +AVG+C RYQ +P+MSHL  VKRILKY++GTSD+ ILYS    S LVGYCDADWAGS DDRK+TSGGCFFLGNNLISWFS
Subjt:  LYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFS

Query:  KKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIK
        KKQNCVSLSTAEAEYIA  S+C+QL+W K ML +Y   QD+MTL+CDN++AI+ISKN +QHSRTK+IDIRH+FIR+LVE  V+TL H+ ++ QLADIF K
Subjt:  KKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIK

Query:  PLDANTFEHLRTGLDAC
         LDAN +E LR  L  C
Subjt:  PLDANTFEHLRTGLDAC

TrEMBL top hitse value%identityAlignment
A0A392LWM0 Gag-pol polyprotein (Fragment)2.1e-17961.51Show/hide
Query:  IADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRL
        I++ C+   IEP      L DE+WI A MQEEL QF+R+ VW LVP+P    +IGTKW+ +NK+DE   VT+NKARLVAQGY+QVEG+DFDETFAPV RL
Subjt:  IADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRL

Query:  EAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMN
        E+I LL+ ++CI RFKLYQMDV  AFLN YL+EEV Y  QPKGFID  YP HVYKL K LYGL+QAPRAWYERL I+L  +GY +    K     E+  N
Subjt:  EAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMN

Query:  SLLHKSML---------------MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHK
         ++ +  +                +  ++SEFEMS+VGEL+ FLGLQ+KQ  ++IF+SQ KY KNIVKKFG+E + +KRTPAAT +K+T+D  G   D  
Subjt:  SLLHKSML---------------MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHK

Query:  LYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFS
        +Y+S+I SLL+LTASRPDI +AVG+C RYQ +P+MSHL  VKRILKY++GTSD+ ILYS    S LVGYCDADWAGS DDRK+TSGGCFFLGNNLISWFS
Subjt:  LYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFS

Query:  KKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIK
        KKQNCVSLSTAEAEYIA  S+C+QL+W K ML +Y   QD+MTL+CDN++AI+ISKN +QHSRTK+IDIRH+FIR+LVE  V+TL H+ ++ QLADIF K
Subjt:  KKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIK

Query:  PLDANTFEHLRTGLDAC
         LDAN +E LR  L  C
Subjt:  PLDANTFEHLRTGLDAC

Q84VH6 Gag-pol polyprotein2.3e-18162.81Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEE  Y  QPKGF+D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ KY KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
          LYRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + + S+LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IRELV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDA  FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

Q84VH8 Gag-pol polyprotein6.6e-18162.62Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEEV Y  QPKGF D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ +Y KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
          LYRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + +  +LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IR+LV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDAN FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

Q84VI2 Gag-pol polyprotein1.7e-18162.81Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEEV Y  QPKGF D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ +Y KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
         K YRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + ++S+LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IR+LV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDAN FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

Q84VI4 Gag-pol polyprotein6.6e-18162.62Show/hide
Query:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR
        ++++ C+   IEP      L DE+WINA MQEEL QF+RN VW LVP+P+   +IGTKWI KNKT+E   +T+NKARLVAQGY Q+EGVDFDETFAPV R
Subjt:  MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVR

Query:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM
        LE+I LLL ++CI +FKLYQMDV  AFLN YLNEEV Y  QPKGF D  +P HVY+L K LYGL+QAPRAWYERL  +L+ +GY +  + K  LF +Q  
Subjt:  LEAIGLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMM

Query:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD
         +L              +   ML   +  ++SEFEMS+VGEL+ FLGLQ+KQ  +SIF+SQ +Y KNIVKKFG+E + HKRTPA T +K++KD  G   D
Subjt:  NSL--------------LHKSML--MISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARAD

Query:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW
          LYRS+I SLL+LTASRPDI YAVG+C RYQ +P++SHL  VKRILKYV+GTSD+ I+Y + +  +LVGYCDADWAGS DDRK+TSGGCF+LGNNLISW
Subjt:  HKLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISW

Query:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF
        FSKKQNCVSLSTAEAEYIA  S+C+QLVW K ML EY   QD+MTLYCDNM+AI+ISKN VQHSRTK+IDIRH++IR+LV++KVITL H+ +  Q+ADIF
Subjt:  FSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIF

Query:  IKPLDANTFEHLRTGLDAC
         K LDAN FE LR  L  C
Subjt:  IKPLDANTFEHLRTGLDAC

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-7134.7Show/hide
Query:  PSTFD-IFLKDE--YWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLI
        P++FD I  +D+   W  AI   EL   + NN WT+  +P+   I+ ++W+   K +E     + KARLVA+G+ Q   +D++ETFAPV R+ +   +L 
Subjt:  PSTFD-IFLKDE--YWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLI

Query:  ISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTK-HYLFTEQMMNS----LL
        +      K++QMDV  AFLN  L EE IY   P+G        +V KLNK +YGL+QA R W+E     L    +  + + +  Y+  +  +N     LL
Subjt:  ISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTK-HYLFTEQMMNS----LL

Query:  HKSMLMIS------------YLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSII
        +   ++I+            YL  +F M+ + E+  F+G++I+ + + I++SQ  YVK I+ KF +E      TP  +++   +  N     +   RS+I
Subjt:  HKSMLMIS------------YLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSII

Query:  ESLLH-LTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTT--SILVGYCDADWAGSTDDRKTTSGGCFFLGN-NLISWFSKK
          L++ +  +RPD+  AV I  RY +       + +KR+L+Y+ GT D ++++  +    + ++GY D+DWAGS  DRK+T+G  F + + NLI W +K+
Subjt:  ESLLH-LTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTT--SILVGYCDADWAGSTDDRKTTSGGCFFLGN-NLISWFSKK

Query:  QNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGF-TQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKP
        QN V+ S+ EAEY+A+  A  + +W K +L       ++ + +Y DN   I I+ N   H R K+IDI+++F RE V+N VI L +I +  QLADIF KP
Subjt:  QNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGF-TQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKP

Query:  LDANTFEHLRTGL
        L A  F  LR  L
Subjt:  LDANTFEHLRTGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-7936.46Show/hide
Query:  MQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLIISCIQRFKLYQMDVNGAFLN
        MQEE+   ++N  + LV  PK  + +  KW+ K K D    + + KARLV +G+ Q +G+DFDE F+PVV++ +I  +L ++     ++ Q+DV  AFL+
Subjt:  MQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLIISCIQRFKLYQMDVNGAFLN

Query:  DYLNEEVIYAVQPKGFIDFEYPKH-VYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNS----LLHKSMLMI------------S
          L EE IY  QP+GF +    KH V KLNK LYGL+QAPR WY +   ++  + Y +        F     N+    LL+   ++I             
Subjt:  DYLNEEVIYAVQPKGFIDFEYPKH-VYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNS----LLHKSMLMI------------S

Query:  YLESEFEMSMVGELSCFLGLQI--KQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHK------LYRSIIESLLH-LTASRP
         L   F+M  +G     LG++I  ++ S  +++SQEKY++ ++++F ++ ++   TP A  +K++K       + K       Y S + SL++ +  +RP
Subjt:  YLESEFEMSMVGELSCFLGLQI--KQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHK------LYRSIIESLLH-LTASRP

Query:  DIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEAEYIA
        DIA+AVG+  R+  +P   H EAVK IL+Y+ GT+   + +   +  IL GY DAD AG  D+RK+++G  F      ISW SK Q CV+LST EAEYIA
Subjt:  DIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEAEYIA

Query:  VRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTFE
              +++W K  L E G  Q    +YCD+ +AID+SKN + H+RTK+ID+R+++IRE+V+++ + +  I +N   AD+  K +  N FE
Subjt:  VRSACTQLVWRKNMLHEYGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTFE

P92519 Uncharacterized mitochondrial protein AtMg008106.4e-3237.85Show/hide
Query:  NSLLHKSMLMISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIESLLHLTA
        N+LL+   ++I  L S F M  +G +  FLG+QIK     +F+SQ KY + I+   G+   +   TP   ++  +  T     D   +RSI+ +L +LT 
Subjt:  NSLLHKSMLMISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIESLLHLTA

Query:  SRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILV-GYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEA
        +RPDI+YAV I  +   +P ++  + +KR+L+YV GT  F  LY +  + + V  +CD+DWAG T  R++T+G C FLG N+ISW +K+Q  VS S+ E 
Subjt:  SRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILV-GYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEA

Query:  EYIAVRSACTQLVW
        EY A+     +L W
Subjt:  EYIAVRSACTQLVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-7835.55Show/hide
Query:  EPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLV-PKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLII
        EP T    LKDE W NA+  E   Q   N+ W LV P P    I+G +WI   K +    + + KARLVA+GY Q  G+D+ ETF+PV++  +I ++L +
Subjt:  EPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLV-PKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLII

Query:  SCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNSLLHKSMLM
        +  + + + Q+DVN AFL   L ++V Y  QP GFID + P +V KL K LYGL+QAPRAWY  L  YL   G+  + ++   LF  Q   S+++  + +
Subjt:  SCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNSLLHKSMLM

Query:  ----------------ISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIES
                        +  L   F +    EL  FLG++ K+    + +SQ +Y+ +++ +  +  ++   TP A   K++  +     D   YR I+ S
Subjt:  ----------------ISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIES

Query:  LLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSL
        L +L  +RPDI+YAV    ++   P   HL+A+KRIL+Y+ GT +  I      T  L  Y DADWAG  DD  +T+G   +LG++ ISW SKKQ  V  
Subjt:  LLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSL

Query:  STAEAEYIAVRSACTQLVWRKNMLHEYGF-TQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTF
        S+ EAEY +V +  +++ W  ++L E G        +YCDN+ A  +  N V HSR K+I I ++FIR  V++  + + H+ ++ QLAD   KPL    F
Subjt:  STAEAEYIAVRSACTQLVWRKNMLHEYGF-TQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTF

Query:  EHLRTGLDACRI
        ++  + +   R+
Subjt:  EHLRTGLDACRI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.9e-7834.57Show/hide
Query:  EPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLV-PKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLII
        EP T    +KD+ W  A+  E   Q   N+ W LV P P    I+G +WI   K +    + + KARLVA+GY Q  G+D+ ETF+PV++  +I ++L +
Subjt:  EPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLV-PKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLLLII

Query:  SCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNSLLHKSMLM
        +  + + + Q+DVN AFL   L +EV Y  QP GF+D + P +V +L K +YGL+QAPRAWY  L  YL   G+  + ++   LF  Q   S+++  + +
Subjt:  SCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNSLLHKSMLM

Query:  ----------------ISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIES
                        +  L   F +    +L  FLG++ K+  + + +SQ +Y  +++ +  +  ++   TP AT  K+T  +     D   YR I+ S
Subjt:  ----------------ISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIES

Query:  LLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSL
        L +L  +RPD++YAV    +Y   P   H  A+KR+L+Y+ GT D  I      T  L  Y DADWAG TDD  +T+G   +LG++ ISW SKKQ  V  
Subjt:  LLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSL

Query:  STAEAEYIAVRSACTQLVWRKNMLHEYGF-TQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTF
        S+ EAEY +V +  ++L W  ++L E G        +YCDN+ A  +  N V HSR K+I + ++FIR  V++  + + H+ ++ QLAD   KPL    F
Subjt:  STAEAEYIAVRSACTQLVWRKNMLHEYGF-TQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTF

Query:  EHLRTGLDACRI
        ++    +   ++
Subjt:  EHLRTGLDACRI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.4e-6834.94Show/hide
Query:  LCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAI
        +C   A EPST++   +   W  A M +E+      + W +   P   K IG KW+ K K +    + + KARLVA+GY Q EG+DF ETF+PV +L ++
Subjt:  LCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAI

Query:  GLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFI----DFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHY---LFTE
         L+L IS I  F L+Q+D++ AFLN  L+EE IY   P G+     D   P  V  L K +YGL+QA R W+ + ++ L   G+ ++     Y   +   
Subjt:  GLLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFI----DFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHY---LFTE

Query:  QMMNSLLHKSMLMI------------SYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADH
          +  L++   ++I            S L+S F++  +G L  FLGL+I + +  I I Q KY  +++ + GL   +    P    V  +  + G   D 
Subjt:  QMMNSLLHKSMLMI------------SYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADH

Query:  KLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWF
        K YR +I  L++L  +R DI++AV    ++   P+++H +AV +IL Y+ GT    + YS      L  + DA +    D R++T+G C FLG +LISW 
Subjt:  KLYRSIIESLLHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWF

Query:  SKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMT-LYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRE
        SKKQ  VS S+AEAEY A+  A  +++W      E        T L+CDN  AI I+ N V H RTK+I+   + +RE
Subjt:  SKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHEYGFTQDIMT-LYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.1e-0831.82Show/hide
Query:  LHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGC-----FFLG
        ++LT +RPD+ +AV    ++ +  + + ++AV ++L YV GT    + YS  +   L  + D+DWA   D R++ +G C     +FLG
Subjt:  LHLTASRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGC-----FFLG

ATMG00810.1 DNA/RNA polymerases superfamily protein4.6e-3337.85Show/hide
Query:  NSLLHKSMLMISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIESLLHLTA
        N+LL+   ++I  L S F M  +G +  FLG+QIK     +F+SQ KY + I+   G+   +   TP   ++  +  T     D   +RSI+ +L +LT 
Subjt:  NSLLHKSMLMISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIESLLHLTA

Query:  SRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILV-GYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEA
        +RPDI+YAV I  +   +P ++  + +KR+L+YV GT  F  LY +  + + V  +CD+DWAG T  R++T+G C FLG N+ISW +K+Q  VS S+ E 
Subjt:  SRPDIAYAVGICVRYQTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILV-GYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEA

Query:  EYIAVRSACTQLVW
        EY A+     +L W
Subjt:  EYIAVRSACTQLVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.9e-1543.27Show/hide
Query:  TFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLL
        T   EP +    LKD  W  A MQEEL    RN  W LVP P    I+G KW+ K K      + + KARLVA+G+ Q EG+ F ET++PVVR   I  +
Subjt:  TFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIGLL

Query:  LIIS
        L ++
Subjt:  LIIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGCTGATTTATGTTATACATTTGCCATTGAACCTTCGACTTTTGATATTTTTCTTAAAGATGAATATTGGATAAATGCAATAATGCAAGAAGAACTACTT
CAATTCAGGCGTAACAATGTTTGGACATTAGTTCCCAAGCCAAAAAGAGCGAAAATTATAGGAACAAAATGGATCTTAAAAAATAAGACTGATGAAGCAAGGTGT
GTGACAAAGAATAAAGCTCGCCTAGTTGCTCAGGGTTATGCTCAAGTCGAGGGGGTTGACTTTGATGAAACGTTTGCACCTGTTGTCAGACTTGAAGCTATTGGC
TTGTTACTCATAATATCTTGTATTCAAAGATTTAAATTATATCAAATGGATGTAAATGGTGCCTTCCTGAATGATTATTTAAATGAAGAAGTGATCTATGCTGTA
CAACCTAAAGGATTTATCGATTTTGAATATCCTAAGCATGTGTATAAGCTTAATAAAGTTTTATATGGGCTTAGGCAAGCTCCTAGAGCATGGTACGAACGATTG
GCGATTTATTTAAGTTGTAAAGGATATTTCAGAGCAGAGCTGACAAAACATTATTTATTCACAGAACAAATGATGAACTCATTGTTGCACAAATCTATGTTGATG
ATATCATATTTGGAGTCAGAATTTGAGATGAGCATGGTGGGAGAACTATCGTGTTTTCTGGGTCTTCAAATCAAGCAGAAAAGTGAAAGTATATTCATATCTCAA
GAAAAGTATGTCAAGAACATAGTTAAAAAATTCGGGTTAGAACAGTCTCGACATAAAAGAACTCCAGCTGCGACACAAGTTAAAATTACCAAAGATACTAATGGT
GCAAGAGCAGATCACAAACTTTACAGAAGCATAATCGAAAGTCTATTGCATCTAACTGCCAGTCGACCTGACATCGCTTATGCTGTTGGGATATGTGTTCGTTAT
CAGACTGATCCTCAAATGTCACATTTAGAAGCTGTTAAAAGGATCCTCAAGTATGTTCATGGAACAAGTGATTTTAGAATTCTGTATTCCTATGACACAACTTCT
ATCTTGGTTGGATATTGTGATGCCGATTGGGCAGGCTCTACTGATGATAGGAAAACCACCTCTGGAGGTTGTTTCTTTCTTGGAAATAATCTTATCTCATGGTTC
AGTAAGAAACAAAATTGTGTTTCCTTGTCTACAGCTGAAGCTGAGTATATAGCTGTAAGGAGTGCTTGTACCCAATTGGTTTGGAGGAAAAATATGTTGCATGAA
TATGGATTCACACAGGATATTATGACTTTATATTGTGATAATATGAATGCCATTGATATATCAAAAAATATGGTTCAACATAGTCGAACCAAATATATTGATATT
AGGCATTATTTTATTAGAGAACTTGTTGAAAATAAGGTTATTACACTGGCTCACATTCGATCGAACTTGCAATTAGCAGATATTTTCATTAAGCCACTTGATGCA
AACACATTTGAGCATTTACGTACTGGCTTAGATGCATGTCGCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCGCTGATTTATGTTATACATTTGCCATTGAACCTTCGACTTTTGATATTTTTCTTAAAGATGAATATTGGATAAATGCAATAATGCAAGAAGAACTACTT
CAATTCAGGCGTAACAATGTTTGGACATTAGTTCCCAAGCCAAAAAGAGCGAAAATTATAGGAACAAAATGGATCTTAAAAAATAAGACTGATGAAGCAAGGTGT
GTGACAAAGAATAAAGCTCGCCTAGTTGCTCAGGGTTATGCTCAAGTCGAGGGGGTTGACTTTGATGAAACGTTTGCACCTGTTGTCAGACTTGAAGCTATTGGC
TTGTTACTCATAATATCTTGTATTCAAAGATTTAAATTATATCAAATGGATGTAAATGGTGCCTTCCTGAATGATTATTTAAATGAAGAAGTGATCTATGCTGTA
CAACCTAAAGGATTTATCGATTTTGAATATCCTAAGCATGTGTATAAGCTTAATAAAGTTTTATATGGGCTTAGGCAAGCTCCTAGAGCATGGTACGAACGATTG
GCGATTTATTTAAGTTGTAAAGGATATTTCAGAGCAGAGCTGACAAAACATTATTTATTCACAGAACAAATGATGAACTCATTGTTGCACAAATCTATGTTGATG
ATATCATATTTGGAGTCAGAATTTGAGATGAGCATGGTGGGAGAACTATCGTGTTTTCTGGGTCTTCAAATCAAGCAGAAAAGTGAAAGTATATTCATATCTCAA
GAAAAGTATGTCAAGAACATAGTTAAAAAATTCGGGTTAGAACAGTCTCGACATAAAAGAACTCCAGCTGCGACACAAGTTAAAATTACCAAAGATACTAATGGT
GCAAGAGCAGATCACAAACTTTACAGAAGCATAATCGAAAGTCTATTGCATCTAACTGCCAGTCGACCTGACATCGCTTATGCTGTTGGGATATGTGTTCGTTAT
CAGACTGATCCTCAAATGTCACATTTAGAAGCTGTTAAAAGGATCCTCAAGTATGTTCATGGAACAAGTGATTTTAGAATTCTGTATTCCTATGACACAACTTCT
ATCTTGGTTGGATATTGTGATGCCGATTGGGCAGGCTCTACTGATGATAGGAAAACCACCTCTGGAGGTTGTTTCTTTCTTGGAAATAATCTTATCTCATGGTTC
AGTAAGAAACAAAATTGTGTTTCCTTGTCTACAGCTGAAGCTGAGTATATAGCTGTAAGGAGTGCTTGTACCCAATTGGTTTGGAGGAAAAATATGTTGCATGAA
TATGGATTCACACAGGATATTATGACTTTATATTGTGATAATATGAATGCCATTGATATATCAAAAAATATGGTTCAACATAGTCGAACCAAATATATTGATATT
AGGCATTATTTTATTAGAGAACTTGTTGAAAATAAGGTTATTACACTGGCTCACATTCGATCGAACTTGCAATTAGCAGATATTTTCATTAAGCCACTTGATGCA
AACACATTTGAGCATTTACGTACTGGCTTAGATGCATGTCGCATTTAA
Protein sequenceShow/hide protein sequence
MIADLCYTFAIEPSTFDIFLKDEYWINAIMQEELLQFRRNNVWTLVPKPKRAKIIGTKWILKNKTDEARCVTKNKARLVAQGYAQVEGVDFDETFAPVVRLEAIG
LLLIISCIQRFKLYQMDVNGAFLNDYLNEEVIYAVQPKGFIDFEYPKHVYKLNKVLYGLRQAPRAWYERLAIYLSCKGYFRAELTKHYLFTEQMMNSLLHKSMLM
ISYLESEFEMSMVGELSCFLGLQIKQKSESIFISQEKYVKNIVKKFGLEQSRHKRTPAATQVKITKDTNGARADHKLYRSIIESLLHLTASRPDIAYAVGICVRY
QTDPQMSHLEAVKRILKYVHGTSDFRILYSYDTTSILVGYCDADWAGSTDDRKTTSGGCFFLGNNLISWFSKKQNCVSLSTAEAEYIAVRSACTQLVWRKNMLHE
YGFTQDIMTLYCDNMNAIDISKNMVQHSRTKYIDIRHYFIRELVENKVITLAHIRSNLQLADIFIKPLDANTFEHLRTGLDACRI