; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0225471 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0225471
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr08:16555985..16556950
RNA-Seq ExpressionCmc08g0225471
SyntenyCmc08g0225471
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056162.1 putative mitochondrial protein [Cucumis melo var. makuwa]7.1e-167100Show/hide
Query:  VQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLLYL
        VQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLLYL
Subjt:  VQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLLYL

Query:  TASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAE
        TASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAE
Subjt:  TASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAE

Query:  AKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRA
        AKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRA
Subjt:  AKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRA

MCH79438.1 gag-pol polyprotein [Trifolium medium]7.3e-11162.88Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMD----KAKP-----KLTKD
        +F+  +  +  I QIYVDDIIFGG S+T V+ FV QM+ EFE+SMVGELT+FLG Q+KQ +  IF SQ KYAKN++ KFGM+    K  P     KLTKD
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMD----KAKP-----KLTKD

Query:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
             VD S+YRS+IGSLLYLT SRPDI F VGVCA+YQA+ + SHL   KRILKY+ GTCN+G+ Y+      L GYCDADWA  +DDRKSTS GCFFL
Subjt:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
         NN+ +WFSKK N VSLSTAEA+YIA GSSCSQL+WMKQM++EY + Q  + LYCDN+S INISKNP+QHSRTK IDI HHFIR+LVE N+I+LEHV + 
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFEGLRAGIRVC
         QLADIFTK LD   FE LR  + +C
Subjt:  LQLADIFTKPLDVSTFEGLRAGIRVC

MCH90383.1 gag-pol polyprotein [Trifolium medium]8.6e-11263.5Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKA---------KPKLTKD
        +F+  +  +  I QIYVDDI+FGG +   V+ FV QM+ EFE+SMVGELT+FLG QIKQ +   F SQ KYAKN++ KFGMD A           KLTKD
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKA---------KPKLTKD

Query:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
             VD S+YRS+IGSLLYLTASRPDIA+ VGVCARYQAD + SHL   KRILKY+ GTC++G+ YT     VL+GYCDADWA  +DDRKSTS GCFFL
Subjt:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
         NN+ +WFSKKQNSVSLSTAEA+YIA GSSCSQL+WMKQM+ EY + Q  M LYCDN+S INISKNP+QHSRTK IDIRHHFIR+LVE  +++LEH+ + 
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFEGLRAGIRVC
         QLADIFTK LD + FE LR  + +C
Subjt:  LQLADIFTKPLDVSTFEGLRAGIRVC

TXG60530.1 hypothetical protein EZV62_015103 [Acer yangbiense]1.9e-11162.88Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKP---------KLTKD
        +FI R   E FI QIYVDDI+FG T++T V+QFV+ M  EFE+S+VGEL++FLG QI+Q    IF +Q KYAKNL+ KFG++ AK          KL+KD
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKP---------KLTKD

Query:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
        A+ + V+ +LYR +IGSLLYLTASRPDI F+VG+CARYQAD + SHL S KRI++Y+ GT N+G+WY+FDT   LVG+ DADWA   DDRKSTS GCFFL
Subjt:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
         NN+ +WF KKQNS+SLSTAEA+YIA GS C+QLLWMKQM+ +YG  Q ++ L+CDN+S INISKNPVQHSRTK IDIRHHFIRELVE   I LEHV ++
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFEGLRAGIRVC
         QLAD+FTKPLD + F+ L   I VC
Subjt:  LQLADIFTKPLDVSTFEGLRAGIRVC

TYJ96615.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.6e-11677.82Show/hide
Query:  GELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAK---------PKLTKDATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSH
        G+LTFFLGFQIKQ +T IFFSQEKYAKN +SKFG DKAK          K+TKDAT EKVDSSLYRS IGS LYL AS+ DI F +GVCARYQ + +TSH
Subjt:  GELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAK---------PKLTKDATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSH

Query:  LHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI
        LHSAKRILKYI  TCN+ LWYTFDTT VLVGYCDADWA CSDDRKSTSRGCFFL NNIA WFSKKQNSV LSTAEA+YIA GSSCSQLL MKQMMEEYG+
Subjt:  LHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI

Query:  LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRAGIRVCQQLA
         Q S VLYCDNIS I+ISKNPVQHSRTK IDI HH IRELVE NIISLEHVRS LQLADIFTKPLDVST EGL AG+    +L+
Subjt:  LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRAGIRVCQQLA

TrEMBL top hitse value%identityAlignment
A0A392LWW5 Gag-pol polyprotein3.5e-11162.88Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMD----KAKP-----KLTKD
        +F+  +  +  I QIYVDDIIFGG S+T V+ FV QM+ EFE+SMVGELT+FLG Q+KQ +  IF SQ KYAKN++ KFGM+    K  P     KLTKD
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMD----KAKP-----KLTKD

Query:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
             VD S+YRS+IGSLLYLT SRPDI F VGVCA+YQA+ + SHL   KRILKY+ GTCN+G+ Y+      L GYCDADWA  +DDRKSTS GCFFL
Subjt:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
         NN+ +WFSKK N VSLSTAEA+YIA GSSCSQL+WMKQM++EY + Q  + LYCDN+S INISKNP+QHSRTK IDI HHFIR+LVE N+I+LEHV + 
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFEGLRAGIRVC
         QLADIFTK LD   FE LR  + +C
Subjt:  LQLADIFTKPLDVSTFEGLRAGIRVC

A0A392MST7 Gag-pol polyprotein (Fragment)4.2e-11263.5Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKA---------KPKLTKD
        +F+  +  +  I QIYVDDI+FGG +   V+ FV QM+ EFE+SMVGELT+FLG QIKQ +   F SQ KYAKN++ KFGMD A           KLTKD
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKA---------KPKLTKD

Query:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
             VD S+YRS+IGSLLYLTASRPDIA+ VGVCARYQAD + SHL   KRILKY+ GTC++G+ YT     VL+GYCDADWA  +DDRKSTS GCFFL
Subjt:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
         NN+ +WFSKKQNSVSLSTAEA+YIA GSSCSQL+WMKQM+ EY + Q  M LYCDN+S INISKNP+QHSRTK IDIRHHFIR+LVE  +++LEH+ + 
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFEGLRAGIRVC
         QLADIFTK LD + FE LR  + +C
Subjt:  LQLADIFTKPLDVSTFEGLRAGIRVC

A0A5A7UJW3 Putative mitochondrial protein3.5e-167100Show/hide
Query:  VQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLLYL
        VQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLLYL
Subjt:  VQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLLYL

Query:  TASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAE
        TASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAE
Subjt:  TASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAE

Query:  AKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRA
        AKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRA
Subjt:  AKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRA

A0A5C7HUE9 Integrase catalytic domain-containing protein9.2e-11262.88Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKP---------KLTKD
        +FI R   E FI QIYVDDI+FG T++T V+QFV+ M  EFE+S+VGEL++FLG QI+Q    IF +Q KYAKNL+ KFG++ AK          KL+KD
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKP---------KLTKD

Query:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
        A+ + V+ +LYR +IGSLLYLTASRPDI F+VG+CARYQAD + SHL S KRI++Y+ GT N+G+WY+FDT   LVG+ DADWA   DDRKSTS GCFFL
Subjt:  ATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
         NN+ +WF KKQNS+SLSTAEA+YIA GS C+QLLWMKQM+ +YG  Q ++ L+CDN+S INISKNPVQHSRTK IDIRHHFIRELVE   I LEHV ++
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFEGLRAGIRVC
         QLAD+FTKPLD + F+ L   I VC
Subjt:  LQLADIFTKPLDVSTFEGLRAGIRVC

A0A5D3BDS2 Gag-pol polyprotein1.2e-11677.82Show/hide
Query:  GELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAK---------PKLTKDATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSH
        G+LTFFLGFQIKQ +T IFFSQEKYAKN +SKFG DKAK          K+TKDAT EKVDSSLYRS IGS LYL AS+ DI F +GVCARYQ + +TSH
Subjt:  GELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAK---------PKLTKDATREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSH

Query:  LHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI
        LHSAKRILKYI  TCN+ LWYTFDTT VLVGYCDADWA CSDDRKSTSRGCFFL NNIA WFSKKQNSV LSTAEA+YIA GSSCSQLL MKQMMEEYG+
Subjt:  LHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI

Query:  LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRAGIRVCQQLA
         Q S VLYCDNIS I+ISKNPVQHSRTK IDI HH IRELVE NIISLEHVRS LQLADIFTKPLDVST EGL AG+    +L+
Subjt:  LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRAGIRVCQQLA

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-4533.44Show/hide
Query:  EFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAK-------PKLTKDATREKVD-SS
        E   V +YVDD++      T +  F   +  +F ++ + E+  F+G +I+  + +I+ SQ  Y K ++SKF M+           K+  +      D ++
Subjt:  EFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAK-------PKLTKDATREKVD-SS

Query:  LYRSIIGSLLY-LTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTT--EVLVGYCDADWAVCSDDRKSTSRGCFFLKN-NIA
          RS+IG L+Y +  +RPD+   V + +RY +   +    + KR+L+Y+ GT +  L +  +      ++GY D+DWA    DRKST+   F + + N+ 
Subjt:  LYRSIIGSLLY-LTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTT--EVLVGYCDADWAVCSDDRKSTSRGCFFLKN-NIA

Query:  AWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLA
         W +K+QNSV+ S+ EA+Y+A+  +  + LW+K ++    I L+  + +Y DN   I+I+ NP  H R K IDI++HF RE V+ N+I LE++ +  QLA
Subjt:  AWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLA

Query:  DIFTKPLDVSTFEGLRAGIRVCQ
        DIFTKPL  + F  LR  + + Q
Subjt:  DIFTKPLDVSTFEGLRAGIRVCQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-4333.44Show/hide
Query:  FFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKT--RIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSL------
        F I+ +YVDD++  G     + +    +   F++  +G     LG +I + +T  +++ SQEKY + ++ +F M  AKP  T  A   K+   +      
Subjt:  FFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKT--RIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSL------

Query:  ---------YRSIIGSLLY-LTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
                 Y S +GSL+Y +  +RPDIA  VGV +R+  +    H  + K IL+Y+ GT    L +   +  +L GY DAD A   D+RKS++   F  
Subjt:  ---------YRSIIGSLLY-LTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS
             +W SK Q  V+LST EA+YIA   +  +++W+K+ ++E G+ Q   V+YCD+ S I++SKN + H+RTK ID+R+H+IRE+V+   + +  + ++
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSS

Query:  LQLADIFTKPLDVSTFE
           AD+ TK +  + FE
Subjt:  LQLADIFTKPLDVSTFE

P92519 Uncharacterized mitochondrial protein AtMg008105.7e-3434.84Show/hide
Query:  IYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLT--------KDATREKVDSSLYRSII
        +YVDDI+  G+S+T +   + Q+   F +  +G + +FLG QIK + + +F SQ KYA+ +++  GM   KP  T          +T +  D S +RSI+
Subjt:  IYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLT--------KDATREKVDSSLYRSII

Query:  GSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSV
        G+L YLT +RPDI++ V +  +   +   +     KR+L+Y+ GT   GL+   ++   +  +CD+DWA C+  R+ST+  C FL  NI +W +K+Q +V
Subjt:  GSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSV

Query:  SLSTAEAKYIAVGSSCSQLLW
        S S+ E +Y A+  + ++L W
Subjt:  SLSTAEAKYIAVGSSCSQLLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-4935.28Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKV---
        +F+ ++G     + +YVDDI+  G   T +   ++ +   F +    EL +FLG + K+  T +  SQ +Y  +L+++  M  AKP  T  A   K+   
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKV---

Query:  ------DSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
              D + YR I+GSL YL  +RPDI++ V   +++       HL + KRIL+Y+ GT N G++     T  L  Y DADWA   DD  ST+    +L
Subjt:  ------DSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRS
         ++  +W SKKQ  V  S+ EA+Y +V ++ S++ W+  ++ E GI L    V+YCDN+    +  NPV HSR K I I +HFIR  V++  + + HV +
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRS

Query:  SLQLADIFTKPLDVSTFEGLRAGIRV
          QLAD  TKPL  + F+   + I V
Subjt:  SLQLADIFTKPLDVSTFEGLRAGIRV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.3e-4833.74Show/hide
Query:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKV---
        +F+ ++G     + +YVDDI+  G  +  ++  ++ +   F +    +L +FLG + K+    +  SQ +Y  +L+++  M  AKP  T  AT  K+   
Subjt:  MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKV---

Query:  ------DSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL
              D + YR I+GSL YL  +RPD+++ V   ++Y       H ++ KR+L+Y+ GT + G++     T  L  Y DADWA  +DD  ST+    +L
Subjt:  ------DSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFL

Query:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRS
         ++  +W SKKQ  V  S+ EA+Y +V ++ S+L W+  ++ E GI L +  V+YCDN+    +  NPV HSR K I + +HFIR  V++  + + HV +
Subjt:  KNNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRS

Query:  SLQLADIFTKPLDVSTFEGLRAGIRV
          QLAD  TKPL    F+     I V
Subjt:  SLQLADIFTKPLDVSTFEGLRAGIRV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.4e-4134.39Show/hide
Query:  FIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPK---------LTKDA
        F+    T F  V +YVDDII    +   V++  +Q+K  F++  +G L +FLG +I +    I   Q KYA +L+ + G+   KP           +  +
Subjt:  FIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPK---------LTKDA

Query:  TREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLK
          + VD+  YR +IG L+YL  +R DI+F V   +++    R +H  +  +IL YI GT   GL+Y+      L  + DA +  C D R+ST+  C FL 
Subjt:  TREKVDSSLYRSIIGSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLK

Query:  NNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRE
         ++ +W SKKQ  VS S+AEA+Y A+  +  +++W+ Q   E  + L    +L+CDN + I+I+ N V H RTK I+   H +RE
Subjt:  NNIAAWFSKKQNSVSLSTAEAKYIAVGSSCSQLLWMKQMMEEYGI-LQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.0e-1032.22Show/hide
Query:  LYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWF
        +YLT +RPD+ F V   +++ +  RT+ + +  ++L Y+ GT   GL+Y+  +   L  + D+DWA C D R+S +  C    + +  WF
Subjt:  LYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWF

ATMG00810.1 DNA/RNA polymerases superfamily protein4.1e-3534.84Show/hide
Query:  IYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLT--------KDATREKVDSSLYRSII
        +YVDDI+  G+S+T +   + Q+   F +  +G + +FLG QIK + + +F SQ KYA+ +++  GM   KP  T          +T +  D S +RSI+
Subjt:  IYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLT--------KDATREKVDSSLYRSII

Query:  GSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSV
        G+L YLT +RPDI++ V +  +   +   +     KR+L+Y+ GT   GL+   ++   +  +CD+DWA C+  R+ST+  C FL  NI +W +K+Q +V
Subjt:  GSLLYLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSV

Query:  SLSTAEAKYIAVGSSCSQLLW
        S S+ E +Y A+  + ++L W
Subjt:  SLSTAEAKYIAVGSSCSQLLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCATATATCGACAGGGTACTGAATTTTTTATAGTTCAGATTTATGTTGATGACATTATATTTGGTGGGACGTCCTCTACATATGTTGAACAGTTTGTTAATCAAAT
GAAGGGCGAATTTGAAATAAGCATGGTTGGTGAACTGACATTCTTTCTAGGATTTCAGATTAAGCAATACAAGACTAGAATTTTCTTTTCTCAAGAAAAATATGCAAAGA
ATCTCATCTCCAAGTTTGGAATGGATAAAGCCAAACCCAAATTGACAAAGGATGCCACTAGAGAAAAGGTTGATTCAAGTCTATACAGAAGTATCATTGGTAGTTTACTC
TACTTAACAGCCAGCAGACCAGATATAGCTTTTACAGTAGGCGTGTGTGCTCGTTATCAGGCTGACCGTCGAACGTCCCATCTTCACAGTGCTAAACGCATACTGAAATA
TATAATAGGTACGTGTAATTTCGGTCTCTGGTATACTTTTGATACAACTGAAGTTCTTGTAGGGTACTGTGATGCTGATTGGGCCGTCTGTTCGGATGATCGGAAGAGCA
CATCTAGAGGGTGTTTTTTTCTTAAGAACAATATAGCTGCTTGGTTCAGCAAGAAACAAAACAGTGTGTCCTTATCAACTGCAGAAGCAAAATATATAGCAGTAGGGAGC
AGTTGTTCGCAACTTCTATGGATGAAACAGATGATGGAAGAATATGGTATACTTCAGTACTCTATGGTTCTCTACTGTGATAATATAAGCGTAATAAACATCTCCAAAAA
TCCAGTTCAACATAGCCGTACAAAGCGCATCGACATCCGTCATCATTTTATTCGGGAATTAGTGGAAGCTAATATTATTAGTTTAGAACATGTTAGAAGTTCATTGCAAC
TTGCCGATATTTTTACCAAGCCTTTGGATGTATCCACATTTGAAGGATTACGAGCAGGTATTAGGGTTTGTCAACAGCTCGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCATATATCGACAGGGTACTGAATTTTTTATAGTTCAGATTTATGTTGATGACATTATATTTGGTGGGACGTCCTCTACATATGTTGAACAGTTTGTTAATCAAAT
GAAGGGCGAATTTGAAATAAGCATGGTTGGTGAACTGACATTCTTTCTAGGATTTCAGATTAAGCAATACAAGACTAGAATTTTCTTTTCTCAAGAAAAATATGCAAAGA
ATCTCATCTCCAAGTTTGGAATGGATAAAGCCAAACCCAAATTGACAAAGGATGCCACTAGAGAAAAGGTTGATTCAAGTCTATACAGAAGTATCATTGGTAGTTTACTC
TACTTAACAGCCAGCAGACCAGATATAGCTTTTACAGTAGGCGTGTGTGCTCGTTATCAGGCTGACCGTCGAACGTCCCATCTTCACAGTGCTAAACGCATACTGAAATA
TATAATAGGTACGTGTAATTTCGGTCTCTGGTATACTTTTGATACAACTGAAGTTCTTGTAGGGTACTGTGATGCTGATTGGGCCGTCTGTTCGGATGATCGGAAGAGCA
CATCTAGAGGGTGTTTTTTTCTTAAGAACAATATAGCTGCTTGGTTCAGCAAGAAACAAAACAGTGTGTCCTTATCAACTGCAGAAGCAAAATATATAGCAGTAGGGAGC
AGTTGTTCGCAACTTCTATGGATGAAACAGATGATGGAAGAATATGGTATACTTCAGTACTCTATGGTTCTCTACTGTGATAATATAAGCGTAATAAACATCTCCAAAAA
TCCAGTTCAACATAGCCGTACAAAGCGCATCGACATCCGTCATCATTTTATTCGGGAATTAGTGGAAGCTAATATTATTAGTTTAGAACATGTTAGAAGTTCATTGCAAC
TTGCCGATATTTTTACCAAGCCTTTGGATGTATCCACATTTGAAGGATTACGAGCAGGTATTAGGGTTTGTCAACAGCTCGCATAG
Protein sequenceShow/hide protein sequence
MFIYRQGTEFFIVQIYVDDIIFGGTSSTYVEQFVNQMKGEFEISMVGELTFFLGFQIKQYKTRIFFSQEKYAKNLISKFGMDKAKPKLTKDATREKVDSSLYRSIIGSLL
YLTASRPDIAFTVGVCARYQADRRTSHLHSAKRILKYIIGTCNFGLWYTFDTTEVLVGYCDADWAVCSDDRKSTSRGCFFLKNNIAAWFSKKQNSVSLSTAEAKYIAVGS
SCSQLLWMKQMMEEYGILQYSMVLYCDNISVINISKNPVQHSRTKRIDIRHHFIRELVEANIISLEHVRSSLQLADIFTKPLDVSTFEGLRAGIRVCQQLA