; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0226211 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0226211
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr08:17890317..17891072
RNA-Seq ExpressionCmc08g0226211
SyntenyCmc08g0226211
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035790.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]6.5e-12397.84Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
        IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA

Query:  VGICARFQSDPRTSHLTVVKRILKYVHGTSD
        VGICARFQSDPRTSHLTVVKRILKY     D
Subjt:  VGICARFQSDPRTSHLTVVKRILKYVHGTSD

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.9e-10877.87Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        M VKSAFLNGYLNEEV+V QPKGFVD EFP +VYKLNKALY LKQAP+AWYERLT+YLG++ YSR  T KTLFINRTS+ LI+AQIYVDDIIFG FPK L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + NFINIMKSEFEMS+VGELSCFLGLQIKQRS  +FISQEKY KNLVKKFGLDQ QHKRT   TH K+TKD+ GT VDHKLYRS++GSLLYL  S+ DIA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        Y VGICAR+QSD RTSHL  +KRI+KYVHGT+DFGILYS++T+S LVGY +AD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

KAA0054623.1 putative mitochondrial protein [Cucumis melo var. makuwa]6.9e-10978.26Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEV+VAQPK F+DSEFPH +YKLNKALYGLKQAP  WYE+LT+YLG++GYSR  T KTL INRTS++LI+AQIYVDDIIFG FPK L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + NFINIM SEFEMS+VGELSCFLGLQIKQRS  +F+SQEKY KNLVKKFGLDQ Q+KRT  ATH K+TKD  GT VDHKLYRS++GS LYL AS+ DIA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        YAV ICAR+QSDP TSHL  VKRI+KYVHGT+DFGILYS++T S LVGYCDAD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

TYJ98295.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.7e-11279.45Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEV+VAQPKGF+DSEFP +VYK+NKALYGLKQAPRAWYERL IYL ++GYS+  T KTLFINRTS++LI+AQIYVDDIIFG FPK L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + NFINI+KSEFE+S+VG+LS FLGLQIKQRS  +FISQEKY KNLVKKFGLDQ Q+KRT  ATHVK+TKD  GT +DHKLYRS++GSLLYL AS+ DIA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        YAVGICAR+QSDPRTSHL  VKRI+KYVHGT+DFGILYS++T+S LVGYCDAD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

TYK29824.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]5.6e-13597.61Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEVFVAQPKGFVDSEFP HVYKLNKALYGLKQAPRAWY+RLTIYLGDKGYSR+GTSKTLFINRTSSELIIAQIYVDDIIFG FPKA 
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
        IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA

Query:  VGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        VGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSF+TTSTLVGYCDAD
Subjt:  VGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

TrEMBL top hitse value%identityAlignment
A0A5A7SXM5 Putative gag-pol polyprotein3.1e-12397.84Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
        IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA

Query:  VGICARFQSDPRTSHLTVVKRILKYVHGTSD
        VGICARFQSDPRTSHLTVVKRILKY     D
Subjt:  VGICARFQSDPRTSHLTVVKRILKYVHGTSD

A0A5A7UF87 Putative mitochondrial protein3.4e-10978.26Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEV+VAQPK F+DSEFPH +YKLNKALYGLKQAP  WYE+LT+YLG++GYSR  T KTL INRTS++LI+AQIYVDDIIFG FPK L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + NFINIM SEFEMS+VGELSCFLGLQIKQRS  +F+SQEKY KNLVKKFGLDQ Q+KRT  ATH K+TKD  GT VDHKLYRS++GS LYL AS+ DIA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        YAV ICAR+QSDP TSHL  VKRI+KYVHGT+DFGILYS++T S LVGYCDAD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

A0A5D3BIP9 Gag-pol polyprotein3.2e-11279.45Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEV+VAQPKGF+DSEFP +VYK+NKALYGLKQAPRAWYERL IYL ++GYS+  T KTLFINRTS++LI+AQIYVDDIIFG FPK L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + NFINI+KSEFE+S+VG+LS FLGLQIKQRS  +FISQEKY KNLVKKFGLDQ Q+KRT  ATHVK+TKD  GT +DHKLYRS++GSLLYL AS+ DIA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        YAVGICAR+QSDPRTSHL  VKRI+KYVHGT+DFGILYS++T+S LVGYCDAD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

A0A5D3CS19 Gag-pol polyprotein2.8e-10877.87Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        M VKSAFLNGYLNEEV+V QPKGFVD EFP +VYKLNKALY LKQAP+AWYERLT+YLG++ YSR  T KTLFINRTS+ LI+AQIYVDDIIFG FPK L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + NFINIMKSEFEMS+VGELSCFLGLQIKQRS  +FISQEKY KNLVKKFGLDQ QHKRT   TH K+TKD+ GT VDHKLYRS++GSLLYL  S+ DIA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        Y VGICAR+QSD RTSHL  +KRI+KYVHGT+DFGILYS++T+S LVGY +AD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

A0A5D3E1T0 Putative gag-pol polyprotein2.7e-13597.61Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDVKSAFLNGYLNEEVFVAQPKGFVDSEFP HVYKLNKALYGLKQAPRAWY+RLTIYLGDKGYSR+GTSKTLFINRTSSELIIAQIYVDDIIFG FPKA 
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
        IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYA

Query:  VGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        VGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSF+TTSTLVGYCDAD
Subjt:  VGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-3131.78Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFI--NRTSSELIIAQIYVDDIIFGVFPK
        MDVK+AFLNG L EE+++  P+G   S    +V KLNKA+YGLKQA R W+E     L +  +  +   + ++I      +E I   +YVDD++      
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFI--NRTSSELIIAQIYVDDIIFGVFPK

Query:  ALIGNFINIMKSEFEMSMVGELSCFLGLQIKQR--SIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLM-ASKL
          + NF   +  +F M+ + E+  F+G++I+ +   I++SQ  YVK ++ KF ++ C    T + + +   + +N     +   RSL+G L+Y+M  ++ 
Subjt:  ALIGNFINIMKSEFEMSMVGELSCFLGLQIKQR--SIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLM-ASKL

Query:  DIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTT--STLVGYCDAD
        D+  AV I +R+ S   +     +KR+L+Y+ GT D  +++  N    + ++GY D+D
Subjt:  DIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTT--STLVGYCDAD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.9e-3936.12Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTS-SELIIAQIYVDDIIFGVFPKA
        +DVK+AFL+G L EE+++ QP+GF  +   H V KLNK+LYGLKQAPR WY +   ++  + Y +  +   ++  R S +  II  +YVDD++     K 
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTS-SELIIAQIYVDDIIFGVFPKA

Query:  LIGNFINIMKSEFEMSMVGELSCFLGLQI----KQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHK------LYRSLVGSLLY
        LI      +   F+M  +G     LG++I      R +++SQEKY++ ++++F +   +   T +A H+K++K +  T V+ K       Y S VGSL+Y
Subjt:  LIGNFINIMKSEFEMSMVGELSCFLGLQI----KQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHK------LYRSLVGSLLY

Query:  LM-ASKLDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
         M  ++ DIA+AVG+ +RF  +P   H   VK IL+Y+ GT+   + +   +   L GY DAD
Subjt:  LM-ASKLDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

P25600 Putative transposon Ty5-1 protein YCL074W3.3e-2930.31Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        MDV +AFLN  ++E ++V QP GFV+   P +V++L   +YGLKQAP  W E +   L   G+ R+     L+   TS   I   +YVDD++       +
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS---IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLM-ASKLD
               +   + M  +G++  FLGL I Q S   I +S + Y+     +  ++  +  +T +     + +  +  + D   Y+S+VG LL+     + D
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQRS---IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLM-ASKLD

Query:  IAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDA
        I+Y V + +RF  +PR  HL   +R+L+Y++ T    + Y   +   L  YCDA
Subjt:  IAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.2e-4035.97Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        +DV +AFL G L ++V+++QP GF+D + P++V KL KALYGLKQAPRAWY  L  YL   G+  + +  +LF+ +    ++   +YVDDI+       L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQ--RSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + N ++ +   F +    EL  FLG++ K+    + +SQ +Y+ +L+ +  +   +   T +A   K++      + D   YR +VGSL YL  ++ DI+
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQ--RSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        YAV   ++F   P   HL  +KRIL+Y+ GT + GI      T +L  Y DAD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-3935.18Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL
        +DV +AFL G L +EV+++QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+  + +  +LF+ +    +I   +YVDDI+       L
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKAL

Query:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQ--RSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA
        + + ++ +   F +    +L  FLG++ K+  + + +SQ +Y  +L+ +  +   +   T +AT  K+T      + D   YR +VGSL YL  ++ D++
Subjt:  IGNFINIMKSEFEMSMVGELSCFLGLQIKQ--RSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIA

Query:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        YAV   +++   P   H   +KR+L+Y+ GT D GI      T +L  Y DAD
Subjt:  YAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-3634.77Show/hide
Query:  MDVKSAFLNGYLNEEVFVAQPKGFV----DSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVF
        +D+ +AFLNG L+EE+++  P G+     DS  P+ V  L K++YGLKQA R W+ + ++ L   G+ ++ +  T F+  T++  +   +YVDDII    
Subjt:  MDVKSAFLNGYLNEEVFVAQPKGFV----DSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVF

Query:  PKALIGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASK
          A +    + +KS F++  +G L  FLGL+I + +  I I Q KY  +L+ + GL  C+     +   V  +    G  VD K YR L+G L+YL  ++
Subjt:  PKALIGNFINIMKSEFEMSMVGELSCFLGLQIKQRS--IFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASK

Query:  LDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDA
        LDI++AV   ++F   PR +H   V +IL Y+ GT   G+ YS      L  + DA
Subjt:  LDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDA

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.0e-0432.81Show/hide
Query:  LYLMASKLDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        +YL  ++ D+ +AV   ++F S  RT+ +  V ++L YV GT   G+ YS  +   L  + D+D
Subjt:  LYLMASKLDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD

ATMG00810.1 DNA/RNA polymerases superfamily protein6.0e-1831.95Show/hide
Query:  IYVDDIIFGVFPKALIGNFINIMKSEFEMSMVGELSCFLGLQIKQR--SIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDIN-GTVVDHKLYRS
        +YVDDI+       L+   I  + S F M  +G +  FLG+QIK     +F+SQ KY + ++   G+  C  K  S    +K+   ++     D   +RS
Subjt:  IYVDDIIFGVFPKALIGNFINIMKSEFEMSMVGELSCFLGLQIKQR--SIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDIN-GTVVDHKLYRS

Query:  LVGSLLYLMASKLDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD
        +VG+L YL  ++ DI+YAV I  +   +P  +   ++KR+L+YV GT   G+    N+   +  +CD+D
Subjt:  LVGSLLYLMASKLDIAYAVGICARFQSDPRTSHLTVVKRILKYVHGTSDFGILYSFNTTSTLVGYCDAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCAAAAGTGCCTTTTTAAATGGTTACTTGAATGAGGAAGTCTTTGTAGCTCAACCAAAAGGGTTTGTTGATTCTGAATTTCCTCATCATGTTTACAAGCTCAA
TAAAGCTTTGTATGGGTTAAAGCAAGCTCCTCGAGCTTGGTATGAACGCTTAACAATCTATCTGGGTGATAAAGGATACTCTAGAAATGGAACTAGTAAGACATTATTTA
TTAATAGAACCAGCAGTGAGCTCATTATAGCACAGATTTATGTTGATGATATTATATTTGGGGTATTTCCCAAGGCACTTATTGGTAACTTCATTAACATAATGAAATCA
GAATTTGAAATGAGCATGGTAGGAGAACTTTCTTGTTTTCTAGGTCTACAGATCAAACAAAGAAGTATATTTATATCTCAAGAGAAGTATGTCAAGAACTTAGTCAAAAA
ATTTGGTCTGGATCAGTGTCAACATAAAAGGACTTCAGTGGCGACACATGTTAAAGTTACTAAAGACATTAATGGTACAGTAGTAGATCACAAACTGTATAGAAGCTTGG
TTGGGAGTCTTCTATATTTAATGGCAAGCAAACTTGACATTGCCTATGCTGTTGGAATATGTGCTCGATTTCAGTCAGATCCTCGTACTTCTCATTTGACCGTTGTTAAA
CGGATTCTCAAATATGTACATGGGACGAGTGACTTTGGAATTTTGTATTCATTTAACACGACTTCTACCTTGGTTGGATATTGTGATGCTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCAAAAGTGCCTTTTTAAATGGTTACTTGAATGAGGAAGTCTTTGTAGCTCAACCAAAAGGGTTTGTTGATTCTGAATTTCCTCATCATGTTTACAAGCTCAA
TAAAGCTTTGTATGGGTTAAAGCAAGCTCCTCGAGCTTGGTATGAACGCTTAACAATCTATCTGGGTGATAAAGGATACTCTAGAAATGGAACTAGTAAGACATTATTTA
TTAATAGAACCAGCAGTGAGCTCATTATAGCACAGATTTATGTTGATGATATTATATTTGGGGTATTTCCCAAGGCACTTATTGGTAACTTCATTAACATAATGAAATCA
GAATTTGAAATGAGCATGGTAGGAGAACTTTCTTGTTTTCTAGGTCTACAGATCAAACAAAGAAGTATATTTATATCTCAAGAGAAGTATGTCAAGAACTTAGTCAAAAA
ATTTGGTCTGGATCAGTGTCAACATAAAAGGACTTCAGTGGCGACACATGTTAAAGTTACTAAAGACATTAATGGTACAGTAGTAGATCACAAACTGTATAGAAGCTTGG
TTGGGAGTCTTCTATATTTAATGGCAAGCAAACTTGACATTGCCTATGCTGTTGGAATATGTGCTCGATTTCAGTCAGATCCTCGTACTTCTCATTTGACCGTTGTTAAA
CGGATTCTCAAATATGTACATGGGACGAGTGACTTTGGAATTTTGTATTCATTTAACACGACTTCTACCTTGGTTGGATATTGTGATGCTGATTAG
Protein sequenceShow/hide protein sequence
MDVKSAFLNGYLNEEVFVAQPKGFVDSEFPHHVYKLNKALYGLKQAPRAWYERLTIYLGDKGYSRNGTSKTLFINRTSSELIIAQIYVDDIIFGVFPKALIGNFINIMKS
EFEMSMVGELSCFLGLQIKQRSIFISQEKYVKNLVKKFGLDQCQHKRTSVATHVKVTKDINGTVVDHKLYRSLVGSLLYLMASKLDIAYAVGICARFQSDPRTSHLTVVK
RILKYVHGTSDFGILYSFNTTSTLVGYCDAD