; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0108721 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0108721
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPolyprotein, putative
Genome locationCMiso1.1chr04:28239072..28240055
RNA-Seq ExpressionCmc04g0108721
SyntenyCmc04g0108721
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABI34306.1 Polyprotein, putative [Solanum demissum]9.1e-10155.56Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG
        +I+ +N++   +GWW D+GA+ HVC++   F+KY   ++ K I+LGD HTT+V G G+VEL FTSG+ L LK+ L+TP +RK L+  +LLNKAGF Q I 
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG

Query:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPL
        SN + + K  +FVGKGYA DGMFKLN+E+NK ++S YML+S N WHARLCH+N R +  MS L LIP +   +FEKC  CS+AKITK  H  V R T+ L
Subjt:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPL

Query:  ELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEM
        EL+H+D+CE  G LTR   RY + FIDD S +T++YL+KNKSDA+E FK ++ E+ENQF ++IKR+ SDRG EY+S  FN F  S GIIHE T PYSP  
Subjt:  ELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEM

Query:  NGKEERKNRTLTELVVVILLESEA
        NG  ERKNRTL EL   +L+ES A
Subjt:  NGKEERKNRTLTELVVVILLESEA

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa]3.0e-18196.64Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
        MISKVNVIGGSEGWWLDTGAS HVCHELSLFRKYNEVKDKNILLGDHHTTKV GIGEVELKFTS KTLV+KEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS

Query:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE
        NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVT+PLE
Subjt:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE

Query:  LIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMN
        LIHSDLCEFDGTLTRNSKRYVV FIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRL SDRGTEYDSVAFNEFYNSKGIIHE T PYSPEMN
Subjt:  LIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMN

Query:  GKEERKNRTLTELVVVILLESEAAPSW
        GKEERKNRTLTEL V ILLESEAAPSW
Subjt:  GKEERKNRTLTELVVVILLESEAAPSW

KAA0046026.1 putative Polyprotein [Cucumis melo var. makuwa]2.3e-11293.15Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
        MI +VNVIGGSEGWWLDTGAS HVCH+LSLFRKYNEVKDKNILLGDHHTTKV GI EV+LKFTSGKTLVLKE LHTPEIRKNLV  YLLNKAGFTQTIGS
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS

Query:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE
        NLFTLTKNNV+VGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKIT+TSHK VT VT PLE
Subjt:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE

Query:  LIHSDLCEFDGTLTRNSKR
        LIHSDLCEFDGTLTRNSKR
Subjt:  LIHSDLCEFDGTLTRNSKR

KAA0055815.1 putative Polyprotein [Cucumis melo var. makuwa]3.6e-10589.5Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
        MI +VNVIG SEGWWLDTGAS HV H+LSLFRKYNEVKDKNILLGDHH TKV GIGEVELKFTSGKTLVLKE LHT E RKNLV GYLLNK G TQTIG 
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS

Query:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE
        +LFTLTKNNVFVGKGYATD MFKLNL+INKIASSAYMLT FN WHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCA CSQAKITKT HK VTRVTEPLE
Subjt:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE

Query:  LIHSDLCEFDGTLTRNSKR
        LIHSDLCEFDGTLTRNSKR
Subjt:  LIHSDLCEFDGTLTRNSKR

XP_021732277.1 uncharacterized protein LOC110699091 [Chenopodium quinoa]2.4e-10158.36Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNE-VKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG
        MI+++N+ GGS+GWW+DTGA+ HVC++  +F+ Y E   DK +LLGD H+T +AG+G VELKFTSG+TL+LK+ LHTPE+RKNLV G+LLNKAGF QTIG
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNE-VKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG

Query:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTS-FNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEP
        S+LFTLTKN +FVGKGYATDGMFKLN+E+NKI++SAYML S  NVWH RLCHVNKRLI NMS L LIP +SL+DF+KC  CSQAKITKT HK V R +EP
Subjt:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTS-FNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEP

Query:  LELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPE
        L+LIHSD+CE +GTLTRN +RY + FIDDCSDYT IYL+KNKSDA+EM                                                    
Subjt:  LELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPE

Query:  MNGKEERKNRTLTELVVVILLESEAAPSW
             ERKNRT TELVV I L S AA  W
Subjt:  MNGKEERKNRTLTELVVVILLESEAAPSW

TrEMBL top hitse value%identityAlignment
A0A5A7TV55 Putative Polyprotein1.1e-11293.15Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
        MI +VNVIGGSEGWWLDTGAS HVCH+LSLFRKYNEVKDKNILLGDHHTTKV GI EV+LKFTSGKTLVLKE LHTPEIRKNLV  YLLNKAGFTQTIGS
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS

Query:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE
        NLFTLTKNNV+VGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKIT+TSHK VT VT PLE
Subjt:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE

Query:  LIHSDLCEFDGTLTRNSKR
        LIHSDLCEFDGTLTRNSKR
Subjt:  LIHSDLCEFDGTLTRNSKR

A0A5A7UQC7 Putative Polyprotein1.7e-10589.5Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
        MI +VNVIG SEGWWLDTGAS HV H+LSLFRKYNEVKDKNILLGDHH TKV GIGEVELKFTSGKTLVLKE LHT E RKNLV GYLLNK G TQTIG 
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS

Query:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE
        +LFTLTKNNVFVGKGYATD MFKLNL+INKIASSAYMLT FN WHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCA CSQAKITKT HK VTRVTEPLE
Subjt:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE

Query:  LIHSDLCEFDGTLTRNSKR
        LIHSDLCEFDGTLTRNSKR
Subjt:  LIHSDLCEFDGTLTRNSKR

A0A5D3DCJ1 Putative Polyprotein1.5e-18196.64Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
        MISKVNVIGGSEGWWLDTGAS HVCHELSLFRKYNEVKDKNILLGDHHTTKV GIGEVELKFTS KTLV+KEGLHTPEIRKNLVFGYLLNKAGFTQTIGS
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGS

Query:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE
        NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVT+PLE
Subjt:  NLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLE

Query:  LIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMN
        LIHSDLCEFDGTLTRNSKRYVV FIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRL SDRGTEYDSVAFNEFYNSKGIIHE T PYSPEMN
Subjt:  LIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMN

Query:  GKEERKNRTLTELVVVILLESEAAPSW
        GKEERKNRTLTEL V ILLESEAAPSW
Subjt:  GKEERKNRTLTELVVVILLESEAAPSW

A0A7N2L531 Uncharacterized protein3.5e-10660.92Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEV-KDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG
        MI+ +N++   EGWW D+GA+ HVC++ + F+ Y    ++K ++LGD   TKV G GEVELKFTSG+ L LK+ L+TP +RKNL+  +LLNKAGF QT+ 
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEV-KDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG

Query:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEP
        S+ + +TK  +FVGKGYA DGMFKLN+E NK + SS YML+S N WHARLCH+N R +  MS L LIP+LS  DFEKC  CSQAKITK  HK V R TE 
Subjt:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEP

Query:  LELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPE
        LELIHSDLCEF+G LTR   RY++ FIDD S YT IYLLKNKSDA+E F+ F+ E+ENQF ++IKR+ SDRG EY+S AFN F  S GIIHE TAPYSP 
Subjt:  LELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPE

Query:  MNGKEERKNRTLTELVVVILLESEA
         NG  ERKNRTL EL   +L+ES A
Subjt:  MNGKEERKNRTLTELVVVILLESEA

A0A7N2R9F3 Uncharacterized protein3.8e-10560.62Show/hide
Query:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEV-KDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG
        +I+ +N++   EGWW D GA+ HVC++ + F+ Y    ++K I+LGD   TKV G GEVELKFTSG+ L LK+  +TP +RKNL+  +LLNKAGF QT+ 
Subjt:  MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEV-KDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIG

Query:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEP
        S+ + +TK  +FVGKGYA DGMFKLN+E NK + SS YML+S N WHARLCH+N R +  MS L LIP+LS  DFEKC  CSQAKITK  HK V R TE 
Subjt:  SNLFTLTKNNVFVGKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEP

Query:  LELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPE
        LELIHSDLCEF+G LTR   RY++ FIDD S YT IYLLKNKSDA+E F+ F+ E+ENQF ++IKR+ SDRG EY+S AFN F  S GIIHE TAPYSP 
Subjt:  LELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPE

Query:  MNGKEERKNRTLTELVVVILLESEA
         NG  ERKNRTL EL   +L+ES A
Subjt:  MNGKEERKNRTLTELVVVILLESEA

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-2931.19Show/hide
Query:  GWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFTLTKNNVFV
        G+ LD+GAS H+ ++ SL+    EV     +        +       ++  +   + L++ L   E   NL+    L +AG +     +  T++KN + V
Subjt:  GWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFTLTKNNVFV

Query:  GKGYATDGMFKLNLEINKIASS--AYMLTSFNVWHARLCHVN---------KRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLEL
         K     GM      IN  A S  A    +F +WH R  H++         K + S+ S LN + +LS    E C    QA++     K  T +  PL +
Subjt:  GKGYATDGMFKLNLEINKIASS--AYMLTSFNVWHARLCHVN---------KRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLEL

Query:  IHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMNG
        +HSD+C     +T + K Y VIF+D  + Y   YL+K KSD + MF+ FV + E  FN ++  L  D G EY S    +F   KGI +  T P++P++NG
Subjt:  IHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMNG

Query:  KEERKNRTLTE
          ER  RT+TE
Subjt:  KEERKNRTLTE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-4232.05Show/hide
Query:  VNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFT
        +++ G    W +DT AS H      LF +Y       + +G+   +K+AGIG++ +K   G TLVLK+  H P++R NL+ G  L++ G+     +  + 
Subjt:  VNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFT

Query:  LTKNNVFVGKGYATDGMFKLNLEI-NKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVT-RVTEPLELI
        LTK ++ + KG A   +++ N EI     ++A    S ++WH R+ H++++ +  +++ +LI        + C  C   K  + S +  + R    L+L+
Subjt:  LTKNNVFVGKGYATDGMFKLNLEI-NKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVT-RVTEPLELI

Query:  HSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMNGK
        +SD+C      +    +Y V FIDD S   ++Y+LK K   +++F+ F   +E +  +++KRL SD G EY S  F E+ +S GI HE T P +P+ NG 
Subjt:  HSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMNGK

Query:  EERKNRTLTELV
         ER NRT+ E V
Subjt:  EERKNRTLTELV

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.2e-1525.55Show/hide
Query:  LDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLV-FGYLLNK---AGFTQTI-----GSNLFTLT
        +D+GAS  +              + NI+        +  IG +   F +G    +K  LHTP I  +L+    L N+   A FT+       G+ L  + 
Subjt:  LDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLV-FGYLLNK---AGFTQTI-----GSNLFTLT

Query:  KNNVF--VGKGYATDGMFKLNLEINKI-ASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFE-------KCACCSQAKITKTSHKFVTRVT
        K+  F  + K Y         L IN +  S +     + + H  L H N R I    + N +  L   D E       +C  C   K TK  H   +R+ 
Subjt:  KNNVF--VGKGYATDGMFKLNLEINKI-ASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFE-------KCACCSQAKITKTSHKFVTRVT

Query:  -----EPLELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLL--KNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIH
             EP + +H+D+      L +++  Y + F D+ + + ++Y L  + +     +F   +  I+NQFN R+  +  DRG+EY +   ++F+ ++GI  
Subjt:  -----EPLELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLL--KNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIH

Query:  ENTAPYSPEMNGKEERKNRTL
          T       +G  ER NRTL
Subjt:  ENTAPYSPEMNGKEERKNRTL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.4e-2932.3Show/hide
Query:  SEGWWLDTGASSHVC---HELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGY-LLNKAGFTQTIGSNLFTLT
        S  W LD+GA+ H+    + LSL + Y    D  +++ D  T  ++  G   L  T  + L L   L+ P I KNL+  Y L N  G +       F + 
Subjt:  SEGWWLDTGASSHVC---HELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGY-LLNKAGFTQTIGSNLFTLT

Query:  KNNVFVG--KGYATDGMFKLNLEINK---IASSAYMLTSFNVWHARLCH----VNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVT-RVT
          N  V   +G   D +++  +  ++   + +S     + + WHARL H    +   +ISN S   L P    H F  C+ C   K  K      T   T
Subjt:  KNNVFVG--KGYATDGMFKLNLEINK---IASSAYMLTSFNVWHARLCH----VNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVT-RVT

Query:  EPLELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYS
         PLE I+SD+      L+ ++ RY VIF+D  + YT++Y LK KS   E F  F   +EN+F  RI    SD G E+  VA  E+++  GI H  + P++
Subjt:  EPLELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYS

Query:  PEMNGKEERKNRTLTELVVVIL
        PE NG  ERK+R + E  + +L
Subjt:  PEMNGKEERKNRTLTELVVVIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-2729.45Show/hide
Query:  VNVIGGSEGWWLDTGASSHVC---HELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLL---NKAGFTQTI
        VN    +  W LD+GA+ H+    + LS  + Y    D  +++ D  T  +   G   L  TS ++L L + L+ P I KNL+  Y L   N+       
Subjt:  VNVIGGSEGWWLDTGASSHVC---HELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLL---NKAGFTQTI

Query:  GSNLFTLTKNNVFVGKGYATDGMFKLNLEINKIAS---SAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLS-LHDFEKCACCSQAKITKT--SHKFV
         S         V + +G   D +++  +  ++  S   S     + + WH+RL H +  +++++   + +P L+  H    C+ C   K  K   S+  +
Subjt:  GSNLFTLTKNNVFVGKGYATDGMFKLNLEINKIAS---SAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLS-LHDFEKCACCSQAKITKT--SHKFV

Query:  TRVTEPLELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENT
        T  ++PLE I+SD+      L+ ++ RY VIF+D  + YT++Y LK KS   + F +F + +EN+F  RI  L SD G E+  V   ++ +  GI H  +
Subjt:  TRVTEPLELIHSDLCEFDGTLTRNSKRYVVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENT

Query:  APYSPEMNGKEERKNRTLTELVVVIL
         P++PE NG  ERK+R + E+ + +L
Subjt:  APYSPEMNGKEERKNRTLTELVVVIL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACCGGTGCATCCAGCCATGTCTGCCACGAACTTAGTCTTTTTAGAAAATATAATGAAGT
TAAGGATAAAAATATCCTTCTAGGAGATCATCACACAACCAAGGTGGCCGGTATTGGAGAAGTAGAACTGAAATTCACATCCGGCAAGACGCTTGTGCTGAAGGAAGGTC
TGCATACTCCAGAAATTCGAAAGAATTTGGTCTTCGGATATCTCCTCAACAAAGCTGGATTCACACAAACCATAGGATCAAACTTGTTTACTTTAACTAAAAACAATGTA
TTTGTAGGGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGCTAG
ACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGA
TAACTAAAACCTCACATAAGTTTGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCTGACTTATGTGAATTTGATGGCACTTTAACTAGAAACAGTAAAAGGTAT
GTAGTTATCTTTATAGATGACTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCTTATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCA
GTTTAACAAAAGAATTAAGAGACTTTGTAGTGATAGAGGAACTGAATATGATTCAGTTGCTTTCAATGAATTTTATAACTCAAAAGGAATAATACATGAAAATACTGCGC
CTTATTCTCCTGAAATGAATGGGAAAGAAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGTTATCTTACTTGAGTCAGAAGCAGCACCATCTTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACCGGTGCATCCAGCCATGTCTGCCACGAACTTAGTCTTTTTAGAAAATATAATGAAGT
TAAGGATAAAAATATCCTTCTAGGAGATCATCACACAACCAAGGTGGCCGGTATTGGAGAAGTAGAACTGAAATTCACATCCGGCAAGACGCTTGTGCTGAAGGAAGGTC
TGCATACTCCAGAAATTCGAAAGAATTTGGTCTTCGGATATCTCCTCAACAAAGCTGGATTCACACAAACCATAGGATCAAACTTGTTTACTTTAACTAAAAACAATGTA
TTTGTAGGGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGCTAG
ACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGA
TAACTAAAACCTCACATAAGTTTGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCTGACTTATGTGAATTTGATGGCACTTTAACTAGAAACAGTAAAAGGTAT
GTAGTTATCTTTATAGATGACTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCTTATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAACCA
GTTTAACAAAAGAATTAAGAGACTTTGTAGTGATAGAGGAACTGAATATGATTCAGTTGCTTTCAATGAATTTTATAACTCAAAAGGAATAATACATGAAAATACTGCGC
CTTATTCTCCTGAAATGAATGGGAAAGAAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGTTATCTTACTTGAGTCAGAAGCAGCACCATCTTGGTGA
Protein sequenceShow/hide protein sequence
MISKVNVIGGSEGWWLDTGASSHVCHELSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKTLVLKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFTLTKNNV
FVGKGYATDGMFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKFVTRVTEPLELIHSDLCEFDGTLTRNSKRY
VVIFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIENQFNKRIKRLCSDRGTEYDSVAFNEFYNSKGIIHENTAPYSPEMNGKEERKNRTLTELVVVILLESEAAPSW