CuGenDBv2

IVF0022620 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0022620
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr01:21089461..21090848
RNA-Seq Expression: IVF0022620
Synteny: IVF0022620
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
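The genome location field above uses a `chrom:start..end` layout. As a minimal sketch, the following Python splits that string and computes the span it covers. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr01:21089461..21090848")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```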


Homology
GenBank top hits | e value | % identity
TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa] | 2.40e-158 | 100
Query:  MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQ
        MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQ
Subjt:  MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQ

Query:  AKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLP
        AKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLP
Subjt:  AKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLP

Query:  REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR
        REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR
Subjt:  REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR
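In BLAST-style alignments like the one above, the unlabeled middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, percent identity for one alignment segment can be estimated from that midline alone. The function name `percent_identity` is hypothetical, and the sketch assumes the midline is padded to the full segment width; the values reported on this page are computed by BLAST over the whole alignment, so a single segment will generally not reproduce them.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline shows an exact match.

    Letters mark identical residues; '+' and spaces are counted as
    non-identical columns. Assumes the midline spans the full segment.
    """
    if not midline:
        raise ValueError("empty midline")
    matches = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * matches / len(midline)


# Midline of the last segment of the TYK17989.1 alignment (all identical).
print(percent_identity("REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR"))
```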

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] | 3.62e-113 | 52.34
Query:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM
        SSS  K+  SP+FLL+NICNLIS+RLDSTN+ LWKFQ   +LKAHKLYGFID + P PP+T  N ++++S+   Q         SNP YEDW AKDQA M
Subjt:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM

Query:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR
         +INATLS EAL YVVG  SS QVW  L + YSS +R+N+VNLKSDLQ I KKPDE I  YIK+IKE+KDKLAN +T + +EDL+IYALNGLP EYN FR
Subjt:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR

Query:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRLF---IQC
        TSM+TRSQPV+F ELH+LL++EESAL KQ+K                              RG G G++ G GR +F+ Q RG G S     +      C
Subjt:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRLF---IQC

Query:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG
        Q+C+R GH+  DC+NRMNYN+QG+HPPQ          N +L+  N  S   L DSGCN  +T+D    S+A E+NGE+ + +G
Subjt:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] | 2.16e-113 | 52.34
Query:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM
        SSS  K+  SP+FLL+NICNLIS+RLDSTN+ LWKFQ   +LKAHKLYGFID + P PP+T  N ++++S+   Q         SNP YEDW AKDQA M
Subjt:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM

Query:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR
         +INATLS EAL YVVG  SS QVW  L + YSS +R+N+VNLKSDLQ I KKPDE I  YIK+IKE+KDKLAN +T + +EDL+IYALNGLP EYN FR
Subjt:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR

Query:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRLF---IQC
        TSM+TRSQPV+F ELH+LL++EESAL KQ+K                              RG G G++ G GR +F+ Q RG G S     +      C
Subjt:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRLF---IQC

Query:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG
        Q+C+R GH+  DC+NRMNYN+QG+HPPQ          N +L+  N  S   L DSGCN  +T+D    S+A E+NGE+ + +G
Subjt:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo] | 5.15e-113 | 52.34
Query:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM
        SSS  K+  SP+FLL+NICNLIS+RLDSTN+ LWKFQ   +LKAHKLYGFID + P PP+T  N ++++S+   Q         SNP YEDW AKDQA M
Subjt:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM

Query:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR
         +INATLS EAL YVVG  SS QVW  L + YSS +R+N+VNLKSDLQ I KKPDE I  YIK+IKE+KDKLAN +T + +EDL+IYALNGLP EYN FR
Subjt:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR

Query:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRLF---IQC
        TSM+TRSQPV+F ELH+LL++EESAL KQ+K                              RG G G++ G GR +F+ Q RG G S     +      C
Subjt:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRLF---IQC

Query:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG
        Q+C+R GH+  DC+NRMNYN+QG+HPPQ          N +L+  N  S   L DSGCN  +T+D    S+A E+NGE+ + +G
Subjt:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] | 3.36e-116 | 51.27
Query:  SSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQA
        SSS++T K+L SP+FLL+NICNL+SIRLDST++ LWKFQ   +LKAHKL+GFID S+  P + +++    SS   +Q T++++  + NP +EDW AKDQA
Subjt:  SSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQA

Query:  FMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNA
         M LINATLS EAL YVV   +S QVW+ LE+HYSSN+RTN+VNLKSDLQ I KK +E I  Y+K+IKE+KDK AN +  + DE L+IYALNGL  EYN 
Subjt:  FMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNA

Query:  FRTSMQTRSQPVSFSELHILLKSEESALEKQTKRRG-----------------------------RGRGRNTGRGRSNF-----NQGRGRGQSDFGNRLF
          TSM+TR+Q VSF ELH+ +KSEESA+EKQ KR                               RGRG+N GRG++NF     NQGRGR   +F     
Subjt:  FRTSMQTRSQPVSFSELHILLKSEESALEKQTKRRG-----------------------------RGRGRNTGRGRSNF-----NQGRGRGQSDFGNRLF

Query:  IQ----CQVCNRPGHS--DCYNRMNYNYQGKHPP----------QNLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIAN---EHNGEDSISVG
              CQ+C + GH+  DCYNRMN+++QG+HPP           N YL   N    TWLADS CN H+TAD  N SIA+   ++NGE++ISVG
Subjt:  IQ----CQVCNRPGHS--DCYNRMNYNYQGKHPP----------QNLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIAN---EHNGEDSISVG

TrEMBL top hits | e value | % identity
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 | 4.5e-91 | 52.34
Query:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM
        SSS  K+  SP+FLL+NICNLIS+RLDSTN+ LWKFQ   +LKAHKLYGFID + P PP+T       ++S++T T        SNP YEDW AKDQA M
Subjt:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM

Query:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR
         +INATLS EAL YVVG  SS QVW  L + YSS +R+N+VNLKSDLQ I KKPDE I  YIK+IKE+KDKLAN +T + +EDL+IYALNGLP EYN FR
Subjt:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR

Query:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRL---FIQC
        TSM+TRSQPV+F ELH+LL++EESAL KQ+K                              RG G G++ G GR +F+ Q RG G S     +      C
Subjt:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRL---FIQC

Query:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG
        Q+C+R GH+  DC+NRMNYN+QG+HPPQ          N +L+  N  S   L DSGCN  +T+D    S+A E+NGE+ + +G
Subjt:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 | 4.5e-91 | 52.34
Query:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM
        SSS  K+  SP+FLL+NICNLIS+RLDSTN+ LWKFQ   +LKAHKLYGFID + P PP+T       ++S++T T        SNP YEDW AKDQA M
Subjt:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM

Query:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR
         +INATLS EAL YVVG  SS QVW  L + YSS +R+N+VNLKSDLQ I KKPDE I  YIK+IKE+KDKLAN +T + +EDL+IYALNGLP EYN FR
Subjt:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR

Query:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRL---FIQC
        TSM+TRSQPV+F ELH+LL++EESAL KQ+K                              RG G G++ G GR +F+ Q RG G S     +      C
Subjt:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRL---FIQC

Query:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG
        Q+C+R GH+  DC+NRMNYN+QG+HPPQ          N +L+  N  S   L DSGCN  +T+D    S+A E+NGE+ + +G
Subjt:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG

A0A5D3CLI6 T4.5 | 4.5e-91 | 52.34
Query:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM
        SSS  K+  SP+FLL+NICNLIS+RLDSTN+ LWKFQ   +LKAHKLYGFID + P PP+T       ++S++T T        SNP YEDW AKDQA M
Subjt:  SSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFM

Query:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR
         +INATLS EAL YVVG  SS QVW  L + YSS +R+N+VNLKSDLQ I KKPDE I  YIK+IKE+KDKLAN +T + +EDL+IYALNGLP EYN FR
Subjt:  FLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFR

Query:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRL---FIQC
        TSM+TRSQPV+F ELH+LL++EESAL KQ+K                              RG G G++ G GR +F+ Q RG G S     +      C
Subjt:  TSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNFN-QGRGRGQSDFGNRL---FIQC

Query:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG
        Q+C+R GH+  DC+NRMNYN+QG+HPPQ          N +L+  N  S   L DSGCN  +T+D    S+A E+NGE+ + +G
Subjt:  QVCNRPGHS--DCYNRMNYNYQGKHPPQ----------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISVG

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein | 6.9e-124 | 100
Query:  MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQ
        MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQ
Subjt:  MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQ

Query:  AKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLP
        AKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLP
Subjt:  AKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLP

Query:  REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR
        REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR
Subjt:  REYNAFRTSMQTRSQPVSFSELHILLKSEESALEKQTKR

A0A6J1D9L6 uncharacterized protein LOC111018892 | 7.4e-94 | 51.52
Query:  SSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQA
        SSS++T K+L SP+FLL+NICNL+SIRLDST++ LWKFQ   +LKAHKL+GFID S+  P + ++    +SS   +Q T++++  + NP +EDW AKDQA
Subjt:  SSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQA

Query:  FMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNA
         M LINATLS EAL YVV   +S QVW+ LE+HYSSN+RTN+VNLKSDLQ I KK +E I  Y+K+IKE+KDK AN +  + DE L+IYALNGL  EYN 
Subjt:  FMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNA

Query:  FRTSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNF-----NQGRGRGQSDF-----
          TSM+TR+Q VSF ELH+ +KSEESA+EKQ KR                               RGRG+N GRG++NF     NQGRGR   +F     
Subjt:  FRTSMQTRSQPVSFSELHILLKSEESALEKQTKR-----------------------------RGRGRGRNTGRGRSNF-----NQGRGRGQSDF-----

Query:  -GNRLFIQCQVCNRPGHS--DCYNRMNYNYQGKHPP----------QNLYLNTHNYPSNTWLADSGCNAHVTADFGNF---SIANEHNGEDSISVG
          NR    CQ+C + GH+  DCYNRMN+++QG+HPP           N YL   N    TWLADS CN H+TAD  N    SIA+++NGE++ISVG
Subjt:  -GNRLFIQCQVCNRPGHS--DCYNRMNYNYQGKHPP----------QNLYLNTHNYPSNTWLADSGCNAHVTADFGNF---SIANEHNGEDSISVG

SwissProt top hits | e value | % identity
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 8.4e-18 | 24.73
Query:  LSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLINATL
        L++   L  N+ N+   +L STNY +W  Q   +   ++L GF+D S   PP TI   T A+                NP Y  W+ +D+     +   +
Subjt:  LSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLINATL

Query:  SVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRS
        S+     V    +++Q+W+TL + Y++ +  ++  L++ L+Q + K  + I  Y++ +    D+LA     ++ ++ V   L  LP EY      +  + 
Subjt:  SVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRS

Query:  QPVSFSELHILLKSEESAL--------------------EKQTKRRGRGRGRNTGRGRSNFNQGRGRGQSDFG-------NRLFI-QCQVCNRPGHS--D
         P + +E+H  L + ES +                       T     G   N    R+N N  +   QS          ++ ++ +CQ+C   GHS   
Subjt:  QPVSFSELHILLKSEESAL--------------------EKQTKRRGRGRGRNTGRGRSNFNQGRGRGQSDFG-------NRLFI-QCQVCNRPGHS--D

Query:  CYNRMNY--NYQGKHPPQ---------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISV
        C    ++  +   + PP          NL L +  Y SN WL DSG   H+T+DF N S+   + G D + V
Subjt:  CYNRMNY--NYQGKHPPQ---------NLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.7e-16 | 23.84
Query:  TNICNLIS---IRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLINATLSVEAL
        TNI N+      +L STNY +W  Q   +   ++L GF+D S P PP TI                +      NP Y  W+ +D+     I   +S+   
Subjt:  TNICNLIS---IRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLINATLSVEAL

Query:  TYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRSQPVSF
          V    +++Q+W+TL + Y++ +  ++  L+                +I +     D+LA     ++ ++ V   L  LP +Y      +  +  P S 
Subjt:  TYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRSQPVSF

Query:  SELHILLKSEESAL-----------------------EKQTKRRGRGRGRNTGRGRSNFNQGRGRGQSDFGNR---LFIQCQVCNRPGHSDCYNRMNYNY
        +E+H  L + ES L                        +    RG  R  N    RSN  Q    G      +      +CQ+C+  GHS       + +
Subjt:  SELHILLKSEESAL-----------------------EKQTKRRGRGRGRNTGRGRSNFNQGRGRGQSDFGNR---LFIQCQVCNRPGHSDCYNRMNYNY

Query:  QGK-------------HPPQNLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISV
        Q                P  NL +N+  Y +N WL DSG   H+T+DF N S    + G D + +
Subjt:  QGK-------------HPPQNLYLNTHNYPSNTWLADSGCNAHVTADFGNFSIANEHNGEDSISV

Arabidopsis top hits | e value | % identity
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 1.9e-04 | 21.69
Query:  EASSSSSTSKELSSPLFLLTNI-----CNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYED
        E   S S + +  SP +L  +I      ++  +  D  NY  WK +F   L+  K +GFID ++P P                        +  +P Y+ 
Subjt:  EASSSSSTSKELSSPLFLLTNI-----CNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYED

Query:  WQAKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVE
        W+  +   M+ +  +++ + L  V+  +++ ++W+ L R +       I  L+  L  + +  D  +  Y  K+ +V  +L+  A I E
Subjt:  WQAKDQAFMFLINATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVE

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 2.4e-04 | 22.96
Query:  LFLLTNICNLISIRLD--STNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLINATLSV
        ++ ++NI + I + LD   +NY  W+  F     +  + G ID ++         PT A+                     +WQ +D      +  TL+ 
Subjt:  LFLLTNICNLISIRLD--STNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLINATLSV

Query:  EALT-YVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRSQ
        +      V   +S  +W  ++  + +N     + L S+L +     D  +  Y +K+K++ D L N    V D +LV+Y LNGL  +++     ++ R  
Subjt:  EALT-YVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRSQ

Query:  PVSFSELHILLKSEESALEKQTKRRGRGRGRN--------------TGRGRSNFNQ--GRGRGQSDFGNRLFIQCQVCNRPGHSDCYNRMNYNYQGKHPP
          SF +   +L+ EE  L++  K        +              T   RS  NQ   RGRG+   GN +F       R G    YN   +N   + P 
Subjt:  PVSFSELHILLKSEESALEKQTKRRGRGRGRN--------------TGRGRSNFNQ--GRGRGQSDFGNRLFIQCQVCNRPGHSDCYNRMNYNYQGKHPP

Query:  QNLYLNTHNYPSNTWLADSGCNAHVTADFGN
           Y N++   ++ W    G   +V  + GN
Subjt:  QNLYLNTHNYPSNTWLADSGCNAHVTADFGN


Sequences
CDS sequence
ATGGCAACCGAAGCAAGTTCTTCTTCTTCAACCTCCAAAGAACTATCATCACCCCTCTTCCTTCTAACAAATATATGCAACCTAATATCCATCCGACTTGATTCAACCAA
CTACACCCTCTGGAAGTTCCAATTCGAACCTATGCTGAAAGCACACAAGTTGTATGGCTTCATTGACGAATCCATTCCCACACCTCCGAAGACGATTTCGAACCCCACAA
CTGCAAGTTCCTCAGCAACTACTCAGACAACTTCGTCTTCAACAACCGAAATCAGTAACCCTCCGTACGAAGATTGGCAAGCAAAGGACCAAGCGTTCATGTTCCTCATC
AACGCTACCCTATCTGTTGAAGCACTAACATATGTAGTTGGATGCAAATCTTCAAGTCAAGTATGGAAAACACTTGAAAGGCACTACTCTTCAAATACCAGGACAAATAT
TGTCAACCTAAAATCAGATTTACAACAAATCTCAAAGAAACCAGATGAACCAATTGGTTTGTATATCAAAAAAATCAAAGAAGTGAAAGACAAATTAGCAAATGCAGCAA
CTATTGTTGAAGATGAAGATCTCGTAATTTATGCCCTAAATGGCCTTCCAAGAGAATACAATGCCTTTCGCACCTCAATGCAAACAAGATCTCAACCGGTAAGTTTCTCT
GAACTCCATATCCTCCTAAAGTCTGAAGAATCTGCACTAGAAAAACAAACCAAACGAAGAGGAAGAGGAAGAGGCCGTAACACTGGTAGAGGCCGATCAAACTTCAACCA
AGGGAGAGGTCGTGGACAATCTGATTTTGGAAACCGTCTGTTCATTCAATGCCAAGTATGTAATCGACCTGGTCATTCTGATTGCTACAACAGAATGAACTACAATTATC
AAGGAAAGCACCCCCCTCAAAATCTTTATCTTAACACTCATAACTATCCTTCAAATACTTGGCTTGCAGATTCAGGATGCAATGCTCACGTTACTGCAGACTTTGGAAAC
TTCTCCATAGCAAATGAACATAATGGTGAAGATAGCATCTCTGTTGGCAGACAAAACTATTGTCAACCTTTTATTCCAAGACCCTAA
mRNA sequence
ATGGCAACCGAAGCAAGTTCTTCTTCTTCAACCTCCAAAGAACTATCATCACCCCTCTTCCTTCTAACAAATATATGCAACCTAATATCCATCCGACTTGATTCAACCAA
CTACACCCTCTGGAAGTTCCAATTCGAACCTATGCTGAAAGCACACAAGTTGTATGGCTTCATTGACGAATCCATTCCCACACCTCCGAAGACGATTTCGAACCCCACAA
CTGCAAGTTCCTCAGCAACTACTCAGACAACTTCGTCTTCAACAACCGAAATCAGTAACCCTCCGTACGAAGATTGGCAAGCAAAGGACCAAGCGTTCATGTTCCTCATC
AACGCTACCCTATCTGTTGAAGCACTAACATATGTAGTTGGATGCAAATCTTCAAGTCAAGTATGGAAAACACTTGAAAGGCACTACTCTTCAAATACCAGGACAAATAT
TGTCAACCTAAAATCAGATTTACAACAAATCTCAAAGAAACCAGATGAACCAATTGGTTTGTATATCAAAAAAATCAAAGAAGTGAAAGACAAATTAGCAAATGCAGCAA
CTATTGTTGAAGATGAAGATCTCGTAATTTATGCCCTAAATGGCCTTCCAAGAGAATACAATGCCTTTCGCACCTCAATGCAAACAAGATCTCAACCGGTAAGTTTCTCT
GAACTCCATATCCTCCTAAAGTCTGAAGAATCTGCACTAGAAAAACAAACCAAACGAAGAGGAAGAGGAAGAGGCCGTAACACTGGTAGAGGCCGATCAAACTTCAACCA
AGGGAGAGGTCGTGGACAATCTGATTTTGGAAACCGTCTGTTCATTCAATGCCAAGTATGTAATCGACCTGGTCATTCTGATTGCTACAACAGAATGAACTACAATTATC
AAGGAAAGCACCCCCCTCAAAATCTTTATCTTAACACTCATAACTATCCTTCAAATACTTGGCTTGCAGATTCAGGATGCAATGCTCACGTTACTGCAGACTTTGGAAAC
TTCTCCATAGCAAATGAACATAATGGTGAAGATAGCATCTCTGTTGGCAGACAAAACTATTGTCAACCTTTTATTCCAAGACCCTAA
Protein sequence
MATEASSSSSTSKELSSPLFLLTNICNLISIRLDSTNYTLWKFQFEPMLKAHKLYGFIDESIPTPPKTISNPTTASSSATTQTTSSSTTEISNPPYEDWQAKDQAFMFLI
NATLSVEALTYVVGCKSSSQVWKTLERHYSSNTRTNIVNLKSDLQQISKKPDEPIGLYIKKIKEVKDKLANAATIVEDEDLVIYALNGLPREYNAFRTSMQTRSQPVSFS
ELHILLKSEESALEKQTKRRGRGRGRNTGRGRSNFNQGRGRGQSDFGNRLFIQCQVCNRPGHSDCYNRMNYNYQGKHPPQNLYLNTHNYPSNTWLADSGCNAHVTADFGN
FSIANEHNGEDSISVGRQNYCQPFIPRP
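
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following Python is a minimal sketch of that translation, assuming the standard nuclear genetic code (the record does not name a translation table); `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. A codon containing a
    character other than A, C, G, or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS above.
print(translate("ATGGCAACCGAAGCAAGTTCT"))  # MATEASS
```

Passing the full CDS text, line breaks included, should give a string that can be compared directly with the protein sequence listed in this record.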