; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027841 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027841
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:5928987..5932417
RNA-Seq ExpressionLag0027841
SyntenyLag0027841
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN77126.1 hypothetical protein VITISV_013628 [Vitis vinifera]4.9e-3736.65Show/hide
Query:  MESFIDG-TPPPSRFLDATQTQSSS------------TARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFV
        +E FI+G TP P++FLD  Q Q +              + I    +   KIKK G+T+S Y  +IK++ DK+S + EPLSYRD L Y L  L  EY+ FV
Subjt:  MESFIDG-TPPPSRFLDATQTQSSS------------TARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFV

Query:  SSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNL------VHSPRSSPSSTPRPSIPR---------PRAPFPFSHQFFAPQNSSSVLGRPQF--VS
        +SI+NR+D+ SL +V SLL  Y   LE++++   L         +S ++      +P+ P+         P  P    +Q F P +  S+LG+PQ    S
Subjt:  SSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNL------VHSPRSSPSSTPRPSIPR---------PRAPFPFSHQFFAPQNSSSVLGRPQF--VS

Query:  QPKW----PSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHA--PSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTL
          KW     + N   RPQCQICGK GH  L CY+R N  Y    P+S P P        Q P P  +  T   ST+P    ++ D  W+MDS ATHH T 
Subjt:  QPKW----PSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHA--PSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTL

Query:  DSQLLQQSSPYIGSEQVMIGNS
        +   L       G EQ M+GN+
Subjt:  DSQLLQQSSPYIGSEQVMIGNS

RVW66809.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.4e-3730.3Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFIDGTP-PPSRFLDA-------------------------TQT----------------------QSSSTARIMGL
        L  +NY+LWK+Q  N +FA   E FIDG+   P + L +                         T T                       SSS ARIM L
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFIDGTP-PPSRFLDA-------------------------TQT----------------------QSSSTARIMGL

Query:  RSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSS
        R +LQ  KKG L++  Y  ++K  +D  + I EP+  +D +  +L+ LG +YNA V++I+ R D+ S+  V S+LLA+E RLE+QSS++  + + S   +
Subjt:  RSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSS

Query:  PSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMS
         SS  R    R       +H       +    GR     Q    +SNS  +PQCQ+CGK GHT  +CY+R +  Y +  SS +          SP    +
Subjt:  PSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMS

Query:  SSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLN--------------------------------------
         ++ P   A  +NL+  D+ W++DS A+HH+T     L  SSPY G+++V IGN K L + N                                      
Subjt:  SSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLN--------------------------------------

Query:  --------------DCLSKRVLLQGHLENGLYKLIVAPSTNKF---ALSSSSCQPAALISFYQDVSLWHQRL
                      D  +K+VL QG LENGLY+  V  S       A +SS+        F   V LWH RL
Subjt:  --------------DCLSKRVLLQGHLENGLYKLIVAPSTNKF---ALSSSSCQPAALISFYQDVSLWHQRL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.4e-3631.32Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFID-GTPPPSRFLDAT-------------------------------------------------QTQSSSTARIM
        L ++N LL K+Q LN I A  +E FID     P ++LDA                                                  + +S S A +M
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFID-GTPPPSRFLDAT-------------------------------------------------QTQSSSTARIM

Query:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR
         L SQLQ+IKK  + +S Y +++K + D+F+TI EPLSYRD L  +LE L  EY+ FV+SIHNR+DRPSL +V SLL  YE RL ++S   +LN      
Subjt:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR

Query:  SSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFF-NQFIQSPQP
          P + PR                                 QP + +S     PQCQICGK GH  L  Y+R N  YH P   P+  AF  N   Q+  P
Subjt:  SSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFF-NQFIQSPQP

Query:  TMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL--------------------HVLN---------------
          +  T   +   LS+ S  D +W+MDS ATHH T +   +  +  Y   +  ++GN K +                    HVL+               
Subjt:  TMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL--------------------HVLN---------------

Query:  -----------------DCLSKRVLLQGHLENGLYKLIVAPSTNKFALSSS------SCQPA-ALISFYQDVSLWHQRL
                         D  +K++LLQGHLE GLYKL    +    ++SSS         PA A +S    V LWH RL
Subjt:  -----------------DCLSKRVLLQGHLENGLYKLIVAPSTNKFALSSS------SCQPA-ALISFYQDVSLWHQRL

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]4.1e-4442.44Show/hide
Query:  QLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSP-----
        ++Q++KK GL+VS Y  +IK+I+ K S+I EP+S +DH++Y++E LG EYNAFV+SI NR+D  +L DVR+LLLAY+ RLEKQ+SVD LN+V +      
Subjt:  QLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSP-----

Query:  -RSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNR---PQCQICGKLGHTTLVCYNRHNPMYHA---PSSSPSPQAFF--
               T    +P   +P PF+      +    +LG+P   SQP WP S    R    QCQIC KLGHTT  CY+R N  Y     P++ P+P A F  
Subjt:  -RSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNR---PQCQICGKLGHTTLVCYNRHNPMYHA---PSSSPSPQAFF--

Query:  -----NQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGN
             +QF QS  PT +    P+  +  + +  PD+ W+MDS ATHH++ D   L     Y G EQV +G+
Subjt:  -----NQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGN

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]2.7e-6743.99Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFIDGT-PPPSRFLDATQTQ-------------------------------------------------SSSTARIM
        L D+N+LLWKNQ LN + A  +  ++DGT  PP +FLD  Q Q                                                 S +TARIM
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFIDGT-PPPSRFLDATQTQ-------------------------------------------------SSSTARIM

Query:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR
        GL+++LQ ++K G +VS Y  +IK+I+DKF+ + EPLSYRDHLA+VL+ LG EYNAFV+SIHNR D PSL DVRSLLLAYEARL+KQ++VD LN+  +  
Subjt:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR

Query:  SSPSSTPRPSIPRPRAPFP--FSHQF----FAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFI
         + S       P P+  FP  + H F     +   S S+LG+PQ V   KWP   S ++ QCQICGKLGH+  VCY+R N  YH    + SPQA ++   
Subjt:  SSPSSTPRPSIPRPRAPFP--FSHQF----FAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFI

Query:  QSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL
          P PT  SS             HPDE+WFMDS ATHHMT DS +L   +PY G EQV +GN   +
Subjt:  QSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL

TrEMBL top hitse value%identityAlignment
A0A438G3M6 Retrovirus-related Pol polyprotein from transposon RE14.1e-3730.3Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFIDGTP-PPSRFLDA-------------------------TQT----------------------QSSSTARIMGL
        L  +NY+LWK+Q  N +FA   E FIDG+   P + L +                         T T                       SSS ARIM L
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFIDGTP-PPSRFLDA-------------------------TQT----------------------QSSSTARIMGL

Query:  RSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSS
        R +LQ  KKG L++  Y  ++K  +D  + I EP+  +D +  +L+ LG +YNA V++I+ R D+ S+  V S+LLA+E RLE+QSS++  + + S   +
Subjt:  RSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSS

Query:  PSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMS
         SS  R    R       +H       +    GR     Q    +SNS  +PQCQ+CGK GHT  +CY+R +  Y +  SS +          SP    +
Subjt:  PSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMS

Query:  SSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLN--------------------------------------
         ++ P   A  +NL+  D+ W++DS A+HH+T     L  SSPY G+++V IGN K L + N                                      
Subjt:  SSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLN--------------------------------------

Query:  --------------DCLSKRVLLQGHLENGLYKLIVAPSTNKF---ALSSSSCQPAALISFYQDVSLWHQRL
                      D  +K+VL QG LENGLY+  V  S       A +SS+        F   V LWH RL
Subjt:  --------------DCLSKRVLLQGHLENGLYKLIVAPSTNKF---ALSSSSCQPAALISFYQDVSLWHQRL

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE16.9e-3731.32Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFID-GTPPPSRFLDAT-------------------------------------------------QTQSSSTARIM
        L ++N LL K+Q LN I A  +E FID     P ++LDA                                                  + +S S A +M
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFID-GTPPPSRFLDAT-------------------------------------------------QTQSSSTARIM

Query:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR
         L SQLQ+IKK  + +S Y +++K + D+F+TI EPLSYRD L  +LE L  EY+ FV+SIHNR+DRPSL +V SLL  YE RL ++S   +LN      
Subjt:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR

Query:  SSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFF-NQFIQSPQP
          P + PR                                 QP + +S     PQCQICGK GH  L  Y+R N  YH P   P+  AF  N   Q+  P
Subjt:  SSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFF-NQFIQSPQP

Query:  TMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL--------------------HVLN---------------
          +  T   +   LS+ S  D +W+MDS ATHH T +   +  +  Y   +  ++GN K +                    HVL+               
Subjt:  TMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL--------------------HVLN---------------

Query:  -----------------DCLSKRVLLQGHLENGLYKLIVAPSTNKFALSSS------SCQPA-ALISFYQDVSLWHQRL
                         D  +K++LLQGHLE GLYKL    +    ++SSS         PA A +S    V LWH RL
Subjt:  -----------------DCLSKRVLLQGHLENGLYKLIVAPSTNKFALSSS------SCQPA-ALISFYQDVSLWHQRL

A0A6J1D6N7 uncharacterized protein LOC1110174382.0e-4442.44Show/hide
Query:  QLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSP-----
        ++Q++KK GL+VS Y  +IK+I+ K S+I EP+S +DH++Y++E LG EYNAFV+SI NR+D  +L DVR+LLLAY+ RLEKQ+SVD LN+V +      
Subjt:  QLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSP-----

Query:  -RSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNR---PQCQICGKLGHTTLVCYNRHNPMYHA---PSSSPSPQAFF--
               T    +P   +P PF+      +    +LG+P   SQP WP S    R    QCQIC KLGHTT  CY+R N  Y     P++ P+P A F  
Subjt:  -RSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNR---PQCQICGKLGHTTLVCYNRHNPMYHA---PSSSPSPQAFF--

Query:  -----NQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGN
             +QF QS  PT +    P+  +  + +  PD+ W+MDS ATHH++ D   L     Y G EQV +G+
Subjt:  -----NQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGN

A0A6J1DQX7 uncharacterized protein LOC1110223151.3e-6743.99Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFIDGT-PPPSRFLDATQTQ-------------------------------------------------SSSTARIM
        L D+N+LLWKNQ LN + A  +  ++DGT  PP +FLD  Q Q                                                 S +TARIM
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFIDGT-PPPSRFLDATQTQ-------------------------------------------------SSSTARIM

Query:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR
        GL+++LQ ++K G +VS Y  +IK+I+DKF+ + EPLSYRDHLA+VL+ LG EYNAFV+SIHNR D PSL DVRSLLLAYEARL+KQ++VD LN+  +  
Subjt:  GLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPR

Query:  SSPSSTPRPSIPRPRAPFP--FSHQF----FAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFI
         + S       P P+  FP  + H F     +   S S+LG+PQ V   KWP   S ++ QCQICGKLGH+  VCY+R N  YH    + SPQA ++   
Subjt:  SSPSSTPRPSIPRPRAPFP--FSHQF----FAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFI

Query:  QSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL
          P PT  SS             HPDE+WFMDS ATHHMT DS +L   +PY G EQV +GN   +
Subjt:  QSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGL

A5BPS3 Uncharacterized protein2.4e-3736.65Show/hide
Query:  MESFIDG-TPPPSRFLDATQTQSSS------------TARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFV
        +E FI+G TP P++FLD  Q Q +              + I    +   KIKK G+T+S Y  +IK++ DK+S + EPLSYRD L Y L  L  EY+ FV
Subjt:  MESFIDG-TPPPSRFLDATQTQSSS------------TARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFV

Query:  SSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNL------VHSPRSSPSSTPRPSIPR---------PRAPFPFSHQFFAPQNSSSVLGRPQF--VS
        +SI+NR+D+ SL +V SLL  Y   LE++++   L         +S ++      +P+ P+         P  P    +Q F P +  S+LG+PQ    S
Subjt:  SSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNL------VHSPRSSPSSTPRPSIPR---------PRAPFPFSHQFFAPQNSSSVLGRPQF--VS

Query:  QPKW----PSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHA--PSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTL
          KW     + N   RPQCQICGK GH  L CY+R N  Y    P+S P P        Q P P  +  T   ST+P    ++ D  W+MDS ATHH T 
Subjt:  QPKW----PSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHA--PSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTL

Query:  DSQLLQQSSPYIGSEQVMIGNS
        +   L       G EQ M+GN+
Subjt:  DSQLLQQSSPYIGSEQVMIGNS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-1123.62Show/hide
Query:  STARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLN
        S   +  LR+QL++  KG  T+  Y   +    D+ + + +P+ + + +  VLE+L  EY   +  I  +   P+L ++   LL +E+++   SS   + 
Subjt:  STARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLN

Query:  LVHSPRSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRP---QCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFN
        +  +  S  ++T   +          ++++    N+++   +P   S   +  +N+Q++P   +CQICG  GH+   C    + + H  SS  S Q    
Subjt:  LVHSPRSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRP---QCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFN

Query:  QFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHV
               P+  +   P +   L +  +    W +DS ATHH+T D   L    PY G + VM+ +   + +
Subjt:  QFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-0821.99Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFIDGT---PPPSRFLDATQTQSSSTARIMGLRSQLQKIKKGGLTVSHYP---------------------------
        L  +NYL+W  Q       +++  F+DG+   PP +   DA    +    R       +     G +++S  P                           
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFIDGT---PPPSRFLDATQTQSSSTARIMGLRSQLQKIKKGGLTVSHYP---------------------------

Query:  TQIKDIS--DKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSSPSSTPRPSIPRPRAPF
        TQ++ I+  D+ + + +P+ + + +  VLE+L  +Y   +  I  +   PSL ++   L+  E++L   +S + +     P ++   T R +        
Subjt:  TQIKDIS--DKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSSPSSTPRPSIPRPRAPF

Query:  PFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRP---QCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLS-
           ++ +   N+ S   +P   S     S N Q +P   +CQIC   GH+   C                PQ    Q   + Q + S  T     A L+ 
Subjt:  PFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRP---QCQICGKLGHTTLVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLS-

Query:  NLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLNDCLSKRVLLQGHLENGLYKLIVAPSTNKFALS
        N  +    W +DS ATHH+T D   L    PY G + VMI +   + + +   +        L+  L K++  P+ +K  +S
Subjt:  NLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLNDCLSKRVLLQGHLENGLYKLIVAPSTNKFALS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.1e-1023.96Show/hide
Query:  LADSNYLLWKNQFLNHIFAFDMESFIDGTPPPSRFLDA----------------------------------------TQTQSSSTARIMGLRSQLQKIK
        + +SNY  W+  FL H  +FD+   IDGT  P+   D                                          Q +++  AR + L S+L+   
Subjt:  LADSNYLLWKNQFLNHIFAFDMESFIDGTPPPSRFLDA----------------------------------------TQTQSSSTARIMGLRSQLQKIK

Query:  KGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSS
         G + V+ Y  ++K ++D    +D P++ R+ + YVL  L P+++  ++ I +R   PS  D  ++L   E RL++    +  ++ HS  S+
Subjt:  KGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSS

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.6e-0722.89Show/hide
Query:  QSSSTARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVD
        + +  AR +   ++L+      L+V  Y  ++K +SD  + +D P+S R  + ++L  L  +Y+  ++ I +++  PS  + RS+LL  E+RL  +S   
Subjt:  QSSSTARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNAFVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVD

Query:  SLNLVHSPRSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSP------S
          +  H   S+   T    +PR +  +P  +      N++S +GR +         S  +NR      G+  +      N+     + P  SP       
Subjt:  SLNLVHSPRSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTTLVCYNRHNPMYHAPSSSP------S

Query:  PQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMT
        PQ F  +     QP +  S      +P S L   +     D+R  +  T
Subjt:  PQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGGCAAGCAGAAAGGGAGATAGCTGTGCTCGGCCTCTTCTCGAGGCCGAGGCCGACCAGCACCACTTTTGTAAT
CTCTCACCCTTTTTCATTTCCGTTTCATCTAACCCCTATTCAGGGAGTGTCTACTTCTATCCTACTTTGCAGGATGACAACAAAGCTACACGCAATTTTGGACCACCCCG
ATATACAAGGAGCTGACGAGGACAACCGGGGAGGAATCAGGCTGAAAGATGGACCAAGGGGGCAAAACCGGCAACATCGGAGCCGGTGTGGCGAGCACCACACCGGTGTG
CAGGTTTACTGTCTTGCAGATTCTAATTATCTACTGTGGAAGAACCAGTTCCTCAACCACATCTTTGCATTTGACATGGAATCGTTTATTGATGGAACTCCTCCTCCATC
GCGTTTTTTGGATGCTACACAGACTCAGTCCTCTTCAACGGCTAGAATAATGGGTCTTAGATCACAACTTCAGAAAATTAAGAAAGGCGGGTTAACAGTGTCTCACTATC
CAACTCAGATTAAGGACATTTCGGACAAGTTCTCTACTATTGATGAGCCTCTGTCCTATCGGGATCATCTAGCTTATGTTCTTGAGAGTTTAGGTCCAGAGTATAATGCT
TTTGTAAGCTCCATACATAATCGCACTGATAGACCATCTCTTGCTGATGTTCGTAGCTTATTACTAGCATATGAGGCACGCCTTGAAAAACAGTCTTCGGTCGATAGCTT
AAATTTAGTGCATTCTCCACGATCTTCACCTTCCTCTACTCCTCGACCCTCTATTCCGCGACCTCGGGCCCCTTTTCCCTTTTCTCATCAGTTCTTTGCTCCTCAAAATA
GCTCCAGTGTCCTTGGTAGACCCCAGTTTGTATCTCAACCAAAATGGCCTTCCTCCAACTCTCAGAATCGACCTCAATGTCAAATCTGCGGGAAGCTTGGTCATACGACT
CTTGTTTGTTACAACCGACATAACCCAATGTACCATGCTCCCTCATCCTCGCCTTCTCCACAAGCCTTTTTCAATCAGTTTATCCAGTCCCCTCAGCCTACCATGTCGTC
CTCAACTGCCCCTGACTCCACTGCTCCTTTATCAAATTTGTCTCATCCCGACGAAGCATGGTTCATGGACTCTAGAGCAACTCACCATATGACACTGGATAGCCAGCTTC
TTCAACAGTCTAGCCCTTATATTGGCAGTGAGCAGGTCATGATTGGCAATAGTAAGGGACTTCATGTTCTCAATGATTGTCTATCCAAGAGAGTACTCCTTCAGGGACAT
CTTGAAAATGGTCTTTACAAGCTGATCGTTGCTCCATCTACCAACAAATTTGCTTTGTCCAGCTCCTCTTGTCAACCTGCAGCTCTTATTTCCTTCTATCAGGACGTGTC
TCTTTGGCATCAACGCCTTGCTAAGAGTCAGGCTGTCGTTTGTTGCCTCTTTTTCAAGGGCGACTACCCCTTTGAACCTTGTTTACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGGCAAGCAGAAAGGGAGATAGCTGTGCTCGGCCTCTTCTCGAGGCCGAGGCCGACCAGCACCACTTTTGTAAT
CTCTCACCCTTTTTCATTTCCGTTTCATCTAACCCCTATTCAGGGAGTGTCTACTTCTATCCTACTTTGCAGGATGACAACAAAGCTACACGCAATTTTGGACCACCCCG
ATATACAAGGAGCTGACGAGGACAACCGGGGAGGAATCAGGCTGAAAGATGGACCAAGGGGGCAAAACCGGCAACATCGGAGCCGGTGTGGCGAGCACCACACCGGTGTG
CAGGTTTACTGTCTTGCAGATTCTAATTATCTACTGTGGAAGAACCAGTTCCTCAACCACATCTTTGCATTTGACATGGAATCGTTTATTGATGGAACTCCTCCTCCATC
GCGTTTTTTGGATGCTACACAGACTCAGTCCTCTTCAACGGCTAGAATAATGGGTCTTAGATCACAACTTCAGAAAATTAAGAAAGGCGGGTTAACAGTGTCTCACTATC
CAACTCAGATTAAGGACATTTCGGACAAGTTCTCTACTATTGATGAGCCTCTGTCCTATCGGGATCATCTAGCTTATGTTCTTGAGAGTTTAGGTCCAGAGTATAATGCT
TTTGTAAGCTCCATACATAATCGCACTGATAGACCATCTCTTGCTGATGTTCGTAGCTTATTACTAGCATATGAGGCACGCCTTGAAAAACAGTCTTCGGTCGATAGCTT
AAATTTAGTGCATTCTCCACGATCTTCACCTTCCTCTACTCCTCGACCCTCTATTCCGCGACCTCGGGCCCCTTTTCCCTTTTCTCATCAGTTCTTTGCTCCTCAAAATA
GCTCCAGTGTCCTTGGTAGACCCCAGTTTGTATCTCAACCAAAATGGCCTTCCTCCAACTCTCAGAATCGACCTCAATGTCAAATCTGCGGGAAGCTTGGTCATACGACT
CTTGTTTGTTACAACCGACATAACCCAATGTACCATGCTCCCTCATCCTCGCCTTCTCCACAAGCCTTTTTCAATCAGTTTATCCAGTCCCCTCAGCCTACCATGTCGTC
CTCAACTGCCCCTGACTCCACTGCTCCTTTATCAAATTTGTCTCATCCCGACGAAGCATGGTTCATGGACTCTAGAGCAACTCACCATATGACACTGGATAGCCAGCTTC
TTCAACAGTCTAGCCCTTATATTGGCAGTGAGCAGGTCATGATTGGCAATAGTAAGGGACTTCATGTTCTCAATGATTGTCTATCCAAGAGAGTACTCCTTCAGGGACAT
CTTGAAAATGGTCTTTACAAGCTGATCGTTGCTCCATCTACCAACAAATTTGCTTTGTCCAGCTCCTCTTGTCAACCTGCAGCTCTTATTTCCTTCTATCAGGACGTGTC
TCTTTGGCATCAACGCCTTGCTAAGAGTCAGGCTGTCGTTTGTTGCCTCTTTTTCAAGGGCGACTACCCCTTTGAACCTTGTTTACTCTGA
Protein sequenceShow/hide protein sequence
MMLGLFPRPRPTRQAEREIAVLGLFSRPRPTSTTFVISHPFSFPFHLTPIQGVSTSILLCRMTTKLHAILDHPDIQGADEDNRGGIRLKDGPRGQNRQHRSRCGEHHTGV
QVYCLADSNYLLWKNQFLNHIFAFDMESFIDGTPPPSRFLDATQTQSSSTARIMGLRSQLQKIKKGGLTVSHYPTQIKDISDKFSTIDEPLSYRDHLAYVLESLGPEYNA
FVSSIHNRTDRPSLADVRSLLLAYEARLEKQSSVDSLNLVHSPRSSPSSTPRPSIPRPRAPFPFSHQFFAPQNSSSVLGRPQFVSQPKWPSSNSQNRPQCQICGKLGHTT
LVCYNRHNPMYHAPSSSPSPQAFFNQFIQSPQPTMSSSTAPDSTAPLSNLSHPDEAWFMDSRATHHMTLDSQLLQQSSPYIGSEQVMIGNSKGLHVLNDCLSKRVLLQGH
LENGLYKLIVAPSTNKFALSSSSCQPAALISFYQDVSLWHQRLAKSQAVVCCLFFKGDYPFEPCLL