CuGenDBv2

Lag0032565 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032565
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr11:34725192..34726223
RNA-Seq Expression: Lag0032565
Synteny: Lag0032565
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
GFY82848.1 hypothetical protein Acr_02g0010880 [Actinidia rufa]; e value: 3.1e-51; % identity: 43
Query:  MSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFV
        MSWIY+S+NE  LG+I+G +SA +IW+ L  +Y ++S A +  LR+ LQ IKKDGL+   Y+ + + + +  ++IGEP++Y DHL Y L GL  +YN FV
Subjt:  MSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFV

Query:  TSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPP
        TS+ ++  RPS+ ++ SLL++YDARLE+Q++ D L+ +QANLANL    + Q  +F+  S    PNSNS    P     N S+SP+P       +SP+P 
Subjt:  TSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPP

Query:  RWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHP
                RP+C IC K GHT   CY+R N  YQ P    PP   FN +   +  +S S    AS+S      S PD SW+MDSGA+HH TP  N +   
Subjt:  RWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHP

Query:  SPYYGSE
        SPY G +
Subjt:  SPYYGSE

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]; e value: 1.7e-49; % identity: 38.66
Query:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA
        +E F+DG+   PP+FLDP   Q NP F  WQ+YNR +MSWIY+S+NE  LG+I+G +SA +IW+ L  +Y ++S A +  LR+ LQ IKK+GL+   Y+ 
Subjt:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA

Query:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP
        + + + +  ++IGEP++Y DHL Y L GL  +YN FVTS+ ++  RPS              +E+ TS   L               ++  +F+  S   
Subjt:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP

Query:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS
         PNSNS    P     N S+SP+P       +SP+P         RP+C IC K GHT   CY+  N  YQ P    PP   FN +   +  +S S    
Subjt:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS

Query:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
         S+   + L S PD SW+MDSGA+HH TPD N +   SPY G +
Subjt:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e value: 3.1e-51; % identity: 38.53
Query:  MESFID-GTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA
        +E FID    +PPK+LD A  Q+NP F  W + N+ +MSWIYSSL    +G+I+  S+A +IW  L   YES S A +MSL SQLQ+IKK  + +S+YL+
Subjt:  MESFID-GTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA

Query:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP
        ++K + D+F+ IGEPLSY D L  ILEGL  EY+ FVTS+HNR++RPSL ++ SLL  Y+ RL +++    LN  QAN                      
Subjt:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP

Query:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS
                                            PR P  N + PQC ICGK GH  L  Y+R N  Y  P   +      N    +S+P+S  +  S
Subjt:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS

Query:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPY
        A+ +     +S  D SW+MDSGATHH TP+F  +     Y
Subjt:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPY

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]; e value: 3.4e-50; % identity: 46.52
Query:  QLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLS
        ++Q++KKDGLSVSQYLA+IK+I  K S+IGEP+S  DH+ YI+EGL  EYNAFVTS+ NR++  +L D+R+LL+AYD RLEKQ SVDQLN++QAN+ANL 
Subjt:  QLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLS

Query:  SNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQN----RPQCHICGKFGHTTLVCYNRHNPVYQA---PQVS
         N  S   R            N R PS    P  F+F   PG+LG+P  + +PP WP + Q+    + QC IC K GHTT  CY+R N  Y+    P   
Subjt:  SNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQN----RPQCHICGKFGHTTLVCYNRHNPVYQA---PQVS

Query:  SPPQAFFNQFSASSAPVSQS-----VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
          P A F+  ++SS+   QS     VP + +  +F  L   PD+ W+MDSGATHH++ D N++ +   Y G E
Subjt:  SPPQAFFNQFSASSAPVSQS-----VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]; e value: 3.2e-80; % identity: 49.42
Query:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA
        +  ++DGT   PP+FLD    Q NP +  W++YNR LM WIYSSL+E+++GE++   + ++IW  L  VY+S +TARIM L+++LQ ++KDG SVSQYLA
Subjt:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA

Query:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP
        +IK+IADKF+A+GEPLSY DHL ++L+GL SEYNAFVTS+HNR + PSL D+RSLL+AY+ARL+KQ +VDQLN+ QANL NLS   +S+         PP
Subjt:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP

Query:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS
        K +  +     FP  P  S + S  +LG+PQ+  + P  PS+  ++ QC ICGK GH+  VCY+R N  Y     ++ PQA ++    S        P  
Subjt:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS

Query:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
         SS        HPDESWFMDSGATHHMTPD + + +P+PY G E
Subjt:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

TrEMBL top hits (e value, % identity, alignment)
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.5e-51; % identity: 38.53
Query:  MESFID-GTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA
        +E FID    +PPK+LD A  Q+NP F  W + N+ +MSWIYSSL    +G+I+  S+A +IW  L   YES S A +MSL SQLQ+IKK  + +S+YL+
Subjt:  MESFID-GTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA

Query:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP
        ++K + D+F+ IGEPLSY D L  ILEGL  EY+ FVTS+HNR++RPSL ++ SLL  Y+ RL +++    LN  QAN                      
Subjt:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP

Query:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS
                                            PR P  N + PQC ICGK GH  L  Y+R N  Y  P   +      N    +S+P+S  +  S
Subjt:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS

Query:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPY
        A+ +     +S  D SW+MDSGATHH TP+F  +     Y
Subjt:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPY

A0A6J1D6N7 uncharacterized protein LOC111017438; e value: 1.7e-50; % identity: 46.52
Query:  QLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLS
        ++Q++KKDGLSVSQYLA+IK+I  K S+IGEP+S  DH+ YI+EGL  EYNAFVTS+ NR++  +L D+R+LL+AYD RLEKQ SVDQLN++QAN+ANL 
Subjt:  QLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLS

Query:  SNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQN----RPQCHICGKFGHTTLVCYNRHNPVYQA---PQVS
         N  S   R            N R PS    P  F+F   PG+LG+P  + +PP WP + Q+    + QC IC K GHTT  CY+R N  Y+    P   
Subjt:  SNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQN----RPQCHICGKFGHTTLVCYNRHNPVYQA---PQVS

Query:  SPPQAFFNQFSASSAPVSQS-----VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
          P A F+  ++SS+   QS     VP + +  +F  L   PD+ W+MDSGATHH++ D N++ +   Y G E
Subjt:  SPPQAFFNQFSASSAPVSQS-----VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

A0A6J1DQX7 uncharacterized protein LOC111022315; e value: 1.5e-80; % identity: 49.42
Query:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA
        +  ++DGT   PP+FLD    Q NP +  W++YNR LM WIYSSL+E+++GE++   + ++IW  L  VY+S +TARIM L+++LQ ++KDG SVSQYLA
Subjt:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA

Query:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP
        +IK+IADKF+A+GEPLSY DHL ++L+GL SEYNAFVTS+HNR + PSL D+RSLL+AY+ARL+KQ +VDQLN+ QANL NLS   +S+         PP
Subjt:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP

Query:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS
        K +  +     FP  P  S + S  +LG+PQ+  + P  PS+  ++ QC ICGK GH+  VCY+R N  Y     ++ PQA ++    S        P  
Subjt:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS

Query:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
         SS        HPDESWFMDSGATHHMTPD + + +P+PY G E
Subjt:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

A0A7J0E8R3 Uncharacterized protein; e value: 1.5e-51; % identity: 43
Query:  MSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFV
        MSWIY+S+NE  LG+I+G +SA +IW+ L  +Y ++S A +  LR+ LQ IKKDGL+   Y+ + + + +  ++IGEP++Y DHL Y L GL  +YN FV
Subjt:  MSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFV

Query:  TSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPP
        TS+ ++  RPS+ ++ SLL++YDARLE+Q++ D L+ +QANLANL    + Q  +F+  S    PNSNS    P     N S+SP+P       +SP+P 
Subjt:  TSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPP

Query:  RWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHP
                RP+C IC K GHT   CY+R N  YQ P    PP   FN +   +  +S S    AS+S      S PD SW+MDSGA+HH TP  N +   
Subjt:  RWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHP

Query:  SPYYGSE
        SPY G +
Subjt:  SPYYGSE

A0A7J0GPN0 UBX domain-containing protein; e value: 8.2e-50; % identity: 38.66
Query:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA
        +E F+DG+   PP+FLDP   Q NP F  WQ+YNR +MSWIY+S+NE  LG+I+G +SA +IW+ L  +Y ++S A +  LR+ LQ IKK+GL+   Y+ 
Subjt:  MESFIDGT-PAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLA

Query:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP
        + + + +  ++IGEP++Y DHL Y L GL  +YN FVTS+ ++  RPS              +E+ TS   L               ++  +F+  S   
Subjt:  QIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPP

Query:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS
         PNSNS    P     N S+SP+P       +SP+P         RP+C IC K GHT   CY+  N  YQ P    PP   FN +   +  +S S    
Subjt:  KPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVS

Query:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
         S+   + L S PD SW+MDSGA+HH TPD N +   SPY G +
Subjt:  ASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 7.2e-19; % identity: 21.84
Query:  MESFIDG-TPAPPKFL-DPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYL
        +  F+DG T  PP  +   A  ++NP +T W++ ++ + S +  +++      +   ++A +IW+ LR +Y + S   +  LR+QL++  K   ++  Y+
Subjt:  MESFIDG-TPAPPKFL-DPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYL

Query:  AQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNP
          +    D+ + +G+P+ + + +  +LE L  EY   +  +  +   P+L ++   L+ +++++   +S   + +    +++ ++  ++      R++  
Subjt:  AQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNP

Query:  PKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRP---QCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQS
           N+N+          NF                     P+NNQ++P   +C ICG  GH+   C              S  Q F +  + S  P S  
Subjt:  PKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRP---QCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQS

Query:  VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
         P    ++  +  + +   +W +DSGATHH+T DFN +    PY G +
Subjt:  VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.2e-13; % identity: 24.14
Query:  MESFIDG-TPAPPKFL-DPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYL
        +  F+DG TP PP  +   A  ++NP +T W++ ++ + S I  +++      +   ++A +IW+ LR +Y + S   +  LR               ++
Subjt:  MESFIDG-TPAPPKFL-DPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYL

Query:  AQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNP
         +     D+ + +G+P+ + + +  +LE L  +Y   +  +  +   PSL ++   LI  +++L    S + +  I AN+     N ++   +  R  N 
Subjt:  AQIKDIADKFSAIGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNP

Query:  PKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRP---QCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQS
           N+N+R  S   + P+ S S S                  N Q +P   +C IC   GH+   C   H             Q+  NQ   S++P +  
Subjt:  PKPNSNSRQPSPFPFPPNFSFSPSPGVLGRPQTSPRPPRWPSNNQNRP---QCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQS

Query:  VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE
         P  A+ +   P  ++   +W +DSGATHH+T DFN +    PY G +
Subjt:  VPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDFNTIQHPSPYYGSE

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink); e value: 5.9e-08; % identity: 28.71
Query:  FIDGTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKD
        FIDGT   P   DP     +PL+  W++ N  +M W+ +S+ +  L  ++   +A+++W+ LR V+      +I  LR +L  +++ G SV +Y  ++  
Subjt:  FIDGTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKD

Query:  I
        +
Subjt:  I


Sequences
CDS sequence
ATGGAGAGTTTCATTGATGGTACTCCGGCTCCTCCGAAGTTTTTGGATCCCGCCCACACTCAACTAAATCCTCTGTTTACTGTATGGCAAAAGTATAATCGGACACTGAT
GAGTTGGATTTATTCCTCTTTGAATGAAGACGAATTAGGTGAAATAATAGGATGTTCTTCTGCTTATGAAATCTGGGATCATTTGCGTATTGTTTATGAATCGTCTTCTA
CTGCACGAATTATGAGCTTAAGGTCTCAGCTTCAGAAAATCAAGAAGGATGGTCTGTCGGTGTCTCAGTACCTTGCCCAAATCAAAGATATAGCCGACAAATTCTCCGCC
ATTGGTGAACCACTGTCGTATCTTGACCATCTCGGCTATATCCTTGAGGGGCTCGAGTCTGAGTACAATGCTTTCGTCACCTCTGTTCACAACAGAACAAACCGCCCTTC
CCTTGCCGACCTTCGGAGTCTTCTCATTGCTTATGATGCTCGTCTGGAAAAACAAACTTCGGTGGACCAACTGAACCTGATTCAGGCCAATCTTGCAAACTTGTCTTCCA
ATCCTTCGTCACAACCTAAGCGGTTTCAGCGTTCTTCAAATCCTCCTAAACCTAATTCAAATTCGAGGCAACCTTCCCCTTTTCCCTTTCCACCCAATTTCTCTTTTTCC
CCTAGTCCTGGTGTGTTGGGTCGGCCTCAGACTTCTCCTCGTCCTCCCAGATGGCCTTCTAATAATCAAAACCGTCCTCAGTGTCATATTTGCGGCAAGTTTGGACACAC
AACTCTTGTGTGTTATAATCGTCACAATCCTGTTTATCAAGCTCCACAAGTTTCTTCCCCTCCACAAGCCTTTTTTAACCAATTTTCAGCTTCCTCTGCCCCTGTTAGTC
AATCAGTCCCTGTTTCCGCCTCTTCATCCACCTTTATCCCTCTCACATCCCACCCTGATGAATCCTGGTTCATGGACTCGGGTGCCACTCATCACATGACCCCCGACTTC
AATACCATTCAGCATCCATCCCCATACTATGGCAGTGAGTAG
mRNA sequence
ATGGAGAGTTTCATTGATGGTACTCCGGCTCCTCCGAAGTTTTTGGATCCCGCCCACACTCAACTAAATCCTCTGTTTACTGTATGGCAAAAGTATAATCGGACACTGAT
GAGTTGGATTTATTCCTCTTTGAATGAAGACGAATTAGGTGAAATAATAGGATGTTCTTCTGCTTATGAAATCTGGGATCATTTGCGTATTGTTTATGAATCGTCTTCTA
CTGCACGAATTATGAGCTTAAGGTCTCAGCTTCAGAAAATCAAGAAGGATGGTCTGTCGGTGTCTCAGTACCTTGCCCAAATCAAAGATATAGCCGACAAATTCTCCGCC
ATTGGTGAACCACTGTCGTATCTTGACCATCTCGGCTATATCCTTGAGGGGCTCGAGTCTGAGTACAATGCTTTCGTCACCTCTGTTCACAACAGAACAAACCGCCCTTC
CCTTGCCGACCTTCGGAGTCTTCTCATTGCTTATGATGCTCGTCTGGAAAAACAAACTTCGGTGGACCAACTGAACCTGATTCAGGCCAATCTTGCAAACTTGTCTTCCA
ATCCTTCGTCACAACCTAAGCGGTTTCAGCGTTCTTCAAATCCTCCTAAACCTAATTCAAATTCGAGGCAACCTTCCCCTTTTCCCTTTCCACCCAATTTCTCTTTTTCC
CCTAGTCCTGGTGTGTTGGGTCGGCCTCAGACTTCTCCTCGTCCTCCCAGATGGCCTTCTAATAATCAAAACCGTCCTCAGTGTCATATTTGCGGCAAGTTTGGACACAC
AACTCTTGTGTGTTATAATCGTCACAATCCTGTTTATCAAGCTCCACAAGTTTCTTCCCCTCCACAAGCCTTTTTTAACCAATTTTCAGCTTCCTCTGCCCCTGTTAGTC
AATCAGTCCCTGTTTCCGCCTCTTCATCCACCTTTATCCCTCTCACATCCCACCCTGATGAATCCTGGTTCATGGACTCGGGTGCCACTCATCACATGACCCCCGACTTC
AATACCATTCAGCATCCATCCCCATACTATGGCAGTGAGTAG
Protein sequence
MESFIDGTPAPPKFLDPAHTQLNPLFTVWQKYNRTLMSWIYSSLNEDELGEIIGCSSAYEIWDHLRIVYESSSTARIMSLRSQLQKIKKDGLSVSQYLAQIKDIADKFSA
IGEPLSYLDHLGYILEGLESEYNAFVTSVHNRTNRPSLADLRSLLIAYDARLEKQTSVDQLNLIQANLANLSSNPSSQPKRFQRSSNPPKPNSNSRQPSPFPFPPNFSFS
PSPGVLGRPQTSPRPPRWPSNNQNRPQCHICGKFGHTTLVCYNRHNPVYQAPQVSSPPQAFFNQFSASSAPVSQSVPVSASSSTFIPLTSHPDESWFMDSGATHHMTPDF
NTIQHPSPYYGSE