; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032045 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032045
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr11:23218942..23224710
RNA-Seq ExpressionLag0032045
SyntenyLag0032045
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTS SVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-7393.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTSRSVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTS SVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

KAA0058279.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTSRSVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LI EIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTS SVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein4.3e-7393.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTSRSVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

A0A5A7TKM4 Gag/pol protein1.3e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTS SVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

A0A5A7TZD0 Gag/pol protein1.3e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTS SVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

A0A5A7UXM9 Gag/pol protein1.6e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTSRSVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LI EIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

A0A5A7UYE8 Gag/pol protein1.3e-7293.29Show/hide
Query:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH
        TDKDSRKSTS SVFTLNGG VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLP+TLYCDNSGAVANSKEP SHKRGKHIERKYH
Subjt:  TDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYH

Query:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI
        LIREIVQ+GDVIVTKIASEHNIADPFTK L+AK+FEGHLESLGLR+MYI
Subjt:  LIREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGLREMYI

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.4e-2038.57Show/hide
Query:  RKSTSRSVFTL-NGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHLIRE
        RKST+  +F + +  ++ W + +Q  +A S+ EAEY+A  EA +EA+WL+  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH  RE
Subjt:  RKSTSRSVFTL-NGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHLIRE

Query:  IVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGL
         VQ   + +  I +E+ +AD FTK L A  F    + LGL
Subjt:  IVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGL

P0CV72 Secreted RxLR effector protein 1618.0e-0842.86Show/hide
Query:  QKPDSFGLGPSHLSVLPLGSISQPDFSPVVLTDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL
        Q   ++GL  +      L   S  D++     D +SR+STS  +F LNGG V WRS KQ  +A S+ E EY+A  EA +EAVWL
Subjt:  QKPDSFGLGPSHLSVLPLGSISQPDFSPVVLTDKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-2341.26Show/hide
Query:  DKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHL
        D D+RKS++  +FT +GG + W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH 
Subjt:  DKDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHL

Query:  IREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGL
        IRE+V    + V KI++  N AD  TK +    FE   E +G+
Subjt:  IREIVQQGDVIVTKIASEHNIADPFTKALSAKMFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.5e-1236.27Show/hide
Query:  KDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHLI
        KD+R+ST+     L   ++ W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +
Subjt:  KDSRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHLI

Query:  RE
        RE
Subjt:  RE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCACAATGTTGCTGCTCCACGTGTCGATCGGTTCAAGGTCCGAGTGTCACAATGTTGCTGCACGTGTCGATCGGTTCGGCCGAAGGTGAAGTCCGCGTTTGACCG
TCCTTCCGCGCGCGACGTTGACCGTTTGAAGTGTCGTCAGTTCCGTTTGTTGACCGTTAGTCTCGCCTGCGCGGATTGCTCCTGTTGTGACACGTGGCAGCTGCTTACCT
CCTTTTGTGGGAAAGTGGTCGTGGTTATCTCCACGACACAACAACTGTGTTTCTCAACTTCTTCTAACAGTGAGTGCTTGAGCGCCGACAAAAGCTTTGTGACGGCGTTC
AATCTGTCTCGCTCTCTCTTGTCTAGTTCGAATCGTGGGTCACGTACGGGAAATGACTGCGGCGGCGTTGCTGAGAGAGAGAGGACGACAATCTGGTCATGGGTATCGCC
GACGGGGAGGGAAGAGGAATTCCCTTCTCCTCATTTCGGCATGGGGAACGGGGATGAGAATTCGGAGGCATTTCGGGACAAACCAGACGAAACCGAGGCGGCCAAAGGCG
ATAGGGACCAAGCGGAGTCGGACGGACTCGGCCCGCGGGCCGAGGCCGAGCAGGGGGTCAGGCCAAAACCCGACCCCTTCGGTCTTGGCCCGTCCCACTTGCCGGTTTTG
CCTCTTGGGTCCATCTCTCAGCCTGATTTCTTCCCGGTTGTCCTCGTCAGCTCCTTTAGCGAAGATGGGTATAAATACCTGCTCATGCTTCTAGGGTTTTTAGGAATTCG
GAGGCGTTTCGGGACAAACCAGGCGAACCCGAGGCAGCCAAAGGCGGTAGGGACCAAGCGGAGTCAGACGGACTCGGCCCGCGCGAACGGGCCGAGGCCGAGTAGGAGGT
CGGGCCAAAAACCCGACTCCTTCGGTCTTGGCCCGTCCCACTTGTCGGTTTTGCCTCTTGGGTCCATCTCTCAGCCTGATTTCTCCCCGGTTGTCCTCACTGACAAGGAT
TCGAGGAAATCTACGTCAAGGTCAGTATTTACCCTTAACGGGGGAGTTGTAGTCTGGAGAAGCATCAAGCAAGGATGCATAGCAGACTCCACAATGGAGGCAGAATATGT
AGCTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTGAGAAAGTTCTTGACAGATTTGGAGGTCGTTCCAAATATGAACTTGCCCGTTACATTATACTGTGACAACAGTG
GGGCTGTAGCTAATTCTAAAGAACCTCACAGCCACAAACGAGGAAAGCACATCGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACAAGGAGATGTAATCGTTACC
AAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGGCTCTCTCGGCTAAAATGTTCGAGGGTCATCTAGAGAGTCTAGGTCTGCGAGAAATGTATATAGTATA
A
mRNA sequenceShow/hide mRNA sequence
ATGTGTCACAATGTTGCTGCTCCACGTGTCGATCGGTTCAAGGTCCGAGTGTCACAATGTTGCTGCACGTGTCGATCGGTTCGGCCGAAGGTGAAGTCCGCGTTTGACCG
TCCTTCCGCGCGCGACGTTGACCGTTTGAAGTGTCGTCAGTTCCGTTTGTTGACCGTTAGTCTCGCCTGCGCGGATTGCTCCTGTTGTGACACGTGGCAGCTGCTTACCT
CCTTTTGTGGGAAAGTGGTCGTGGTTATCTCCACGACACAACAACTGTGTTTCTCAACTTCTTCTAACAGTGAGTGCTTGAGCGCCGACAAAAGCTTTGTGACGGCGTTC
AATCTGTCTCGCTCTCTCTTGTCTAGTTCGAATCGTGGGTCACGTACGGGAAATGACTGCGGCGGCGTTGCTGAGAGAGAGAGGACGACAATCTGGTCATGGGTATCGCC
GACGGGGAGGGAAGAGGAATTCCCTTCTCCTCATTTCGGCATGGGGAACGGGGATGAGAATTCGGAGGCATTTCGGGACAAACCAGACGAAACCGAGGCGGCCAAAGGCG
ATAGGGACCAAGCGGAGTCGGACGGACTCGGCCCGCGGGCCGAGGCCGAGCAGGGGGTCAGGCCAAAACCCGACCCCTTCGGTCTTGGCCCGTCCCACTTGCCGGTTTTG
CCTCTTGGGTCCATCTCTCAGCCTGATTTCTTCCCGGTTGTCCTCGTCAGCTCCTTTAGCGAAGATGGGTATAAATACCTGCTCATGCTTCTAGGGTTTTTAGGAATTCG
GAGGCGTTTCGGGACAAACCAGGCGAACCCGAGGCAGCCAAAGGCGGTAGGGACCAAGCGGAGTCAGACGGACTCGGCCCGCGCGAACGGGCCGAGGCCGAGTAGGAGGT
CGGGCCAAAAACCCGACTCCTTCGGTCTTGGCCCGTCCCACTTGTCGGTTTTGCCTCTTGGGTCCATCTCTCAGCCTGATTTCTCCCCGGTTGTCCTCACTGACAAGGAT
TCGAGGAAATCTACGTCAAGGTCAGTATTTACCCTTAACGGGGGAGTTGTAGTCTGGAGAAGCATCAAGCAAGGATGCATAGCAGACTCCACAATGGAGGCAGAATATGT
AGCTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTGAGAAAGTTCTTGACAGATTTGGAGGTCGTTCCAAATATGAACTTGCCCGTTACATTATACTGTGACAACAGTG
GGGCTGTAGCTAATTCTAAAGAACCTCACAGCCACAAACGAGGAAAGCACATCGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACAAGGAGATGTAATCGTTACC
AAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGGCTCTCTCGGCTAAAATGTTCGAGGGTCATCTAGAGAGTCTAGGTCTGCGAGAAATGTATATAGTATA
A
Protein sequenceShow/hide protein sequence
MCHNVAAPRVDRFKVRVSQCCCTCRSVRPKVKSAFDRPSARDVDRLKCRQFRLLTVSLACADCSCCDTWQLLTSFCGKVVVVISTTQQLCFSTSSNSECLSADKSFVTAF
NLSRSLLSSSNRGSRTGNDCGGVAERERTTIWSWVSPTGREEEFPSPHFGMGNGDENSEAFRDKPDETEAAKGDRDQAESDGLGPRAEAEQGVRPKPDPFGLGPSHLPVL
PLGSISQPDFFPVVLVSSFSEDGYKYLLMLLGFLGIRRRFGTNQANPRQPKAVGTKRSQTDSARANGPRPSRRSGQKPDSFGLGPSHLSVLPLGSISQPDFSPVVLTDKD
SRKSTSRSVFTLNGGVVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPVTLYCDNSGAVANSKEPHSHKRGKHIERKYHLIREIVQQGDVIVT
KIASEHNIADPFTKALSAKMFEGHLESLGLREMYIV