CuGenDBv2

Lag0022213 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022213
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr7:21243349..21245767
RNA-Seq Expression: Lag0022213
Synteny: Lag0022213
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
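The genome location listed above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the `chr:start..end` form shown in this record and assuming 1-based inclusive coordinates (the record does not state the convention). `Location` and `parse_location` are hypothetical helper names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr7:21243349..21245767")
print(loc.chrom, loc.start, loc.end, loc.span)
```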


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 6.8e-12; % identity: 26.8)
Query:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKAPNPPSPLSHLSSPVGANLRRHPRQRQ
        G + E +QV+FIL SLP S++ F+TNAS+NKI+FNLTTLL+ELQ +++L  SKGK V   EANVA +K+KF++                           
Subjt:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKAPNPPSPLSHLSSPVGANLRRHPRQRQ

Query:  QQQRKDRESGGVVCVGFFFSFCCSSSLNLLSFVLSHFRPLPLCLLLNSGESPSLTSLPCRFFFSFSHAAVTRRSYLISQVSTPPSSKKNKFGFNRTELNR
                                                                                            SS KNK G ++ ++ +
Subjt:  QQQRKDRESGGVVCVGFFFSFCCSSSLNLLSFVLSHFRPLPLCLLLNSGESPSLTSLPCRFFFSFSHAAVTRRSYLISQVSTPPSSKKNKFGFNRTELNR

Query:  TILVRFKNRNRDSFGFLVQETAKPTSSVRSMVRPKTGPNRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFHCNADEHWKRNCPRYLAEKR
                                                      KGKAP        K K  ADKG CFHCN D HWKRNCP+YLAEK+
Subjt:  TILVRFKNRNRDSFGFLVQETAKPTSSVRSMVRPKTGPNRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFHCNADEHWKRNCPRYLAEKR

KAA0035827.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 8.8e-12; % identity: 45.45)
Query:  HAATGLGWTERTMRGRKFFNTC-LARGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKA
        H   G+   E  M     FN   +  G + E +QV+FIL SLP S++ F+TNAS+NKI+FNLTT L+ELQ +++L K KGK V   EANVA +K+KF + 
Subjt:  HAATGLGWTERTMRGRKFFNTC-LARGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKA

Query:  PNPPSPLSHL
         +  S +  L
Subjt:  PNPPSPLSHL

KAA0062742.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.3e-12; % identity: 62.5)
Query:  RGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        RG + E +QV+FIL SLP S++SF+TNAS+NKI+FNLTTLL+ELQ +++L K KGK V   EANVA +K KF
Subjt:  RGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

KAA0062742.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 6.1e-05; % identity: 33.61)
Query:  SQVSTPPSSKKNKFGFNRTELNRTILVRFKNRNRDSFGFLVQETAKPTSSVRSMV--RPKTGP---NRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFH
        S +S   ++  NK  FN T L    L RF+N  +     +    A      +  +  + K GP   NR +    KGK P Q      K K +A+KG C+H
Subjt:  SQVSTPPSSKKNKFGFNRTELNRTILVRFKNRNRDSFGFLVQETAKPTSSVRSMV--RPKTGP---NRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFH

Query:  CNADEHWKRNCPRYLAEKR
        C  + HW RNCP++LA+K+
Subjt:  CNADEHWKRNCPRYLAEKR

KAA0062742.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.2e-12; % identity: 59.15)
Query:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        GV+ E +Q++FIL SLP S+++F+TNAS+NKI+FNLTTL++ELQ +++L K KGK V   EANVA +K+KF
Subjt:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

TYK23836.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 8.8e-12; % identity: 58.23)
Query:  FNTCLARGVVC-ERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        FN     GVV  E +Q++FIL SLP SY+ F+TNAS+NKI+FNL TLL+ELQ +++L K KGK V   EANVA +K+KF
Subjt:  FNTCLARGVVC-ERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2N1 Gag/pol protein (e value: 4.3e-12; % identity: 45.45)
Query:  HAATGLGWTERTMRGRKFFNTC-LARGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKA
        H   G+   E  M     FN   +  G + E +QV+FIL SLP S++ F+TNAS+NKI+FNLTT L+ELQ +++L K KGK V   EANVA +K+KF + 
Subjt:  HAATGLGWTERTMRGRKFFNTC-LARGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKA

Query:  PNPPSPLSHL
         +  S +  L
Subjt:  PNPPSPLSHL

A0A5D3DFR5 Gag/pol protein (e value: 1.1e-12; % identity: 62.5)
Query:  RGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        RG + E +QV+FIL SLP S++SF+TNAS+NKI+FNLTTLL+ELQ +++L K KGK V   EANVA +K KF
Subjt:  RGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

A0A5D3DFR5 Gag/pol protein (e value: 3.0e-05; % identity: 33.61)
Query:  SQVSTPPSSKKNKFGFNRTELNRTILVRFKNRNRDSFGFLVQETAKPTSSVRSMV--RPKTGP---NRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFH
        S +S   ++  NK  FN T L    L RF+N  +     +    A      +  +  + K GP   NR +    KGK P Q      K K +A+KG C+H
Subjt:  SQVSTPPSSKKNKFGFNRTELNRTILVRFKNRNRDSFGFLVQETAKPTSSVRSMV--RPKTGP---NRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFH

Query:  CNADEHWKRNCPRYLAEKR
        C  + HW RNCP++LA+K+
Subjt:  CNADEHWKRNCPRYLAEKR

A0A5D3DFR5 Gag/pol protein (e value: 2.5e-12; % identity: 59.15)
Query:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        GV+ E +Q++FIL SLP S+++F+TNAS+NKI+FNLTTL++ELQ +++L K KGK V   EANVA +K+KF
Subjt:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

A0A5D3DJJ4 Gag/pol protein (e value: 4.3e-12; % identity: 58.23)
Query:  FNTCLARGVVC-ERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF
        FN     GVV  E +Q++FIL SLP SY+ F+TNAS+NKI+FNL TLL+ELQ +++L K KGK V   EANVA +K+KF
Subjt:  FNTCLARGVVC-ERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKF

E2GK51 Gag/pol protein (Fragment) (e value: 3.3e-12; % identity: 26.8)
Query:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKAPNPPSPLSHLSSPVGANLRRHPRQRQ
        G + E +QV+FIL SLP S++ F+TNAS+NKI+FNLTTLL+ELQ +++L  SKGK V   EANVA +K+KF++                           
Subjt:  GVVCERSQVAFILHSLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKAPNPPSPLSHLSSPVGANLRRHPRQRQ

Query:  QQQRKDRESGGVVCVGFFFSFCCSSSLNLLSFVLSHFRPLPLCLLLNSGESPSLTSLPCRFFFSFSHAAVTRRSYLISQVSTPPSSKKNKFGFNRTELNR
                                                                                            SS KNK G ++ ++ +
Subjt:  QQQRKDRESGGVVCVGFFFSFCCSSSLNLLSFVLSHFRPLPLCLLLNSGESPSLTSLPCRFFFSFSHAAVTRRSYLISQVSTPPSSKKNKFGFNRTELNR

Query:  TILVRFKNRNRDSFGFLVQETAKPTSSVRSMVRPKTGPNRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFHCNADEHWKRNCPRYLAEKR
                                                      KGKAP        K K  ADKG CFHCN D HWKRNCP+YLAEK+
Subjt:  TILVRFKNRNRDSFGFLVQETAKPTSSVRSMVRPKTGPNRTVTTPNKGKAPAQAVQGKGKAKVVADKGMCFHCNADEHWKRNCPRYLAEKR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCCAGATCTGAAACCCTTCCCCGTAATCCGCCACCGTTGGAGGAGAAAACGTCATGGTTTCTCGCGATTTTGGTCCGTTGCAGTCGAAAGCGAGAAAAGAGAAGGAG
AGCCAACAAGAGAGAAGAAGAAATTAGAGCTGCTGGGTCGCACGCGGCTACTGGGCTAGGCTGGACAGAACGAACGATGCGAGGAAGAAAGTTTTTTAACACGTGCCTTG
CACGCGGGGTTGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTCGCTCCCAGCGAGTTATCTGTCATTCAGGACGAACGCGAGCATGAATAAGATTCAGTTCAAC
CTGACTACCCTCCTCTCGGAGTTACAGATTTATGAATCCTTGCAAAAGAGCAAGGGAAAAAATGTGGTTAAAGGAGAGGCCAATGTGGCCCATTCCAAGAAGAAGTTCCT
GAAGGCCCCCAACCCTCCATCTCCTCTCTCCCATTTGTCTTCTCCCGTAGGTGCAAATCTCCGACGGCATCCACGGCAACGGCAGCAGCAGCAACGTAAAGATCGTGAGT
CCGGCGGCGTCGTGTGCGTGGGTTTTTTTTTCTCGTTTTGTTGTTCTTCAAGCTTGAACCTTCTCTCCTTCGTTCTCTCTCACTTTCGACCTCTCCCTCTCTGTCTTCTC
TTAAATTCCGGCGAGTCTCCATCTCTCACGTCGCTGCCCTGCCGCTTCTTCTTCTCCTTCTCTCACGCCGCCGTCACTCGTCGGAGTTATCTCATCTCTCAAGTCTCAAC
GCCGCCGTCGTCCAAAAAAAATAAGTTCGGTTTTAACCGAACCGAACTGAACCGAACCATTTTGGTTCGGTTCAAAAACCGAAACCGAGATAGTTTCGGTTTTTTGGTTC
AAGAGACTGCAAAACCGACAAGTTCGGTTCGGTCCATGGTTCGGCCCAAAACCGGACCGAACCGAACCGTGACCACCCCTAACAAGGGGAAGGCTCCTGCACAGGCTGTG
CAAGGGAAGGGAAAGGCCAAGGTCGTGGCCGACAAAGGCATGTGCTTCCACTGCAACGCAGATGAACATTGGAAGCGGAACTGTCCCCGTTACCTTGCTGAGAAGAGAAG
ATAA
mRNA sequence
ATGCCCAGATCTGAAACCCTTCCCCGTAATCCGCCACCGTTGGAGGAGAAAACGTCATGGTTTCTCGCGATTTTGGTCCGTTGCAGTCGAAAGCGAGAAAAGAGAAGGAG
AGCCAACAAGAGAGAAGAAGAAATTAGAGCTGCTGGGTCGCACGCGGCTACTGGGCTAGGCTGGACAGAACGAACGATGCGAGGAAGAAAGTTTTTTAACACGTGCCTTG
CACGCGGGGTTGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTCGCTCCCAGCGAGTTATCTGTCATTCAGGACGAACGCGAGCATGAATAAGATTCAGTTCAAC
CTGACTACCCTCCTCTCGGAGTTACAGATTTATGAATCCTTGCAAAAGAGCAAGGGAAAAAATGTGGTTAAAGGAGAGGCCAATGTGGCCCATTCCAAGAAGAAGTTCCT
GAAGGCCCCCAACCCTCCATCTCCTCTCTCCCATTTGTCTTCTCCCGTAGGTGCAAATCTCCGACGGCATCCACGGCAACGGCAGCAGCAGCAACGTAAAGATCGTGAGT
CCGGCGGCGTCGTGTGCGTGGGTTTTTTTTTCTCGTTTTGTTGTTCTTCAAGCTTGAACCTTCTCTCCTTCGTTCTCTCTCACTTTCGACCTCTCCCTCTCTGTCTTCTC
TTAAATTCCGGCGAGTCTCCATCTCTCACGTCGCTGCCCTGCCGCTTCTTCTTCTCCTTCTCTCACGCCGCCGTCACTCGTCGGAGTTATCTCATCTCTCAAGTCTCAAC
GCCGCCGTCGTCCAAAAAAAATAAGTTCGGTTTTAACCGAACCGAACTGAACCGAACCATTTTGGTTCGGTTCAAAAACCGAAACCGAGATAGTTTCGGTTTTTTGGTTC
AAGAGACTGCAAAACCGACAAGTTCGGTTCGGTCCATGGTTCGGCCCAAAACCGGACCGAACCGAACCGTGACCACCCCTAACAAGGGGAAGGCTCCTGCACAGGCTGTG
CAAGGGAAGGGAAAGGCCAAGGTCGTGGCCGACAAAGGCATGTGCTTCCACTGCAACGCAGATGAACATTGGAAGCGGAACTGTCCCCGTTACCTTGCTGAGAAGAGAAG
ATAA
Protein sequence
MPRSETLPRNPPPLEEKTSWFLAILVRCSRKREKRRRANKREEEIRAAGSHAATGLGWTERTMRGRKFFNTCLARGVVCERSQVAFILHSLPASYLSFRTNASMNKIQFN
LTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKAPNPPSPLSHLSSPVGANLRRHPRQRQQQQRKDRESGGVVCVGFFFSFCCSSSLNLLSFVLSHFRPLPLCLL
LNSGESPSLTSLPCRFFFSFSHAAVTRRSYLISQVSTPPSSKKNKFGFNRTELNRTILVRFKNRNRDSFGFLVQETAKPTSSVRSMVRPKTGPNRTVTTPNKGKAPAQAV
QGKGKAKVVADKGMCFHCNADEHWKRNCPRYLAEKRR
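
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame. `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove line breaks and other whitespace so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above.
print(translate("ATGCCCAGATCTGAAACCCTTCCCCGTAATCCG"))
```

Applied to the first 33 nucleotides of the CDS above, this gives `MPRSETLPRNP`, which matches the start of the listed protein sequence.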