; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000784 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000784
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr4:16483573..16488567
RNA-Seq ExpressionLag0000784
SyntenyLag0000784
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]4.6e-7660.66Show/hide
Query:  SEALEKFKEYKAEVENLL------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVT
        SEALEKFKEYK EVENLL                                    VETAV+ILN  PSKSVSETPFELW G K SL HFRIWGCP HVLVT
Subjt:  SEALEKFKEYKAEVENLL------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVT

Query:  NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST-------------------
        NPKKL SRS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLE DH+R HKP+SK+VLS    EATD STRVVD+ GPS+                   
Subjt:  NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST-------------------

Query:  -----RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
             RVV +PN     +ET AVIPDD VEDPL++ Q+MNDVDKDQW KAMD EMESMYFN V +LVD PEG
Subjt:  -----RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-7358.91Show/hide
Query:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV
        SEALEKFKEYK EVENLL                                       VETAV+ILN VPSKSVSE PFELW   K SL HFRIWGCPAHV
Subjt:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV

Query:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------
        LVTNPKKL+ RS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLE DH+R HKP+SK+VLS    EATD STRVVD+ GPS+                
Subjt:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------

Query:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
                RVV +PN     +ET  VIPDD VEDPL++ Q+MNDVDKDQW KAMD EMESMYFN V +LVD  EG
Subjt:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

KAA0058368.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-7459.64Show/hide
Query:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV
        SEALEKFKEYK EVENLL                                       VETAV+ILN VPSKSVSETPFEL  G K SL HFRIWGCPAHV
Subjt:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV

Query:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------
        LVTNPKKL+ RS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLE DH+R HKP+SK+VLS    EATD STRVVD+ GPS+                
Subjt:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------

Query:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
                RVV +PN     +ET  VIPDD VEDPL++ Q+MNDVDKDQW KAMD EMESMYFN V +LVD PEG
Subjt:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

KAA0062410.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-7357.45Show/hide
Query:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV
        SEALEKFKEYKAEVENLL                                       V+TAV+ILN VPSKSVSETPFELW G K SL HFRIWGCPAHV
Subjt:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV

Query:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRV--------------
        LVTNPKKL+ RS+LC F+GYPKETRGG FFDPQEN+VF+ TNATFLE DH+R+HKPQSK+VL+    EATD STRVVD+ GPS+RV              
Subjt:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRV--------------

Query:  ----------VDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
                  V +PN     +ET  VIPDD  EDPL++ Q+ NDVDKDQW KAMD +MESMYFN + +LVD PEG
Subjt:  ----------VDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

TYK06521.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-7666.1Show/hide
Query:  SEALEKFKEYKAEVENLLVETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFV
        SEALEKFKEYKAEVENLL+ETAV ILN VPSKS+SET F+LW G K SL HFRIW CP HVL+TNPKKL+SRS+LC FVGYPKETRGG FFDPQEN+VFV
Subjt:  SEALEKFKEYKAEVENLLVETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFV

Query:  STNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRVVDEPNTS-----------------------------ETLAVIPDDEVEDPLTF
        STNATFLE  H+R+HKP+SK+VL+    +ATD STRVVD+ GPS+R VDE  TS                             ET  VIPDD VEDPL++
Subjt:  STNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRVVDEPNTS-----------------------------ETLAVIPDDEVEDPLTF

Query:  NQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPE
         Q+MNDVDK+QW KAMD EMESMYFN V +LVD PE
Subjt:  NQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPE

TrEMBL top hitse value%identityAlignment
A0A5A7SIN2 Gag/pol protein2.2e-7660.66Show/hide
Query:  SEALEKFKEYKAEVENLL------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVT
        SEALEKFKEYK EVENLL                                    VETAV+ILN  PSKSVSETPFELW G K SL HFRIWGCP HVLVT
Subjt:  SEALEKFKEYKAEVENLL------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVT

Query:  NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST-------------------
        NPKKL SRS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLE DH+R HKP+SK+VLS    EATD STRVVD+ GPS+                   
Subjt:  NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST-------------------

Query:  -----RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
             RVV +PN     +ET AVIPDD VEDPL++ Q+MNDVDKDQW KAMD EMESMYFN V +LVD PEG
Subjt:  -----RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

A0A5A7USZ2 Gag/pol protein1.0e-7358.91Show/hide
Query:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV
        SEALEKFKEYK EVENLL                                       VETAV+ILN VPSKSVSE PFELW   K SL HFRIWGCPAHV
Subjt:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV

Query:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------
        LVTNPKKL+ RS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLE DH+R HKP+SK+VLS    EATD STRVVD+ GPS+                
Subjt:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------

Query:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
                RVV +PN     +ET  VIPDD VEDPL++ Q+MNDVDKDQW KAMD EMESMYFN V +LVD  EG
Subjt:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

A0A5A7UT80 Gag/pol protein7.1e-7559.64Show/hide
Query:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV
        SEALEKFKEYK EVENLL                                       VETAV+ILN VPSKSVSETPFEL  G K SL HFRIWGCPAHV
Subjt:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV

Query:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------
        LVTNPKKL+ RS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLE DH+R HKP+SK+VLS    EATD STRVVD+ GPS+                
Subjt:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPST----------------

Query:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
                RVV +PN     +ET  VIPDD VEDPL++ Q+MNDVDKDQW KAMD EMESMYFN V +LVD PEG
Subjt:  --------RVVDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

A0A5D3C3N8 Gag/pol protein9.9e-7766.1Show/hide
Query:  SEALEKFKEYKAEVENLLVETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFV
        SEALEKFKEYKAEVENLL+ETAV ILN VPSKS+SET F+LW G K SL HFRIW CP HVL+TNPKKL+SRS+LC FVGYPKETRGG FFDPQEN+VFV
Subjt:  SEALEKFKEYKAEVENLLVETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFV

Query:  STNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRVVDEPNTS-----------------------------ETLAVIPDDEVEDPLTF
        STNATFLE  H+R+HKP+SK+VL+    +ATD STRVVD+ GPS+R VDE  TS                             ET  VIPDD VEDPL++
Subjt:  STNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRVVDEPNTS-----------------------------ETLAVIPDDEVEDPLTF

Query:  NQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPE
         Q+MNDVDK+QW KAMD EMESMYFN V +LVD PE
Subjt:  NQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPE

A0A5D3DS50 Gag/pol protein7.9e-7457.45Show/hide
Query:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV
        SEALEKFKEYKAEVENLL                                       V+TAV+ILN VPSKSVSETPFELW G K SL HFRIWGCPAHV
Subjt:  SEALEKFKEYKAEVENLL---------------------------------------VETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHV

Query:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRV--------------
        LVTNPKKL+ RS+LC F+GYPKETRGG FFDPQEN+VF+ TNATFLE DH+R+HKPQSK+VL+    EATD STRVVD+ GPS+RV              
Subjt:  LVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRV--------------

Query:  ----------VDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
                  V +PN     +ET  VIPDD  EDPL++ Q+ NDVDKDQW KAMD +MESMYFN + +LVD PEG
Subjt:  ----------VDEPN----TSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.1e-1126.45Show/hide
Query:  VETAVYILNVVPSKSVS-ETPFELWGGHKDSLRHFRIWGCP--AHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE-------
        V+TA Y++N  PS  ++ E P  +W   + S  H +++GC   AHV      KLD +S  C+F+GY  E  G   +DP + KV  S +  F E       
Subjt:  VETAVYILNVVPSKSVS-ETPFELWGGHKDSLRHFRIWGCP--AHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE-------

Query:  ------------------------------VDHVREHKPQSKMVLSELSGEATDTSTRVVD------------KAGPSTRVVDEPNTSETLAVIPDDEVE
                                       D V E   Q   V+ +  GE  D     V+            +     RV      S    +I DD   
Subjt:  ------------------------------VDHVREHKPQSKMVLSELSGEATDTSTRVVD------------KAGPSTRVVDEPNTSETLAVIPDDEVE

Query:  DPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG
        +P +  + ++  +K+Q  KAM  EMES+  N    LV+ P+G
Subjt:  DPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.8e-0527.78Show/hide
Query:  AVYILNVVPSKSVS-ETPFELWGGHKDSLRHFRIWGCPAHVLVT--NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE
        AVY++N +P+  +  E+PF+   G   +    R++GC  +  +   N  KLD +S+ C+F+GY            Q +++++S +  F E
Subjt:  AVYILNVVPSKSVS-ETPFELWGGHKDSLRHFRIWGCPAHVLVT--NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGAGTGGCCTACACGCTGACTCAATAAGCCTACCATTTTGGGGACAAGATCGAGTGGGGAGCTGGGAACATAATCACACAAAATGGAATTCACTCCTTCCCGA
CTCTAGGGTAAATAGAGGTGTGTTCCCTGAAGTGGTGTCTTCGGGTCTTGAACAATGGGCCCTACCCTCTCTATGGCACGAGAAGGACTTCTGTTTGTTGGTTGGACCTC
AAACAGGTCCATCAGGTCCCACTGATAGCTCAAATAAGGGCCCTTGGGGTGGGGTGAAAGACATCTCATTCATTCTCCTACTCTCAGAGAATTCCCTTGAAGGCTCCCAC
CAGTCTCCTACCTCTAGAAGTCTTAGAGTCATACTGTCTGAAGCCCTAGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAGAACTTGTTAGTAGAGACTGCAGTGTATAT
TTTGAATGTAGTTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTCTGGGGAGGGCATAAAGATAGTTTACGTCATTTTAGAATTTGGGGCTGCCCTGCACATGTGC
TAGTGACAAATCCCAAGAAATTGGATTCACGTTCCAAATTATGCCTATTCGTAGGCTACCCAAAAGAAACAAGAGGTGGATACTTCTTCGATCCACAAGAAAATAAAGTG
TTTGTGTCGACAAATGCTACTTTTTTGGAAGTAGACCACGTTAGAGAACATAAGCCACAAAGTAAAATGGTGTTAAGTGAACTTTCTGGTGAAGCTACAGATACATCAAC
AAGAGTTGTTGATAAGGCTGGACCTTCAACAAGAGTTGTTGATGAACCCAATACATCTGAAACTCTAGCTGTCATACCTGATGATGAGGTCGAGGATCCATTGACCTTTA
ACCAGTCAATGAATGATGTGGATAAAGACCAGTGGTTCAAAGCTATGGACTTTGAGATGGAGTCAATGTACTTCAATCAAGTTTTGGATCTTGTAGACCAACCTGAAGGG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGAGTGGCCTACACGCTGACTCAATAAGCCTACCATTTTGGGGACAAGATCGAGTGGGGAGCTGGGAACATAATCACACAAAATGGAATTCACTCCTTCCCGA
CTCTAGGGTAAATAGAGGTGTGTTCCCTGAAGTGGTGTCTTCGGGTCTTGAACAATGGGCCCTACCCTCTCTATGGCACGAGAAGGACTTCTGTTTGTTGGTTGGACCTC
AAACAGGTCCATCAGGTCCCACTGATAGCTCAAATAAGGGCCCTTGGGGTGGGGTGAAAGACATCTCATTCATTCTCCTACTCTCAGAGAATTCCCTTGAAGGCTCCCAC
CAGTCTCCTACCTCTAGAAGTCTTAGAGTCATACTGTCTGAAGCCCTAGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAGAACTTGTTAGTAGAGACTGCAGTGTATAT
TTTGAATGTAGTTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTCTGGGGAGGGCATAAAGATAGTTTACGTCATTTTAGAATTTGGGGCTGCCCTGCACATGTGC
TAGTGACAAATCCCAAGAAATTGGATTCACGTTCCAAATTATGCCTATTCGTAGGCTACCCAAAAGAAACAAGAGGTGGATACTTCTTCGATCCACAAGAAAATAAAGTG
TTTGTGTCGACAAATGCTACTTTTTTGGAAGTAGACCACGTTAGAGAACATAAGCCACAAAGTAAAATGGTGTTAAGTGAACTTTCTGGTGAAGCTACAGATACATCAAC
AAGAGTTGTTGATAAGGCTGGACCTTCAACAAGAGTTGTTGATGAACCCAATACATCTGAAACTCTAGCTGTCATACCTGATGATGAGGTCGAGGATCCATTGACCTTTA
ACCAGTCAATGAATGATGTGGATAAAGACCAGTGGTTCAAAGCTATGGACTTTGAGATGGAGTCAATGTACTTCAATCAAGTTTTGGATCTTGTAGACCAACCTGAAGGG
TAA
Protein sequenceShow/hide protein sequence
MGESGLHADSISLPFWGQDRVGSWEHNHTKWNSLLPDSRVNRGVFPEVVSSGLEQWALPSLWHEKDFCLLVGPQTGPSGPTDSSNKGPWGGVKDISFILLLSENSLEGSH
QSPTSRSLRVILSEALEKFKEYKAEVENLLVETAVYILNVVPSKSVSETPFELWGGHKDSLRHFRIWGCPAHVLVTNPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKV
FVSTNATFLEVDHVREHKPQSKMVLSELSGEATDTSTRVVDKAGPSTRVVDEPNTSETLAVIPDDEVEDPLTFNQSMNDVDKDQWFKAMDFEMESMYFNQVLDLVDQPEG