CuGenDBv2

Lag0034853 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034853
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:11639370..11643326
RNA-Seq Expression: Lag0034853
Synteny: Lag0034853
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains: NA
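The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, the span can be parsed and measured as shown here. The coordinates are assumed to be 1-based and inclusive (the record does not state the convention), and the names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1

chrom, start, end = parse_location("chr3:11639370..11643326")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 3957 bp on chr3, which is longer than the 855 nt coding sequence listed under Sequences.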


Homology
GenBank top hits (e-value, % identity, alignment)
KZV33171.1 Integrase, catalytic core domain containing protein [Dorcoceras hygrometricum] (e-value: 5.9e-48; identity: 66.03%)
Query:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI
        L ENR +  E+ +  A  D+S    STWYLDNG +NHMTGDK KFVELDTS+KGFVSFGDNTKV+I GK  +L E K+G HKVL DVYY+PKLTSNILSI
Subjt:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI

Query:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        GQLLE  YKI+MED  LW+RD +S L+A+V MTKN MF L+LK+ G  CLKS V D
Subjt:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

KZV34378.1 Integrase, catalytic core domain containing protein [Dorcoceras hygrometricum] (e-value: 2.8e-50; identity: 69.23%)
Query:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI
        L ENR +  E+ +  A KD+S    STWYLDNGASNHMTGDK KFVELDTS+KGFVSFGDNTKV+I GKG +L E K+G HKVL DV Y+PKLTSNILSI
Subjt:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI

Query:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        GQLLE  YKI+M D  LW+RD +S L+AKV MTKNRMFLL+LK+ G  CLKS V D
Subjt:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

KZV44615.1 hypothetical protein F511_33023 [Dorcoceras hygrometricum] (e-value: 3.7e-50; identity: 68.59%)
Query:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI
        L ENR +  E+ +  A KD+S    STWYLDNGASNHMTGDK KFVELDTS+KGFVSFGDNTKV+I GKG +L E K+G HKVL DVYY+P LTSNILSI
Subjt:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI

Query:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        GQLLE  YKI++ED  LW+RD  S L+AKV MTKNR FLL+LK+ G  CLKS V D
Subjt:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

XP_021649648.1 uncharacterized protein LOC110642034 [Hevea brasiliensis] (e-value: 2.8e-42; identity: 60.84%)
Query:  EASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLLEGGYKIHMED
        +A KD   D+ S+WYLDNGASNHM G K KFVELD   K  VSF D++KV+I G+G +L+ +KDGGH ++ +VYY+PKL SNILS+GQLLE GY+I ++D
Subjt:  EASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLLEGGYKIHMED

Query:  CMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDDE
        C LWLRDQ + ++AKV M+KNRMF+LNLK   A+CLK+ V+DE
Subjt:  CMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDDE

XP_021649651.1 uncharacterized protein LOC110642036 [Hevea brasiliensis] (e-value: 2.8e-42; identity: 60.84%)
Query:  EASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLLEGGYKIHMED
        +A KD   D+ S+WYLDNGASNHM G K KFVELD   K  VSF D++KV+I G+G +L+ +KDGGH ++ +VYY+PKL SNILS+GQLLE GY+I ++D
Subjt:  EASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLLEGGYKIHMED

Query:  CMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDDE
        C LWLRDQ + ++AKV M+KNRMF+LNLK   A+CLK+ V+DE
Subjt:  CMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDDE

TrEMBL top hits (e-value, % identity, alignment)
A0A2I0WQY0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 7.1e-39; identity: 51.59%)
Query:  VKWLGENRGWEKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILS
        V ++ E R  + I+  A K++   E  TWYLD GASNHM G +S FVELD +  G VSFGD++K+E+ GKG +L+  K+G H+ + +VY++P + SNILS
Subjt:  VKWLGENRGWEKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILS

Query:  IGQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        +GQLLE GY IH+++  L+L+D    L+AKVPM++NRMFLLN++N  AKCLK+C  D
Subjt:  IGQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

A0A2Z7BMK3 Integrase, catalytic core domain containing protein (e-value: 2.9e-48; identity: 66.03%)
Query:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI
        L ENR +  E+ +  A  D+S    STWYLDNG +NHMTGDK KFVELDTS+KGFVSFGDNTKV+I GK  +L E K+G HKVL DVYY+PKLTSNILSI
Subjt:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI

Query:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        GQLLE  YKI+MED  LW+RD +S L+A+V MTKN MF L+LK+ G  CLKS V D
Subjt:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

A0A2Z7BQJ1 Integrase, catalytic core domain containing protein (e-value: 1.4e-50; identity: 69.23%)
Query:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI
        L ENR +  E+ +  A KD+S    STWYLDNGASNHMTGDK KFVELDTS+KGFVSFGDNTKV+I GKG +L E K+G HKVL DV Y+PKLTSNILSI
Subjt:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI

Query:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        GQLLE  YKI+M D  LW+RD +S L+AKV MTKNRMFLL+LK+ G  CLKS V D
Subjt:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

A0A2Z7CD47 CCHC-type domain-containing protein (e-value: 1.8e-50; identity: 68.59%)
Query:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI
        L ENR +  E+ +  A KD+S    STWYLDNGASNHMTGDK KFVELDTS+KGFVSFGDNTKV+I GKG +L E K+G HKVL DVYY+P LTSNILSI
Subjt:  LGENRGW--EKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSI

Query:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        GQLLE  YKI++ED  LW+RD  S L+AKV MTKNR FLL+LK+ G  CLKS V D
Subjt:  GQLLEGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

A0A445J0L7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 3.5e-38; identity: 51.97%)
Query:  ENRGWEKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLL
        E  G E+ +  A ++   ++ + WYLD GASNHM GDKS FVE++    G VSFGD++K+ + GKGK+L+  K+G H+ + +VYY+P + +NILS+GQLL
Subjt:  ENRGWEKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLL

Query:  EGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD
        E GY IH+++  L+LRD    L+AKVPM+KNRMFLLN++N  AKCLK+C  D
Subjt:  EGGYKIHMEDCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDD

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found
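
The % identity values in the hit tables can be related to the alignment blocks: in BLAST-style output the middle line repeats a residue letter where query and subject agree, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts letter columns on that middle line and divides by the number of aligned columns. The function name `percent_identity` is hypothetical, and the input is a toy fragment rather than a full alignment from this record. The reported values appear consistent with this definition, for example 69.23 corresponds to 108 identical columns out of 156.

```python
def percent_identity(query, midline):
    """Percent of aligned columns whose midline character is a residue letter.

    `query` is the aligned query row (gaps included); `midline` is the
    match line printed beneath it. Trailing spaces lost from the midline
    are restored by padding to the query length.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return round(100.0 * identical / columns, 2)

# Toy fragment: 4 identical columns (L, E, N, R) out of 7.
print(percent_identity("LGENRGW", "L ENR +"))

# Arithmetic check against a reported value: 108 of 156 columns.
print(round(100.0 * 108 / 156, 2))
```

For a hit whose alignment wraps over several blocks, the query rows and middle lines would be concatenated block by block before calling the function.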

Sequences
CDS sequence
ATGGCTAGTTATCAAGCGACAACAACTCATCTTTCTTCTTCTCTCACTCACGAACCAACAATCCGAGGGCTATCATTGATGGCCAGTTATCAAGTGAACGAAATACACTT
ATTTTTTTTAGTTGGAGAAAATCGTTGTCGTTGTGCCTTGTTCGCCGTCGTCGTCACCTTCGTCGTGCCTGTCTCTTTCGCCGTCGCCTTCGTCGTGCCTGTCGCTTTCA
CCGTCGCCTTCGTCGTGCCTGTCGCTTTTGTCGTCGCCTTCGTCTTCGTCGTGCCTATCGTTGTCGTCGTCGCCTCCGTCGTGCCTGTCGCCGTCGCCGTGAAGTGGTTG
GGAGAAAATCGTGGTTGGGAGAAAATCGTGGAGGAGGCTTCAAAGGATGATTCCATTGATGAGACAAGCACCTGGTATCTTGACAATGGTGCTAGCAATCATATGACAGG
TGATAAAAGCAAATTTGTGGAGCTTGATACAAGCAAGAAAGGCTTTGTAAGCTTTGGTGACAACACGAAGGTGGAGATCATGGGCAAAGGTAAAGTTTTGGTTGAGACAA
AAGATGGAGGCCATAAAGTTCTTTGTGATGTTTATTACATTCCAAAGTTGACTAGTAATATTCTAAGTATTGGTCAACTTTTGGAAGGAGGCTACAAGATTCACATGGAG
GATTGTATGCTTTGGCTTAGAGACCAAGAGTCCAAACTTGTAGCCAAAGTGCCAATGACCAAAAATCGGATGTTCTTGTTGAACTTGAAGAATGGTGGTGCCAAGTGCTT
GAAAAGTTGTGTTGATGATGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCAACGATTGGACCTTACGCCGTATATTAATTAA
mRNA sequence
ATGGCTAGTTATCAAGCGACAACAACTCATCTTTCTTCTTCTCTCACTCACGAACCAACAATCCGAGGGCTATCATTGATGGCCAGTTATCAAGTGAACGAAATACACTT
ATTTTTTTTAGTTGGAGAAAATCGTTGTCGTTGTGCCTTGTTCGCCGTCGTCGTCACCTTCGTCGTGCCTGTCTCTTTCGCCGTCGCCTTCGTCGTGCCTGTCGCTTTCA
CCGTCGCCTTCGTCGTGCCTGTCGCTTTTGTCGTCGCCTTCGTCTTCGTCGTGCCTATCGTTGTCGTCGTCGCCTCCGTCGTGCCTGTCGCCGTCGCCGTGAAGTGGTTG
GGAGAAAATCGTGGTTGGGAGAAAATCGTGGAGGAGGCTTCAAAGGATGATTCCATTGATGAGACAAGCACCTGGTATCTTGACAATGGTGCTAGCAATCATATGACAGG
TGATAAAAGCAAATTTGTGGAGCTTGATACAAGCAAGAAAGGCTTTGTAAGCTTTGGTGACAACACGAAGGTGGAGATCATGGGCAAAGGTAAAGTTTTGGTTGAGACAA
AAGATGGAGGCCATAAAGTTCTTTGTGATGTTTATTACATTCCAAAGTTGACTAGTAATATTCTAAGTATTGGTCAACTTTTGGAAGGAGGCTACAAGATTCACATGGAG
GATTGTATGCTTTGGCTTAGAGACCAAGAGTCCAAACTTGTAGCCAAAGTGCCAATGACCAAAAATCGGATGTTCTTGTTGAACTTGAAGAATGGTGGTGCCAAGTGCTT
GAAAAGTTGTGTTGATGATGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCAACGATTGGACCTTACGCCGTATATTAATTAA
Protein sequence
MASYQATTTHLSSSLTHEPTIRGLSLMASYQVNEIHLFFLVGENRCRCALFAVVVTFVVPVSFAVAFVVPVAFTVAFVVPVAFVVAFVFVVPIVVVVASVVPVAVAVKWL
GENRGWEKIVEEASKDDSIDETSTWYLDNGASNHMTGDKSKFVELDTSKKGFVSFGDNTKVEIMGKGKVLVETKDGGHKVLCDVYYIPKLTSNILSIGQLLEGGYKIHME
DCMLWLRDQESKLVAKVPMTKNRMFLLNLKNGGAKCLKSCVDDEGNKSPHAAEAQRLDLTPYIN
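
The CDS and protein sequences above are related by translation. As a minimal sketch, assuming the standard nuclear genetic code (the record does not name a translation table), the following translates a coding sequence codon by codon and stops at the first stop codon. The function name `translate` is a hypothetical helper, and only short fragments of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTAGTTATCAAGCG"))  # first six codons of the CDS
print(translate("CCGTATATTAATTAA"))     # last five codons, ending in TAA
```

The two fragments translate to MASYQA and PYIN, matching the start and end of the protein sequence listed above.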