; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008047 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008047
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:10636053..10637495
RNA-Seq ExpressionLag0008047
SyntenyLag0008047
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]3.0e-3953.8Show/hide
Query:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRV
        GLTLLAQ+ MPL YWWE F+  V+LINRLP+PV    SP+       PD+   + FGCAC+PCL+PYN HK QF +TKCVFLGYS SHKGYKC++S+GRV
Subjt:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRV

Query:  FISCHVVFNEFDFPFKSDFLQHSGSSLSDNVVLHWLPFHKSSNLPYPQSLPLSARVQA
        FIS HVVFNE  FPF   F       L+  V L  L    SS+ P   + P S+  ++
Subjt:  FISCHVVFNEFDFPFKSDFLQHSGSSLSDNVVLHWLPFHKSSNLPYPQSLPLSARVQA

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]9.3e-4165.83Show/hide
Query:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRV
        GLTLLAQ+ MPL YWWE F+  VFLINRLPT V+   SP+++ F   PD+   + FGCAC+PCL+PYN HK QF +TKCVFLGYS SHKGYKCL+S GR+
Subjt:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRV

Query:  FISCHVVFNEFDFPFKSDFL
        FIS HVVFNE  FPF   FL
Subjt:  FISCHVVFNEFDFPFKSDFL

KYP75364.1 Copia protein [Cajanus cajan]3.0e-3958.27Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        +GLTLLAQ+ MPL + WE F+  VFLINRLPTP++   SP+       PD+   + FGCAC+PC++PYNAHK Q+ +TKCVFLGYS SHKG+KC++SNGR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQHSGSS
        +FIS HV+FNE +FPF   FL    S+
Subjt:  VFISCHVVFNEFDFPFKSDFLQHSGSS

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense]1.9e-3848.7Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        +GLT+LAQ+ MPLCYWWE F+ +V+LINRLP+ +     P+   +   PD+   + FGCAC+PCL+PYN HK QF +T+CVFLGYS SHKGYKC++S+GR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQHSGS----SLSDNVVLHWLPFHKSSNLPYPQS
        +F+S HVVFNE  FPF   FL         + +D ++    P   ++N+  P++
Subjt:  VFISCHVVFNEFDFPFKSDFLQHSGS----SLSDNVVLHWLPFHKSSNLPYPQS

TXG59466.1 hypothetical protein EZV62_014039 [Acer yangbiense]7.9e-4060.47Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        MGLTLLAQ+ +PL +WWE F   V+ INRLPTP+LG +SP++  F   PD++F +VFGCACFP LRPYN HKF F S+KC+ LGYS +HKGYKCL  +GR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQHSGSSLS
        V+IS +V+FNE DFP+ S F   + SS+S
Subjt:  VFISCHVVFNEFDFPFKSDFLQHSGSSLS

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-4165.83Show/hide
Query:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRV
        GLTLLAQ+ MPL YWWE F+  VFLINRLPT V+   SP+++ F   PD+   + FGCAC+PCL+PYN HK QF +TKCVFLGYS SHKGYKCL+S GR+
Subjt:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRV

Query:  FISCHVVFNEFDFPFKSDFL
        FIS HVVFNE  FPF   FL
Subjt:  FISCHVVFNEFDFPFKSDFL

A0A151U7U2 Copia protein1.5e-3958.27Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        +GLTLLAQ+ MPL + WE F+  VFLINRLPTP++   SP+       PD+   + FGCAC+PC++PYNAHK Q+ +TKCVFLGYS SHKG+KC++SNGR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQHSGSS
        +FIS HV+FNE +FPF   FL    S+
Subjt:  VFISCHVVFNEFDFPFKSDFLQHSGSS

A0A5C7HT39 Integrase catalytic domain-containing protein3.8e-4060.47Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        MGLTLLAQ+ +PL +WWE F   V+ INRLPTP+LG +SP++  F   PD++F +VFGCACFP LRPYN HKF F S+KC+ LGYS +HKGYKCL  +GR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQHSGSSLS
        V+IS +V+FNE DFP+ S F   + SS+S
Subjt:  VFISCHVVFNEFDFPFKSDFLQHSGSSLS

A0A803Q6A2 Uncharacterized protein1.8e-4262.04Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        MGLTLLAQ+S+P  YWW+ F  +V+LINRLPT VLG  +P+E  F   PD++F +VFG ACFPCLRPY +HKFQF STKCV LGYS SH+GYKCLSS GR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQ-HSGSSLSDNVVLHW
        ++IS HVVFNE +FPF+  FL  H   +L    V +W
Subjt:  VFISCHVVFNEFDFPFKSDFLQ-HSGSSLSDNVVLHW

A0A803QCY3 Uncharacterized protein1.1e-3959.7Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR
        MGLTLLAQS MPL YWW+ F+  V+LINRLPTP+L   +P+E     +PD++F + FG ACFPCLRPY AHKFQF S KCV LGYS +HKGYKCLS  GR
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGR

Query:  VFISCHVVFNEFDFPFKSDFLQHSGSSLSDNVVL
        ++I   VVFNE +FPF+  FL    +  S+N+V+
Subjt:  VFISCHVVFNEFDFPFKSDFLQHSGSSLSDNVVL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-2952.54Show/hide
Query:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLS-SNGR
        GLTLL+ +S+P  YW   F + V+LINRLPTP+L   SP++K FG  P++   RVFGCAC+P LRPYN HK   +S +CVFLGYS +   Y CL     R
Subjt:  GLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLS-SNGR

Query:  VFISCHVVFNEFDFPFKS
        ++IS HV F+E  FPF +
Subjt:  VFISCHVVFNEFDFPFKS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-2950.42Show/hide
Query:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLS-SNG
        MGLTLL+ +S+P  YW   F++ V+LINRLPTP+L   SP++K FG  P+++  +VFGCAC+P LRPYN HK + +S +C F+GYS +   Y CL    G
Subjt:  MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLS-SNG

Query:  RVFISCHVVFNEFDFPFKS
        R++ S HV F+E  FPF +
Subjt:  RVFISCHVVFNEFDFPFKS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCTTACGCTTCTAGCTCAATCTTCTATGCCTCTGTGTTATTGGTGGGAAGTTTTTACCTTAACTGTCTTTCTTATTAATCGTCTCCCTACACCGGTCCTTGGCAA
ACTTTCCCCTTGGGAAAAAGCTTTTGGTCATGTTCCTGACTTTCAGTTTTTTCGCGTCTTTGGCTGTGCATGTTTCCCTTGTTTACGTCCCTATAATGCTCACAAATTTC
AGTTTAGAAGCACAAAGTGTGTCTTCCTTGGTTACAGTTTCTCACATAAAGGGTACAAATGTCTGAGTTCCAATGGTCGTGTCTTTATTTCCTGTCATGTGGTATTTAAT
GAATTTGATTTTCCTTTCAAGTCTGATTTCCTTCAGCATTCTGGTTCCTCTCTCTCTGATAATGTTGTTCTCCATTGGCTCCCTTTTCATAAGTCTTCCAATTTGCCTTA
TCCTCAGTCTTTACCTCTTTCGGCGAGAGTGCAAGCCTACTTGCGTGAGCTTCAGACTTGGTCTCTTCATCGGCATGGGGGATTGCTCCAGAGGGCCATCAAGTGTAAAA
AGGAGGAAATTTGTATTTTGGACTCTAGCAAGCTAGAAAACTTGGAGGCGAGTTGGGTTGCAGTTGAGAAGGATTTGTCTTTAATGGTTGAAAATAAGGCATACTGGCAA
CAAAGGTCGAGGGAGGAATGGTTGGTGTGGGGGGATAGGAATTTAAAGTGGTTTCATGCCCAAGCATCTCAGTGGCCAATGGCAGTTAGATCCGAGTATGATCGAGCATT
ACTCAAGGAGTTCACTCGAGCTAAGGTAGAGCCAGCTTTGAAAGGCATGGGCCCCACGAAGGCCCCAATGCCTGATGGGGTTAATGCTCTGTTTTATCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCTTACGCTTCTAGCTCAATCTTCTATGCCTCTGTGTTATTGGTGGGAAGTTTTTACCTTAACTGTCTTTCTTATTAATCGTCTCCCTACACCGGTCCTTGGCAA
ACTTTCCCCTTGGGAAAAAGCTTTTGGTCATGTTCCTGACTTTCAGTTTTTTCGCGTCTTTGGCTGTGCATGTTTCCCTTGTTTACGTCCCTATAATGCTCACAAATTTC
AGTTTAGAAGCACAAAGTGTGTCTTCCTTGGTTACAGTTTCTCACATAAAGGGTACAAATGTCTGAGTTCCAATGGTCGTGTCTTTATTTCCTGTCATGTGGTATTTAAT
GAATTTGATTTTCCTTTCAAGTCTGATTTCCTTCAGCATTCTGGTTCCTCTCTCTCTGATAATGTTGTTCTCCATTGGCTCCCTTTTCATAAGTCTTCCAATTTGCCTTA
TCCTCAGTCTTTACCTCTTTCGGCGAGAGTGCAAGCCTACTTGCGTGAGCTTCAGACTTGGTCTCTTCATCGGCATGGGGGATTGCTCCAGAGGGCCATCAAGTGTAAAA
AGGAGGAAATTTGTATTTTGGACTCTAGCAAGCTAGAAAACTTGGAGGCGAGTTGGGTTGCAGTTGAGAAGGATTTGTCTTTAATGGTTGAAAATAAGGCATACTGGCAA
CAAAGGTCGAGGGAGGAATGGTTGGTGTGGGGGGATAGGAATTTAAAGTGGTTTCATGCCCAAGCATCTCAGTGGCCAATGGCAGTTAGATCCGAGTATGATCGAGCATT
ACTCAAGGAGTTCACTCGAGCTAAGGTAGAGCCAGCTTTGAAAGGCATGGGCCCCACGAAGGCCCCAATGCCTGATGGGGTTAATGCTCTGTTTTATCGATGA
Protein sequenceShow/hide protein sequence
MGLTLLAQSSMPLCYWWEVFTLTVFLINRLPTPVLGKLSPWEKAFGHVPDFQFFRVFGCACFPCLRPYNAHKFQFRSTKCVFLGYSFSHKGYKCLSSNGRVFISCHVVFN
EFDFPFKSDFLQHSGSSLSDNVVLHWLPFHKSSNLPYPQSLPLSARVQAYLRELQTWSLHRHGGLLQRAIKCKKEEICILDSSKLENLEASWVAVEKDLSLMVENKAYWQ
QRSREEWLVWGDRNLKWFHAQASQWPMAVRSEYDRALLKEFTRAKVEPALKGMGPTKAPMPDGVNALFYR