CuGenDBv2

Lag0025554 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025554
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr10:15118488..15124811
RNA-Seq Expression: Lag0025554
Synteny: Lag0025554
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
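The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(text):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:15118488..15124811")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 6,324 bp, which is longer than the coding sequence listed further down in this record.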


Homology
GenBank top hits    e value    %identity    Alignment
AAD17414.1 copia-like retroelement pol polyprotein [Arabidopsis thaliana]    2.4e-13    61.9
Query:  NDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKC
        +DSDYA DLD+RRS++ Y+F + GN ISW+S+LQSVVALS+T+AE++AL E+VKE +W K  C
Subjt:  NDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKC

BAB09923.1 copia-like retrotransposable element [Arabidopsis thaliana]    2.4e-13    36.05
Query:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR--------------KCFSCSPIASQATTLANRQRSQVPPE
        DSDYA DLD+RRS++ ++FT  GN ISW+S LQ VVALS+T+AE++ALAE+VKE +W +                C S S IA    ++ + +   +   
Subjt:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR--------------KCFSCSPIASQATTLANRQRSQVPPE

Query:  ----KERYVAWSVKVSKLETSWPPLAKLHIFQHNTPECKDKNTVEQL
            +E+     ++V K+ T+W P     IF    P  K +  ++ L
Subjt:  ----KERYVAWSVKVSKLETSWPPLAKLHIFQHNTPECKDKNTVEQL

CAA20201.1 putative transposable element [Arabidopsis thaliana]    1.6e-14    55
Query:  LGVVGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFK
        +G+V   + + G    V    DSD+AADLDKRRS+S Y+FT+ GN +SW+S+LQ VVALSST+AEF+AL E+VKE +W +
Subjt:  LGVVGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFK

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa]    2.4e-13    66.67
Query:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR
        D+DYAADLDKRRSLS +IF L+GNV+SW+  LQ VVALS+T++E+++L E+VKE VW KR
Subjt:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR

XP_009787759.1 PREDICTED: uncharacterized protein LOC104235647 [Nicotiana sylvestris]    1.8e-13    46.39
Query:  VGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKCFSCSPIASQATTLANRQ
        VG   +K G    +    DSDYA DLD+RRS + YIFTL G+ +SW+STLQS+VALS+T+AE++A  E+VKE +W K      S +  ++T   + Q
Subjt:  VGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKCFSCSPIASQATTLANRQ

TrEMBL top hits    e value    %identity    Alignment
A0A1J3FQK8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)    8.7e-14    62.9
Query:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKC
        DSDYA DLDKRRS++ +IF ++GN +SWRS LQSVVALS+T+AE++AL+ +VKE +W K  C
Subjt:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKC

A0A1J3I7B7 Retrovirus-related Pol polyprotein from transposon TNT 1-94    8.7e-14    62.9
Query:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKC
        DSDYA DLDKRRS++ +IF ++GN +SWRS LQSVVALS+T+AE++AL+ +VKE +W K  C
Subjt:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKC

A0A1U7XLC9 uncharacterized protein LOC104235647    8.7e-14    46.39
Query:  VGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKCFSCSPIASQATTLANRQ
        VG   +K G    +    DSDYA DLD+RRS + YIFTL G+ +SW+STLQS+VALS+T+AE++A  E+VKE +W K      S +  ++T   + Q
Subjt:  VGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKCFSCSPIASQATTLANRQ

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class    1.1e-13    66.67
Query:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR
        D+DYAADLDKRRSLS +IF L+GNV+SW+  LQ VVALS+T++E+++L E+VKE VW KR
Subjt:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR

O81903 Putative transposable element    7.9e-15    55
Query:  LGVVGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFK
        +G+V   + + G    V    DSD+AADLDKRRS+S Y+FT+ GN +SW+S+LQ VVALSST+AEF+AL E+VKE +W +
Subjt:  LGVVGALLKKEGGKSPVAN--DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFK

SwissProt top hits    e value    %identity    Alignment
P04146 Copia protein    2.9e-06    45
Query:  DSDYAADLDKRRSLSDYIFTLFG-NVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFK
        DSD+A     R+S + Y+F +F  N+I W +  Q+ VA SST+AE++AL E+V+E +W K
Subjt:  DSDYAADLDKRRSLSDYIFTLFG-NVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFK

P0CV72 Secreted RxLR effector protein 161    5.7e-10    51.72
Query:  NDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVW
        +D+D+A D++ RRS S Y+F L G  +SWRS  Q  VALSST+ E++AL+E+ +E VW
Subjt:  NDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.0e-11    47.89
Query:  GGKSPVA---NDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR
        GG  P+     D+D A D+D R+S + Y+FT  G  ISW+S LQ  VALS+T+AE++A  E+ KE++W KR
Subjt:  GGKSPVA---NDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKR

Arabidopsis top hits    e value    %identity    Alignment
ATMG00810.1 DNA/RNA polymerases superfamily protein    3.6e-04    40.35
Query:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVW
        DSD+A     RRS + +   L  N+ISW +  Q  V+ SST+ E+ ALA +  E+ W
Subjt:  DSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVW


Sequences
CDS sequence
ATGGTTGAGTTTCGAGGTTTAGGAGTGTTTGATCTGTTGGGAGTGGTTGGTGCTTTGTTGAAGAAGGAAGGTGGCAAGTCGCCAGTGGCCAACGACTCAGATTATGCAGC
TGACCTTGATAAAAGGAGGTCTTTGTCTGATTATATTTTTACCCTATTTGGAAATGTGATAAGCTGGAGATCCACCTTACAATCTGTTGTGGCACTATCTTCAACCAAGG
CCGAATTTTTAGCCCTTGCAGAATCGGTGAAGGAGGTTGTTTGGTTTAAAAGGAAATGTTTCAGCTGTAGTCCCATAGCCAGCCAAGCCACGACCCTCGCCAATCGCCAA
CGCTCTCAAGTACCACCTGAAAAGGAAAGATATGTAGCATGGTCGGTTAAGGTAAGTAAACTAGAGACTAGCTGGCCACCTCTAGCAAAACTACACATATTCCAACACAA
CACCCCTGAATGTAAAGATAAAAATACGGTGGAGCAACTGACGCATGTAAACATGCATGGTGCAAACTGGTGCACGAACAACGTGTCTGATCATAAGCTCAACATTGTCC
TGAACTGA
mRNA sequence
ATGGTTGAGTTTCGAGGTTTAGGAGTGTTTGATCTGTTGGGAGTGGTTGGTGCTTTGTTGAAGAAGGAAGGTGGCAAGTCGCCAGTGGCCAACGACTCAGATTATGCAGC
TGACCTTGATAAAAGGAGGTCTTTGTCTGATTATATTTTTACCCTATTTGGAAATGTGATAAGCTGGAGATCCACCTTACAATCTGTTGTGGCACTATCTTCAACCAAGG
CCGAATTTTTAGCCCTTGCAGAATCGGTGAAGGAGGTTGTTTGGTTTAAAAGGAAATGTTTCAGCTGTAGTCCCATAGCCAGCCAAGCCACGACCCTCGCCAATCGCCAA
CGCTCTCAAGTACCACCTGAAAAGGAAAGATATGTAGCATGGTCGGTTAAGGTAAGTAAACTAGAGACTAGCTGGCCACCTCTAGCAAAACTACACATATTCCAACACAA
CACCCCTGAATGTAAAGATAAAAATACGGTGGAGCAACTGACGCATGTAAACATGCATGGTGCAAACTGGTGCACGAACAACGTGTCTGATCATAAGCTCAACATTGTCC
TGAACTGA
Protein sequence
MVEFRGLGVFDLLGVVGALLKKEGGKSPVANDSDYAADLDKRRSLSDYIFTLFGNVISWRSTLQSVVALSSTKAEFLALAESVKEVVWFKRKCFSCSPIASQATTLANRQ
RSQVPPEKERYVAWSVKVSKLETSWPPLAKLHIFQHNTPECKDKNTVEQLTHVNMHGANWCTNNVSDHKLNIVLN
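
The CDS and protein fields can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; `translate` and the table names are hypothetical helpers, not part of the database. For brevity it translates only the first 12 and last 5 codons copied from the CDS field, and the same function accepts the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard code is the right one for this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence; stop codons are shown as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 12 codons of the CDS field.
print(translate("ATGGTTGAGTTTCGAGGTTTAGGAGTGTTTGATCTG"))
# Last 5 codons of the CDS field, including the terminal stop.
print(translate("ATTGTCCTGAACTGA"))
```

The two fragments translate to `MVEFRGLGVFDL` and `IVLN*`, which agree with the start and end of the protein sequence listed above.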