; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024157 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024157
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationchr10:864847..867146
RNA-Seq ExpressionLag0024157
SyntenyLag0024157
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004601 - peroxidase activity (molecular function)
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7049796.1 unnamed protein product [Microthlaspi erraticum]6.8e-2870.59Show/hide
Query:  SKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        S + K EIEKL++ ++ +RI++PS SP+SSP+LLVKKKDGGWRFCVDYRALN+VT+PD++PIP+IEELLDEL+G++V+SKLDLKS
Subjt:  SKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

KAG7637566.1 Cyclin-like [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-2872.94Show/hide
Query:  SKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        S L K EIE+L++ +L +R++QPS SP+SSPVLLVKKKDGGWRFCVDYRALN+VT+PD++P+P+IEELLDEL G+RV+SKLDLKS
Subjt:  SKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

XP_022848903.1 uncharacterized protein LOC111371244 [Olea europaea var. sylvestris]1.8e-2876.54Show/hide
Query:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        K EIE+L+K +L + I+QPSSSPFSSPVLLVKKKDG WRFCVDYRALN+VT+PD+FPIP+I+ELLDELHG+ ++SKLDLKS
Subjt:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

XP_023633382.1 uncharacterized protein LOC111829008 [Capsella rubella]8.8e-2875.31Show/hide
Query:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        K EIEKL++ +L +RI++PS SPFSSPVLLVKKKDGGWRFCVDYRALN+VT+ D++PIP+IEELLDELHG+ ++SKLDLKS
Subjt:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

XP_024448007.1 uncharacterized protein LOC112325543 [Populus trichocarpa]3.0e-2853.72Show/hide
Query:  SNDYVEPT--PPMLTEAFKWIVEPKAINRTNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPM
        S+ +  PT  PP  T   +  ++P+ +  +    +     KTEIEK+++ +L +R+++PS+SPFSSPVLLVKK DG WRFCVDYRALN +T+ DK+PIP+
Subjt:  SNDYVEPT--PPMLTEAFKWIVEPKAINRTNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPM

Query:  IEELLDELHGSRVYSKLDLKS
        I+ELLDELHGS++YSKLDL+S
Subjt:  IEELLDELHGSRVYSKLDLKS

TrEMBL top hitse value%identityAlignment
A0A5A7UNX8 Transposon Ty3-I Gag-Pol polyprotein5.6e-2874.07Show/hide
Query:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        K EIEKL+  +L   I++PS SPFSSPV+LVKKKDGGWRFCVDYRALN+ T+PDKFPIPMI+ELLDELHG+ ++SK+DLKS
Subjt:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

A0A5A7UU61 Ty3-gypsy retrotransposon protein5.6e-2872.84Show/hide
Query:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        K EIEKL+  +L + +++PS SPFSSPV+LVKKKDGGWRFCVDYRALN+ T+PDKFPIPMI+ELLDELHG+ ++SK+DLKS
Subjt:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

A0A5D3E123 Transposon Ty3-I Gag-Pol polyprotein1.2e-2774.07Show/hide
Query:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        K EIEKL+  +L +R+++PS SP+SSPVLLVKKKDGGWRFCVDYR LNQ T+ DKFPIP+IEELLDELHG+ ++SKLDLKS
Subjt:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

A0A6D2KQX8 Uncharacterized protein3.3e-2870.59Show/hide
Query:  SKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        S + K EIEKL++ ++ +RI++PS SP+SSP+LLVKKKDGGWRFCVDYRALN+VT+PD++PIP+IEELLDEL+G++V+SKLDLKS
Subjt:  SKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

A0A6P4DBC2 uncharacterized protein LOC1074880997.3e-2875.31Show/hide
Query:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        K E+EK+I  +L  RI++PS+SPFSSPV+LVKKKDGGWRFCVDYRALN++T+PDKFPIP+IEELLDEL G+ V+SKLDLKS
Subjt:  KTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

SwissProt top hitse value%identityAlignment
P0CT41 Transposon Tf2-12 polyprotein2.1e-1145.07Show/hide
Query:  LLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKSA
        L S I++ S +  + PV+ V KK+G  R  VDY+ LN+   P+ +P+P+IE+LL ++ GS +++KLDLKSA
Subjt:  LLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKSA

P10394 Retrovirus-related Pol polyprotein from transposon 4124.1e-1236.19Show/hide
Query:  EPKAINRTNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDG------GWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSK
        EP       +   + + ++ +++KLIK     +I++PS S ++SP+LLV KK         WR  +DYR +N+  L DKFP+P I+++LD+L  ++ +S 
Subjt:  EPKAINRTNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDG------GWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSK

Query:  LDLKS
        LDL S
Subjt:  LDLKS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.6e-1648.1Show/hide
Query:  EIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        EI K+++ +L ++ + PS SP SSPV+LV KKDG +R CVDYR LN+ T+ D FP+P I+ LL  +  +++++ LDL S
Subjt:  EIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-1648.1Show/hide
Query:  EIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS
        EI K+++ +L ++ + PS SP SSPV+LV KKDG +R CVDYR LN+ T+ D FP+P I+ LL  +  +++++ LDL S
Subjt:  EIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKS

Q9UR07 Transposon Tf2-11 polyprotein2.1e-1145.07Show/hide
Query:  LLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKSA
        L S I++ S +  + PV+ V KK+G  R  VDY+ LN+   P+ +P+P+IE+LL ++ GS +++KLDLKSA
Subjt:  LLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHGSRVYSKLDLKSA

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein1.6e-0651.02Show/hide
Query:  TNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGW
        T+  T    L +T ++  +  +L +RI+QPS SP+SSPVLLV+KKDGGW
Subjt:  TNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGTAATGACTATGTGGAACCGACGCCACCAATGTTAACAGAGGCATTCAAATGGATCGTGGAACCTAAGGCCATCAATCGGACAAATGCTACAACCAAGGAATC
GAAACTATTGAAGACAGAAATTGAAAAATTGATCAAGGCGGTGTTGTTATCTAGAATTCTTCAACCAAGCTCTAGCCCTTTCTCGAGTCCAGTGTTGTTAGTCAAGAAGA
AGGATGGAGGGTGGAGGTTCTGTGTAGACTACAGGGCCCTGAACCAAGTGACACTTCCGGACAAATTTCCCATTCCGATGATTGAGGAGTTGCTTGACGAACTACACGGA
TCAAGGGTGTACTCGAAGCTGGATTTGAAGTCAGCTACCATCAGATTCGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGTAATGACTATGTGGAACCGACGCCACCAATGTTAACAGAGGCATTCAAATGGATCGTGGAACCTAAGGCCATCAATCGGACAAATGCTACAACCAAGGAATC
GAAACTATTGAAGACAGAAATTGAAAAATTGATCAAGGCGGTGTTGTTATCTAGAATTCTTCAACCAAGCTCTAGCCCTTTCTCGAGTCCAGTGTTGTTAGTCAAGAAGA
AGGATGGAGGGTGGAGGTTCTGTGTAGACTACAGGGCCCTGAACCAAGTGACACTTCCGGACAAATTTCCCATTCCGATGATTGAGGAGTTGCTTGACGAACTACACGGA
TCAAGGGTGTACTCGAAGCTGGATTTGAAGTCAGCTACCATCAGATTCGGATGA
Protein sequenceShow/hide protein sequence
MGSNDYVEPTPPMLTEAFKWIVEPKAINRTNATTKESKLLKTEIEKLIKAVLLSRILQPSSSPFSSPVLLVKKKDGGWRFCVDYRALNQVTLPDKFPIPMIEELLDELHG
SRVYSKLDLKSATIRFG