; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005375 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005375
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationchr6:15474461..15474820
RNA-Seq ExpressionLag0005375
SyntenyLag0005375
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158198.1 uncharacterized protein LOC111024735 [Momordica charantia]2.1e-3990.43Show/hide
Query:  LQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSKSVEDLF
        LQYESKIE VF+CNNFSEE+KLKLAVAEFCDYA NIWWTSL+SE RRNYE+PIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSKSVED +
Subjt:  LQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSKSVEDLF

XP_022972407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111470974 [Cucurbita maxima]7.5e-3787.91Show/hide
Query:  EEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSK
        EE+LQYESKIE VF+CNNFSEE+KLKLAVAEFCDYA  IWWTSL+SEWRRNYE+PIETWEELK LMRKRYIPKHYSR LKQKLY LQQGSK
Subjt:  EEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSK

XP_023521225.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111784949, partial [Cucurbita pepo subsp. pepo]4.7e-4780.51Show/hide
Query:  EERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSREL
        +E+G  GVKLKIP ++G SDPEE+LQYESKIE VF+CNNFS+E+KLKLAVAEFCDYA  IWWTSL+SEWRRNYE+PIETWEELK LMRKRYIPKHYSR L
Subjt:  EERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSREL

Query:  KQKLYALQQGSKSVEDLF
        KQKLY LQQGSKSVE+ +
Subjt:  KQKLYALQQGSKSVEDLF

XP_023534142.1 uncharacterized protein LOC111795765 [Cucurbita pepo subsp. pepo]8.0e-4781.2Show/hide
Query:  ERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELK
        E+G  GVKLKIP  +GRSDPEE+LQYESKIE VF+CNNFSEE+KLKLAVAEFCDYA  IWWT L+SEWRRNYE+PIETWEEL+ LMRKRYIPKHYSR LK
Subjt:  ERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELK

Query:  QKLYALQQGSKSVEDLF
        QKLY LQQGSKSVE+ +
Subjt:  QKLYALQQGSKSVEDLF

XP_038888883.1 uncharacterized protein LOC120078660 [Benincasa hispida]1.6e-3970.34Show/hide
Query:  EERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSREL
        EE G  GVKL IP  HG  DP+ +LQYESKI+ VF C+NFSEE+K+KL VA+F DY  ++WWTSL+ EWRRNYE+ IETWEELKALMRK YIPKHY REL
Subjt:  EERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSREL

Query:  KQKLYALQQGSKSVEDLF
        KQKLY+LQQGSK V+D +
Subjt:  KQKLYALQQGSKSVEDLF

TrEMBL top hitse value%identityAlignment
A0A1U8N4L1 uncharacterized protein LOC1079435373.0e-3159.29Show/hide
Query:  RGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLY
        +G+KL IPP HG+SDPE +L++E KIE++F C+N+SE KK+KLAV EF DYA  +WW  L +  RRN E PI +W E+KA+MRKR+IP +Y REL QKL 
Subjt:  RGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLY

Query:  ALQQGSKSVEDLF
         L QG++SVED F
Subjt:  ALQQGSKSVEDLF

A0A2N9IRC6 Reverse transcriptase domain-containing protein3.0e-3160Show/hide
Query:  ERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELK
        +R    +K+KIP   GR+DPE +L++E KIELVF+C+N+SEEKK+KL V EF DY   IWW  L +  RRNYE P+ETW ELKALMR+R++P HY R+L 
Subjt:  ERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELK

Query:  QKLYALQQGSKSVED
        QKL  L QGS+SVED
Subjt:  QKLYALQQGSKSVED

A0A5A7UZP1 Transposon Ty3-I Gag-Pol polyprotein5.1e-3160.17Show/hide
Query:  EERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSREL
        +E   RGVKLKIPP  G +D E +LQ++ KIE VF+CN FSE KK+KLA+AEF +YA+  W+  L+SE RR  EDPIETWEELK  MRKR++PKHY R+L
Subjt:  EERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSREL

Query:  KQKLYALQQGSKSVEDLF
        K KL +L+QG+KSV + +
Subjt:  KQKLYALQQGSKSVEDLF

A0A6J1DVF2 uncharacterized protein LOC1110247351.0e-3990.43Show/hide
Query:  LQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSKSVEDLF
        LQYESKIE VF+CNNFSEE+KLKLAVAEFCDYA NIWWTSL+SE RRNYE+PIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSKSVED +
Subjt:  LQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSKSVEDLF

A0A6J1I4Q5 LOW QUALITY PROTEIN: uncharacterized protein LOC1114709743.6e-3787.91Show/hide
Query:  EEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSK
        EE+LQYESKIE VF+CNNFSEE+KLKLAVAEFCDYA  IWWTSL+SEWRRNYE+PIETWEELK LMRKRYIPKHYSR LKQKLY LQQGSK
Subjt:  EEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQGSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAGAGGAGCTAGGGGAGTGAAGCTTAAAATCCCACCAATTCATGGGAGGTCTGATCCGGAGGAGCATTTGCAATATGAGAGCAAAATTGAGCTTGTATTTAA
TTGCAACAATTTTAGTGAAGAAAAAAAGTTGAAACTTGCTGTGGCTGAATTTTGTGATTATGCCAACAACATTTGGTGGACATCTTTGGAATCAGAATGGAGGAGAAACT
ATGAGGATCCAATTGAAACATGGGAGGAATTAAAGGCCTTAATGAGGAAGAGGTACATTCCTAAGCATTATTCCCGAGAACTCAAGCAAAAGCTCTATGCTTTGCAACAA
GGATCCAAAAGTGTGGAAGATTTATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGAGAGGAGCTAGGGGAGTGAAGCTTAAAATCCCACCAATTCATGGGAGGTCTGATCCGGAGGAGCATTTGCAATATGAGAGCAAAATTGAGCTTGTATTTAA
TTGCAACAATTTTAGTGAAGAAAAAAAGTTGAAACTTGCTGTGGCTGAATTTTGTGATTATGCCAACAACATTTGGTGGACATCTTTGGAATCAGAATGGAGGAGAAACT
ATGAGGATCCAATTGAAACATGGGAGGAATTAAAGGCCTTAATGAGGAAGAGGTACATTCCTAAGCATTATTCCCGAGAACTCAAGCAAAAGCTCTATGCTTTGCAACAA
GGATCCAAAAGTGTGGAAGATTTATTTTGA
Protein sequenceShow/hide protein sequence
MEERGARGVKLKIPPIHGRSDPEEHLQYESKIELVFNCNNFSEEKKLKLAVAEFCDYANNIWWTSLESEWRRNYEDPIETWEELKALMRKRYIPKHYSRELKQKLYALQQ
GSKSVEDLF