; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041904 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041904
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr13:31000492..31001974
RNA-Seq ExpressionLag0041904
SyntenyLag0041904
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]1.5e-3651.18Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        + C D   ++G VFML+ +A  WW                 +FKDLLYDYY+ ETVKD KEAEFL+L QG++ V QYERKFT LS FA +L+     KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP-LRNPPIEFTQH
        RF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+ GVKRK  P   +P +   QH
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP-LRNPPIEFTQH

XP_022155872.1 uncharacterized protein LOC111022885 [Momordica charantia]3.1e-3759.72Show/hide
Query:  MLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIKRFIKGLREEIRGSV
        ML+D+A  WW                 +FKDLLYDYY+PETVKD KEAEFL+L QG++ V QYERKFT LS FA +L+ T   KIKRF+KGLR+ IRG V
Subjt:  MLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIKRFIKGLREEIRGSV

Query:  ALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP
         L RPAT+AEA+ GALIMDK+VS + QP +E GS+ GVKRK+SP
Subjt:  ALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.4e-3754.84Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        + C D   ++G VFML+ +A  WW                 +FK+LLYDYY+PETVKD KEAEFL+L QG++ V QYERKFT LS FA +L+ T   KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRK
        RF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS+ GVKRK
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRK

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]2.6e-3651.9Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        + C D   ++G VFML+ +A  WW                 +FK+LLYD+Y+ ETV+D KE EFL+L QG++ V QYERKFT LS FA +L+ T   KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP
        RF+KGL + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]1.0e-3753.16Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        ++C +   ++GVVFML+ +A  WW                 +FKDLLYDYY+P+T+KD KEAEFL+ + G++ V QYERKFT LS FA +L+ T   KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP
        RF+KGLR+ IRG V L RPAT+AEA+ G LIMD +VS   QP +E GS+ GVKRK+SP
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP

TrEMBL top hitse value%identityAlignment
A0A6J1DL73 uncharacterized protein LOC1110221447.3e-3751.18Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        + C D   ++G VFML+ +A  WW                 +FKDLLYDYY+ ETVKD KEAEFL+L QG++ V QYERKFT LS FA +L+     KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP-LRNPPIEFTQH
        RF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+ GVKRK  P   +P +   QH
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP-LRNPPIEFTQH

A0A6J1DQJ4 uncharacterized protein LOC1110228851.5e-3759.72Show/hide
Query:  MLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIKRFIKGLREEIRGSV
        ML+D+A  WW                 +FKDLLYDYY+PETVKD KEAEFL+L QG++ V QYERKFT LS FA +L+ T   KIKRF+KGLR+ IRG V
Subjt:  MLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIKRFIKGLREEIRGSV

Query:  ALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP
         L RPAT+AEA+ GALIMDK+VS + QP +E GS+ GVKRK+SP
Subjt:  ALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP

A0A6J1DUM2 uncharacterized protein LOC1110232476.6e-3854.84Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        + C D   ++G VFML+ +A  WW                 +FK+LLYDYY+PETVKD KEAEFL+L QG++ V QYERKFT LS FA +L+ T   KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRK
        RF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS+ GVKRK
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRK

A0A6J1DVA0 uncharacterized protein LOC1110234241.3e-3651.9Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        + C D   ++G VFML+ +A  WW                 +FK+LLYD+Y+ ETV+D KE EFL+L QG++ V QYERKFT LS FA +L+ T   KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP
        RF+KGL + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP

A0A6J1DYU5 uncharacterized protein LOC1110255175.1e-3853.16Show/hide
Query:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK
        ++C +   ++GVVFML+ +A  WW                 +FKDLLYDYY+P+T+KD KEAEFL+ + G++ V QYERKFT LS FA +L+ T   KIK
Subjt:  MNCNDPLNIRGVVFMLKDDARMWW-----------------KFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIK

Query:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP
        RF+KGLR+ IRG V L RPAT+AEA+ G LIMD +VS   QP +E GS+ GVKRK+SP
Subjt:  RFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKTQPHLEKGSTCGVKRKLSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGTAATGATCCCCTGAATATCAGAGGAGTAGTCTTCATGCTCAAGGACGACGCTCGCATGTGGTGGAAGTTCAAAGACCTGTTGTACGATTATTACTTCCCGGA
GACAGTCAAGGATGACAAAGAAGCGGAATTCCTTAATTTGGCTCAGGGAAGTATGCCTGTAGTGCAGTACGAGAGGAAGTTTACTGCACTATCACACTTTGCTCCTGACC
TGGTCAGCACGCTAGAGCGGAAGATCAAGAGGTTCATTAAAGGTCTCCGTGAGGAAATTCGTGGCTCTGTAGCCCTGAGCAGGCCTGCGACCTTTGCTGAAGCACTCACG
GGTGCATTGATCATGGATAAGAATGTTTCCAAGAAGACACAACCTCATCTTGAAAAGGGATCAACTTGTGGAGTTAAAAGAAAGTTGTCTCCCCTGAGGAACCCACCTAT
TGAGTTTACTCAGCATGCAATTGGGGAGGTCATTATGCTAGGGATTGCTCATCATGAGCAACCTGAACGCCACGTGCCTAGGAAGTCTCAAGGTCTTGTTGGACTTGCTT
GTGACATGCTTTTGATGGTGTGTTGGTTTAGTATTAGACCAGGAATCAAAGCATTTCCCTCACGCATACTTATGGCGGTGTTATTGTCACCCCGTAGCCATTCTAGGAAG
GGTGGTTGGTTTGATAAGGAGAGTTGGCAGCGTAAGCTGTTTCGACGTTCCTTTGGCGATGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTGTAATGATCCCCTGAATATCAGAGGAGTAGTCTTCATGCTCAAGGACGACGCTCGCATGTGGTGGAAGTTCAAAGACCTGTTGTACGATTATTACTTCCCGGA
GACAGTCAAGGATGACAAAGAAGCGGAATTCCTTAATTTGGCTCAGGGAAGTATGCCTGTAGTGCAGTACGAGAGGAAGTTTACTGCACTATCACACTTTGCTCCTGACC
TGGTCAGCACGCTAGAGCGGAAGATCAAGAGGTTCATTAAAGGTCTCCGTGAGGAAATTCGTGGCTCTGTAGCCCTGAGCAGGCCTGCGACCTTTGCTGAAGCACTCACG
GGTGCATTGATCATGGATAAGAATGTTTCCAAGAAGACACAACCTCATCTTGAAAAGGGATCAACTTGTGGAGTTAAAAGAAAGTTGTCTCCCCTGAGGAACCCACCTAT
TGAGTTTACTCAGCATGCAATTGGGGAGGTCATTATGCTAGGGATTGCTCATCATGAGCAACCTGAACGCCACGTGCCTAGGAAGTCTCAAGGTCTTGTTGGACTTGCTT
GTGACATGCTTTTGATGGTGTGTTGGTTTAGTATTAGACCAGGAATCAAAGCATTTCCCTCACGCATACTTATGGCGGTGTTATTGTCACCCCGTAGCCATTCTAGGAAG
GGTGGTTGGTTTGATAAGGAGAGTTGGCAGCGTAAGCTGTTTCGACGTTCCTTTGGCGATGCCTAG
Protein sequenceShow/hide protein sequence
MNCNDPLNIRGVVFMLKDDARMWWKFKDLLYDYYFPETVKDDKEAEFLNLAQGSMPVVQYERKFTALSHFAPDLVSTLERKIKRFIKGLREEIRGSVALSRPATFAEALT
GALIMDKNVSKKTQPHLEKGSTCGVKRKLSPLRNPPIEFTQHAIGEVIMLGIAHHEQPERHVPRKSQGLVGLACDMLLMVCWFSIRPGIKAFPSRILMAVLLSPRSHSRK
GGWFDKESWQRKLFRRSFGDA