CuGenDBv2

Lag0028862 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028862
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr8:32037835..32038513
RNA-Seq Expression: Lag0028862
Synteny: Lag0028862
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity)
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.2e-43; identity 68.06%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        +INATLSP ALAY VG T+SKQ W VL K Y SSSR+N+VNLKSDLQ+ISKK  ESID Y+KRIKE+KDKLANVS ++++ED+ IY LN LP+++NT RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS
        SMRTRS  VTF ELHVLLK EE A+ KQ+K DD   QP A+ AS
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]; e value 1.2e-43; identity 67.36%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        +INATLSP ALAY VG TSSKQ W+VL K Y S SR+N+VNLKSDLQ+I KKP ESID Y+KRIKE+KDKLANVS  ++EED+ IY LN LP+++NT RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS
        SMRTRSQ VTF ELHVLL+ EE A+ KQ+K DD+  QP  + +S
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]; e value 1.2e-43; identity 67.36%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        +INATLSP ALAY VG TSSKQ W+VL K Y S SR+N+VNLKSDLQ+I KKP ESID Y+KRIKE+KDKLANVS  ++EED+ IY LN LP+++NT RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS
        SMRTRSQ VTF ELHVLL+ EE A+ KQ+K DD+  QP  + +S
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS

XP_022157455.1 uncharacterized protein LOC111024149 [Momordica charantia]; e value 1.6e-48; identity 72.03%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        LINATLSP+A AY VG TSSK+ W+ LEKHY SSSRTN+VNLKSDLQSISKK GE IDDYVKRIKE+KDKL NVS+++D+ED+ IYTLN LPS +N  RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFA
        SMRTRSQ VTF+ELHVL+K+EEVA+++Q K DD  +Q  A+FA
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFA

XP_022159298.1 uncharacterized protein LOC111025709 [Momordica charantia]; e value 1.0e-44; identity 64.05%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        L+NATLSP+ALAY VGC SS+Q W+ L K+Y SSSRTN+VNLKS+LQSISKKPGESID Y++RIKELKDKLANVS+++D ED+ IYTLN LP +FN   T
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFASVVSPAKSVN
        SM TRSQ V+F EL+VLL  EE AI+KQTKHD+   Q + + A++ +  ++ N
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFASVVSPAKSVN

TrEMBL top hits (e value, % identity)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2; e value 5.6e-44; identity 67.36%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        +INATLSP ALAY VG TSSKQ W+VL K Y S SR+N+VNLKSDLQ+I KKP ESID Y+KRIKE+KDKLANVS  ++EED+ IY LN LP+++NT RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS
        SMRTRSQ VTF ELHVLL+ EE A+ KQ+K DD+  QP  + +S
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1; e value 5.6e-44; identity 67.36%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        +INATLSP ALAY VG TSSKQ W+VL K Y S SR+N+VNLKSDLQ+I KKP ESID Y+KRIKE+KDKLANVS  ++EED+ IY LN LP+++NT RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS
        SMRTRSQ VTF ELHVLL+ EE A+ KQ+K DD+  QP  + +S
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS

A0A5D3CLI6 T4.5; e value 5.6e-44; identity 67.36%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        +INATLSP ALAY VG TSSKQ W+VL K Y S SR+N+VNLKSDLQ+I KKP ESID Y+KRIKE+KDKLANVS  ++EED+ IY LN LP+++NT RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS
        SMRTRSQ VTF ELHVLL+ EE A+ KQ+K DD+  QP  + +S
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFAS

A0A6J1DT57 uncharacterized protein LOC111024149; e value 7.5e-49; identity 72.03%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        LINATLSP+A AY VG TSSK+ W+ LEKHY SSSRTN+VNLKSDLQSISKK GE IDDYVKRIKE+KDKL NVS+++D+ED+ IYTLN LPS +N  RT
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFA
        SMRTRSQ VTF+ELHVL+K+EEVA+++Q K DD  +Q  A+FA
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFA

A0A6J1DYF1 uncharacterized protein LOC111025709; e value 5.0e-45; identity 64.05%
Query:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT
        L+NATLSP+ALAY VGC SS+Q W+ L K+Y SSSRTN+VNLKS+LQSISKKPGESID Y++RIKELKDKLANVS+++D ED+ IYTLN LP +FN   T
Subjt:  LINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRT

Query:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFASVVSPAKSVN
        SM TRSQ V+F EL+VLL  EE AI+KQTKHD+   Q + + A++ +  ++ N
Subjt:  SMRTRSQFVTFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFASVVSPAKSVN

SwissProt top hits
No hits found
Arabidopsis top hits (e value, % identity)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value 1.3e-05; identity 23.93%
Query:  VGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGE-SIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRTSMRTRSQFVTFNE
        V  ++S+  W  ++  + ++     + L S+L+  +K  G+  + DY +++K+L D L NV + + + ++ +Y LN L   F+ +   ++ R  F +F++
Subjt:  VGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGE-SIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRTSMRTRSQFVTFNE

Query:  LHVLLKTEEVAIEKQTK
           +L+ EE  +++  K
Subjt:  LHVLLKTEEVAIEKQTK


Sequences
CDS sequence
ATGTTGATCAACGCCACACTCTCACCGACAGCGTTGGCCTATGATGTTGGTTGTACATCATCCAAACAAGCTTGGGAAGTCTTGGAGAAGCACTATTTCTCGAGTTCAAG
AACCAACATCGTCAATCTAAAATCCGATCTTCAATCTATCTCTAAGAAACCAGGTGAGTCCATTGATGACTATGTTAAACGAATTAAGGAGCTTAAGGACAAATTAGCTA
ATGTCTCTATTATTATGGATGAAGAGGATATTCAAATTTATACCCTAAATGACTTACCCTCTGATTTTAATACCGTTCGCACGTCTATGAGAACCCGTTCACAGTTTGTT
ACTTTCAATGAGTTACATGTTTTATTGAAGACTGAAGAAGTTGCCATTGAAAAACAGACGAAACATGATGATGCCCTAACTCAACCAGCAGCTATGTTTGCATCGGTCGT
CTCTCCTGCCAAATCTGTCAACGCCCTGGACATAGTGCCATCGATTGCTACAATAGAATGA
mRNA sequence
ATGTTGATCAACGCCACACTCTCACCGACAGCGTTGGCCTATGATGTTGGTTGTACATCATCCAAACAAGCTTGGGAAGTCTTGGAGAAGCACTATTTCTCGAGTTCAAG
AACCAACATCGTCAATCTAAAATCCGATCTTCAATCTATCTCTAAGAAACCAGGTGAGTCCATTGATGACTATGTTAAACGAATTAAGGAGCTTAAGGACAAATTAGCTA
ATGTCTCTATTATTATGGATGAAGAGGATATTCAAATTTATACCCTAAATGACTTACCCTCTGATTTTAATACCGTTCGCACGTCTATGAGAACCCGTTCACAGTTTGTT
ACTTTCAATGAGTTACATGTTTTATTGAAGACTGAAGAAGTTGCCATTGAAAAACAGACGAAACATGATGATGCCCTAACTCAACCAGCAGCTATGTTTGCATCGGTCGT
CTCTCCTGCCAAATCTGTCAACGCCCTGGACATAGTGCCATCGATTGCTACAATAGAATGA
Protein sequence
MLINATLSPTALAYDVGCTSSKQAWEVLEKHYFSSSRTNIVNLKSDLQSISKKPGESIDDYVKRIKELKDKLANVSIIMDEEDIQIYTLNDLPSDFNTVRTSMRTRSQFV
TFNELHVLLKTEEVAIEKQTKHDDALTQPAAMFASVVSPAKSVNALDIVPSIATIE
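
The protein sequence listed for this gene should correspond to a frame-0 translation of the CDS. The following is a minimal sketch of how that could be checked, assuming the standard genetic code (NCBI translation table 1) applies; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Drop line breaks and other whitespace, then normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTTGATCAACGCCACACTC"))  # MLINATL
```

Passing the full CDS (with or without line breaks) to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two entries agree.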