; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039238 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039238
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr2:39726952..39727718
RNA-Seq ExpressionLag0039238
SyntenyLag0039238
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]2.1e-3439.18Show/hide
Query:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPK----------------------------------
        +SST    SS      S + LL+NICNL+S+RLDSTNFVLW FQ++ +LK+HKLF +VDG+   P+                                  
Subjt:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPK----------------------------------

Query:  -----------MFLNDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIR
                      + +VWD L K +SS +RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS  IN EDL+IY ++GLP   N F+TS+R R
Subjt:  -----------MFLNDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIR

Query:  AQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
        +Q  TF+ELH+L++ EE AL KQSK  ++   P + +     L+S
Subjt:  AQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]8.6e-3642.02Show/hide
Query:  MSSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAP-------KMFLND-----------------
        M SS +SS+    SST ++  S + LL+NICNL+S++LDSTN+VLW FQ++ LLK+HKLF ++DG+   P         F  D                 
Subjt:  MSSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAP-------KMFLND-----------------

Query:  ---------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFD
                 +VW+ L K +SSS+RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS ++N EDL+IY ++GLPT  N F+TS+R R+   TF+
Subjt:  ---------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFD

Query:  ELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
        ELH+L+K EE AL KQSK  +    P   +     LMS
Subjt:  ELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-3642.02Show/hide
Query:  MSSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAP-------KMFLND-----------------
        M SS +SS+    SST ++  S + LL+NICNL+S++LDSTN+VLW FQ++ LLK+HKLF ++DG+   P         F  D                 
Subjt:  MSSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAP-------KMFLND-----------------

Query:  ---------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFD
                 +VW+ L K +SSS+RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS ++N EDL+IY ++GLPT  N F+TS+R R+   TF+
Subjt:  ---------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFD

Query:  ELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
        ELH+L+K EE AL KQSK  +    P   +     LMS
Subjt:  ELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]2.1e-3439.18Show/hide
Query:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPK----------------------------------
        +SST    SS      S + LL+NICNL+S+RLDSTNFVLW FQ++ +LK+HKLF +VDG+   P+                                  
Subjt:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPK----------------------------------

Query:  -----------MFLNDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIR
                      + +VWD L K +SS +RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS  IN EDL+IY ++GLP   N F+TS+R R
Subjt:  -----------MFLNDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIR

Query:  AQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
        +Q  TF+ELH+L++ EE AL KQSK  ++   P + +     L+S
Subjt:  AQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]3.8e-3639.27Show/hide
Query:  SSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFL---------------------------
        SSSTN+  D          +S + LL+NICNLVS+RLDST+F+LW FQ++ +LK+HKLF ++DGS+ AP  FL                           
Subjt:  SSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFL---------------------------

Query:  ---------------------------NDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLP
                                   + +VW+ LEKH+SS++R+ +V+LK++LQSIVKK  +S+D YV+R+K++ ++ A VS+ IN E L+IY ++GL 
Subjt:  ---------------------------NDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLP

Query:  TSNNVFKTSLRIRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPN
        T  N   TS+R RAQ  +F+ELH+ MK+EE A++KQ K ++    PN
Subjt:  TSNNVFKTSLRIRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPN

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X25.1e-3438.46Show/hide
Query:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLND-----------------------------
        +SST    SS      S + LL+NICNL+S+RLDSTNFVLW FQ++ +LK+HKL+ ++DG+   P    N                              
Subjt:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLND-----------------------------

Query:  ------------------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLR
                          +VWD L K +SS +RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS  IN EDL+IY ++GLP   N F+TS+R
Subjt:  ------------------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLR

Query:  IRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
         R+Q  TF+ELH+L++ EE AL KQSK  ++   P + +     L+S
Subjt:  IRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X15.1e-3438.46Show/hide
Query:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLND-----------------------------
        +SST    SS      S + LL+NICNL+S+RLDSTNFVLW FQ++ +LK+HKL+ ++DG+   P    N                              
Subjt:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLND-----------------------------

Query:  ------------------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLR
                          +VWD L K +SS +RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS  IN EDL+IY ++GLP   N F+TS+R
Subjt:  ------------------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLR

Query:  IRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
         R+Q  TF+ELH+L++ EE AL KQSK  ++   P + +     L+S
Subjt:  IRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

A0A5D3CLI6 T4.55.1e-3438.46Show/hide
Query:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLND-----------------------------
        +SST    SS      S + LL+NICNL+S+RLDSTNFVLW FQ++ +LK+HKL+ ++DG+   P    N                              
Subjt:  NSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLND-----------------------------

Query:  ------------------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLR
                          +VWD L K +SS +RS +V+LK++LQ+I KK  +S+D Y++R+K++ ++LA VS  IN EDL+IY ++GLP   N F+TS+R
Subjt:  ------------------EVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLR

Query:  IRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS
         R+Q  TF+ELH+L++ EE AL KQSK  ++   P + +     L+S
Subjt:  IRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMS

A0A6J1D9L6 uncharacterized protein LOC1110188921.9e-3639.27Show/hide
Query:  SSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFL---------------------------
        SSSTN+  D          +S + LL+NICNLVS+RLDST+F+LW FQ++ +LK+HKLF ++DGS+ AP  FL                           
Subjt:  SSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFL---------------------------

Query:  ---------------------------NDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLP
                                   + +VW+ LEKH+SS++R+ +V+LK++LQSIVKK  +S+D YV+R+K++ ++ A VS+ IN E L+IY ++GL 
Subjt:  ---------------------------NDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLP

Query:  TSNNVFKTSLRIRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPN
        T  N   TS+R RAQ  +F+ELH+ MK+EE A++KQ K ++    PN
Subjt:  TSNNVFKTSLRIRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPN

A0A6J1E049 uncharacterized protein LOC1110251502.3e-3441.71Show/hide
Query:  NSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFL----------------------------------------------
        +S + LL+NICNLVS+RLDS+NFVLW FQ++ +LK+HKL+ ++DGS   P  FL                                              
Subjt:  NSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFL----------------------------------------------

Query:  ---NDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFDELHI
           + +VW TL KH+SSS+R+ +V+LK++LQSI KK   S+D YVQR+K+L ++LA V V+++ EDL+IYT++ LP   N F+TS+R R+Q  +F+ELH+
Subjt:  ---NDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTKSVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFDELHI

Query:  LMKTEEIALDK
        L+ +EE A+DK
Subjt:  LMKTEEIALDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTTCTACGAATTCTTCAACTGATGCCCTAATTTCCTCTACGATTTCTTCTCCGAATTCTTCTATGTCTCTTCTCAATAACATCTGTAATCTCGTCTCTGTAAG
GCTCGATTCCACAAATTTTGTTCTGTGGTGTTTTCAAATTTCGCCTCTCCTCAAATCTCATAAGCTTTTCAAGTATGTTGATGGATCGATCAAGGCTCCTAAGATGTTTC
TGAATGATGAAGTCTGGGACACTCTTGAGAAGCACTTCTCTTCATCCAACAGATCGATCATTGTTAGCCTAAAAACTGAATTACAGAGTATCGTCAAGAAACGTACTAAA
TCAGTTGATTTATATGTTCAGCGAGTCAAAGATCTGGTCAATCGCCTTGCCGCCGTTTCTGTTATAATCAATGCTGAAGATCTCATTATCTACACGATCGATGGCCTACC
GACATCCAACAACGTGTTCAAGACCTCTTTGAGAATAAGAGCTCAGGTCCGTACATTTGATGAACTCCATATCTTAATGAAGACCGAAGAAATTGCGCTTGATAAACAAT
CCAAACTTAAGGAAGCGTCTTTGATCCCTAATCTTGCAATGCAACAAATTTGGCTTTTAATGTCGGGCATTGCCCGACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTTCTACGAATTCTTCAACTGATGCCCTAATTTCCTCTACGATTTCTTCTCCGAATTCTTCTATGTCTCTTCTCAATAACATCTGTAATCTCGTCTCTGTAAG
GCTCGATTCCACAAATTTTGTTCTGTGGTGTTTTCAAATTTCGCCTCTCCTCAAATCTCATAAGCTTTTCAAGTATGTTGATGGATCGATCAAGGCTCCTAAGATGTTTC
TGAATGATGAAGTCTGGGACACTCTTGAGAAGCACTTCTCTTCATCCAACAGATCGATCATTGTTAGCCTAAAAACTGAATTACAGAGTATCGTCAAGAAACGTACTAAA
TCAGTTGATTTATATGTTCAGCGAGTCAAAGATCTGGTCAATCGCCTTGCCGCCGTTTCTGTTATAATCAATGCTGAAGATCTCATTATCTACACGATCGATGGCCTACC
GACATCCAACAACGTGTTCAAGACCTCTTTGAGAATAAGAGCTCAGGTCCGTACATTTGATGAACTCCATATCTTAATGAAGACCGAAGAAATTGCGCTTGATAAACAAT
CCAAACTTAAGGAAGCGTCTTTGATCCCTAATCTTGCAATGCAACAAATTTGGCTTTTAATGTCGGGCATTGCCCGACATTAA
Protein sequenceShow/hide protein sequence
MSSSTNSSTDALISSTISSPNSSMSLLNNICNLVSVRLDSTNFVLWCFQISPLLKSHKLFKYVDGSIKAPKMFLNDEVWDTLEKHFSSSNRSIIVSLKTELQSIVKKRTK
SVDLYVQRVKDLVNRLAAVSVIINAEDLIIYTIDGLPTSNNVFKTSLRIRAQVRTFDELHILMKTEEIALDKQSKLKEASLIPNLAMQQIWLLMSGIARH