; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021823 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021823
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr7:12650225..12651048
RNA-Seq ExpressionLag0021823
SyntenyLag0021823
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8515344.1 hypothetical protein F0562_018426 [Nyssa sinensis]9.7e-2435.29Show/hide
Query:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------
        M++ IS SN +  ++TS         IFLLSNICNL+  RLD+SNY+ W+FQ+ S+LRAHSL   I                                  
Subjt:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------

Query:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS
         LM    A  S+  L H          W+ +  R    T+                  SID Y+ +IK+  D LA+VSV+I DED+L+Y LNGL  EYN+
Subjt:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS

Query:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFARGYNPSQSFFR--------GRGRNQG
        F+TSI TR E++TL+E++ +LK + + IE  +K   +S  P  M A  Y P+ S  R        GRGR +G
Subjt:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFARGYNPSQSFFR--------GRGRNQG

KAA8518236.1 hypothetical protein F0562_015710 [Nyssa sinensis]4.3e-2435.16Show/hide
Query:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------
        M++  S SN +  ++TS         IFLLSNICNL+  RLD+SNY+ W+FQ+ S+LRAHSL   I                                  
Subjt:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------

Query:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS
         LM    A  S+  L H          W+ +  R    T+                  SID Y+ +IK+  D LA+VSV+I DED+L+Y LNGL  EYN+
Subjt:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS

Query:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNK----------ALTSSVNPTTMFARGYNPSQSFFRGRGRNQ
        F+TSI TR E +TL+E++A+LK + + IE  +K           + S+  PT    RGY+P  S  RGRGR +
Subjt:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNK----------ALTSSVNPTTMFARGYNPSQSFFRGRGRNQ

KAA8539318.1 hypothetical protein F0562_026010 [Nyssa sinensis]7.4e-2434.63Show/hide
Query:  SNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------LLMDRSPAQPSRFLLRH------------------------------
        S + IFLLSNICNL+  RLD+SNY+ W+FQ+ S+L+AHSL   I          + D   A  ++  L +                              
Subjt:  SNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------LLMDRSPAQPSRFLLRH------------------------------

Query:  -----WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLK
             W+ +  R    T+                  SID Y+ +IK+  D LA VSV+I DED+L+Y LNGL  EYN+F+TSI T+ +++TL+E++A+LK
Subjt:  -----WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLK

Query:  SKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG
         + + IE  +K   S   P  M A          RGY+PS    RGRGR    N+GG
Subjt:  SKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]1.3e-2333.33Show/hide
Query:  MSSMISESNISLHSN--TSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLF--------------------------------------------
        M S  S S+ S  +N  + I LLSNICNL+ ++LD++NY+ W+FQ+ ++L+AH LF                                            
Subjt:  MSSMISESNISLHSN--TSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLF--------------------------------------------

Query:  --------DIILLMDRSPAQPSRFLLRHWMQIRQGGPTKSIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHA
                +++  +  S ++ +   L+  +Q       +SID Y+ RIKEI DKLA VS V+NDED+L+Y LNGL +EYN+FRTS+ TR   VT +ELH 
Subjt:  --------DIILLMDRSPAQPSRFLLRHWMQIRQGGPTKSIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHA

Query:  LLKSKSKFIEQHNKALTSSVNPTTMFARGYNP-------SQSFFRGRGRNQG
        LLK++   + + +K       PT + A   +        + +F RGRGR +G
Subjt:  LLKSKSKFIEQHNKALTSSVNPTTMFARGYNP-------SQSFFRGRGRNQG

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]9.7e-2433.1Show/hide
Query:  SSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDIILLMDRSPAQPSRFL----------------------LR
        SS+  M+S  + +   LHS   IFLLSNICNLV +RLD+++++ W+FQ+ ++L+AH LF  I   D S + PS+FL                        
Subjt:  SSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDIILLMDRSPAQPSRFL----------------------LR

Query:  HWM--------------------QIRQGGPTK----------------------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLN
         W+                     + + G +K                                  SID Y+ RIKEI DK A VS+ INDE +L+Y LN
Subjt:  HWM--------------------QIRQGGPTK----------------------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLN

Query:  GLSSEYNSFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFA---------RGYNPSQSFFRGRGRNQG
        GLS+EYN+  TS+ TR +SV+ +ELH  +KS+   IE+  K       P  +FA           ++P+QS  RGRG+N G
Subjt:  GLSSEYNSFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFA---------RGYNPSQSFFRGRGRNQG

TrEMBL top hitse value%identityAlignment
A0A5J4ZC67 Retrotran_gag_3 domain-containing protein4.7e-2435.29Show/hide
Query:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------
        M++ IS SN +  ++TS         IFLLSNICNL+  RLD+SNY+ W+FQ+ S+LRAHSL   I                                  
Subjt:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------

Query:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS
         LM    A  S+  L H          W+ +  R    T+                  SID Y+ +IK+  D LA+VSV+I DED+L+Y LNGL  EYN+
Subjt:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS

Query:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFARGYNPSQSFFR--------GRGRNQG
        F+TSI TR E++TL+E++ +LK + + IE  +K   +S  P  M A  Y P+ S  R        GRGR +G
Subjt:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFARGYNPSQSFFR--------GRGRNQG

A0A5J4ZCR1 Retrotran_gag_3 domain-containing protein2.3e-2335.71Show/hide
Query:  MASSSSSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------------------------------
        MA+++S+  T +   S S     S + IFLLSNICNL+  RLD+SNY+ W+FQ+ S+LRAHSL   I                                 
Subjt:  MASSSSSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------------------------------

Query:  -LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYN
          LM    A  S+  L H          W+ +  R    T+                  SID Y+ +IK+  D LA+VSV+I DED+L+Y LNGL  EYN
Subjt:  -LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYN

Query:  SFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG
        + +TSI TR E++TL+E++A+LK K + IE  +K   +S  P  M A          +GY+P  S  RGRGR    N+GG
Subjt:  SFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG

A0A5J4ZKU4 Retrotran_gag_3 domain-containing protein2.1e-2435.16Show/hide
Query:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------
        M++  S SN +  ++TS         IFLLSNICNL+  RLD+SNY+ W+FQ+ S+LRAHSL   I                                  
Subjt:  MSSMISESNISLHSNTS---------IFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII----------------------------------

Query:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS
         LM    A  S+  L H          W+ +  R    T+                  SID Y+ +IK+  D LA+VSV+I DED+L+Y LNGL  EYN+
Subjt:  LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNS

Query:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNK----------ALTSSVNPTTMFARGYNPSQSFFRGRGRNQ
        F+TSI TR E +TL+E++A+LK + + IE  +K           + S+  PT    RGY+P  S  RGRGR +
Subjt:  FRTSICTRGESVTLDELHALLKSKSKFIEQHNK----------ALTSSVNPTTMFARGYNPSQSFFRGRGRNQ

A0A5J5A0G3 Retrotran_gag_3 domain-containing protein3.0e-2335.36Show/hide
Query:  MASSSSSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------------------------------
        MA+++ ++ T +S  S S     S + IFLLSNICNL+  RLD+SNY+ W+FQ+ S+L+AHSL   I                                 
Subjt:  MASSSSSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------------------------------

Query:  -LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYN
          LM    A  S+  L H          W+ +  R    T+                  SID Y+ +IK+  D LA+VSV+I DED+L+Y LNGL  EYN
Subjt:  -LLMDRSPAQPSRFLLRH----------WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYN

Query:  SFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG
        +F+TSI T+ E++TL+E++A+LK + + IE  +K   S   P  M A          RGY+PS    RGRGR    N+GG
Subjt:  SFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG

A0A5J5B9N3 Retrotran_gag_3 domain-containing protein3.6e-2434.63Show/hide
Query:  SNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------LLMDRSPAQPSRFLLRH------------------------------
        S + IFLLSNICNL+  RLD+SNY+ W+FQ+ S+L+AHSL   I          + D   A  ++  L +                              
Subjt:  SNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDII---------LLMDRSPAQPSRFLLRH------------------------------

Query:  -----WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLK
             W+ +  R    T+                  SID Y+ +IK+  D LA VSV+I DED+L+Y LNGL  EYN+F+TSI T+ +++TL+E++A+LK
Subjt:  -----WMQI--RQGGPTK------------------SIDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLK

Query:  SKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG
         + + IE  +K   S   P  M A          RGY+PS    RGRGR    N+GG
Subjt:  SKSKFIEQHNKALTSSVNPTTMFA----------RGYNPSQSFFRGRGR----NQGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.9e-0425.86Show/hide
Query:  IDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLKSKSKFIEQ----------HNKALT----SSVNPTTMF
        + DY  ++K++ D L  V V + D ++++Y LNGL+ ++++    I  R    + D+   +L+ +   +++          H+ + T    S   P T F
Subjt:  IDDYLIRIKEIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLKSKSKFIEQ----------HNKALT----SSVNPTTMF

Query:  ARGYNPSQSFFRGRGR
         R    +Q  +RGRGR
Subjt:  ARGYNPSQSFFRGRGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCTTCGATGGCCTCCTCTTCTTCTTCGATGGCGACAATGTCTTCCATGATCTCTGAGTCCAACATTTCTCTTCATTCCAACACTTCAATCTTTCTTCTGTC
TAATATTTGCAACCTTGTTCCTGTTCGATTGGATGCATCAAACTATCTATTCTGGCGATTCCAAGTCGAATCCATGCTGCGTGCACATTCTCTCTTCGACATTATATTGT
TGATGGATCGAAGCCCTGCCCAGCCAAGCCGCTTTCTCCTTCGTCATTGGATGCAAATCCGCCAAGGAGGACCGACTAAATCCATTGATGATTACTTGATTAGGATTAAG
GAAATTGTTGATAAATTGGCAACTGTTTCGGTTGTAATTAACGATGAAGACGTACTTCTTTACACTCTTAATGGTCTGTCCTCCGAGTATAATTCTTTTAGAACTTCAAT
TTGCACAAGAGGCGAGTCTGTGACTCTCGATGAATTACATGCCCTATTAAAATCGAAGTCCAAGTTTATTGAGCAACATAATAAAGCTTTAACTTCCTCTGTTAATCCTA
CGACAATGTTTGCCCGTGGATATAATCCTTCTCAATCTTTCTTTCGTGGCCGAGGACGAAATCAAGGTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCTTCGATGGCCTCCTCTTCTTCTTCGATGGCGACAATGTCTTCCATGATCTCTGAGTCCAACATTTCTCTTCATTCCAACACTTCAATCTTTCTTCTGTC
TAATATTTGCAACCTTGTTCCTGTTCGATTGGATGCATCAAACTATCTATTCTGGCGATTCCAAGTCGAATCCATGCTGCGTGCACATTCTCTCTTCGACATTATATTGT
TGATGGATCGAAGCCCTGCCCAGCCAAGCCGCTTTCTCCTTCGTCATTGGATGCAAATCCGCCAAGGAGGACCGACTAAATCCATTGATGATTACTTGATTAGGATTAAG
GAAATTGTTGATAAATTGGCAACTGTTTCGGTTGTAATTAACGATGAAGACGTACTTCTTTACACTCTTAATGGTCTGTCCTCCGAGTATAATTCTTTTAGAACTTCAAT
TTGCACAAGAGGCGAGTCTGTGACTCTCGATGAATTACATGCCCTATTAAAATCGAAGTCCAAGTTTATTGAGCAACATAATAAAGCTTTAACTTCCTCTGTTAATCCTA
CGACAATGTTTGCCCGTGGATATAATCCTTCTCAATCTTTCTTTCGTGGCCGAGGACGAAATCAAGGTGGTTGA
Protein sequenceShow/hide protein sequence
MSSSSMASSSSSMATMSSMISESNISLHSNTSIFLLSNICNLVPVRLDASNYLFWRFQVESMLRAHSLFDIILLMDRSPAQPSRFLLRHWMQIRQGGPTKSIDDYLIRIK
EIVDKLATVSVVINDEDVLLYTLNGLSSEYNSFRTSICTRGESVTLDELHALLKSKSKFIEQHNKALTSSVNPTTMFARGYNPSQSFFRGRGRNQGG