; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032402 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032402
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr11:31995386..31999043
RNA-Seq ExpressionLag0032402
SyntenyLag0032402
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030478190.1 uncharacterized protein LOC115695250 [Cannabis sativa]8.9e-3732.35Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMR-----------KKL
        EDPHLH++SFL VSDSF ++GVS +ALRL LFP+SLRD A+AWLN+ +P S++ WN LVEKFL KYFPPTR+AK +SEI+ F+Q+            K+L
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMR-----------KKL

Query:  LAR-----------------------------------LGK-----------------------------------------------------------
        L +                                   L K                                                           
Subjt:  LAR-----------------------------------LGK-----------------------------------------------------------

Query:  --------------------------------------GNQ----RNNPYSNFYNP------------------GIAPQNKQAL---------------P
                                              GNQ     NNPYSN YNP                  G   Q KQ                 P
Subjt:  --------------------------------------GNQ----RNNPYSNFYNP------------------GIAPQNKQAL---------------P

Query:  QQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSE-------LGSGQYDGGSSKDAGAIS
        Q +  +SLE++M+DYMA+ND IIQSQ ASLR LE Q+GQLAN+LK R QG +PSD ++P R+GK+  +AVTLRSE         +G  +  S +  G I 
Subjt:  QQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSE-------LGSGQYDGGSSKDAGAIS

Query:  SVPDV
          P +
Subjt:  SVPDV

XP_030494694.1 uncharacterized protein LOC115710474 [Cannabis sativa]1.3e-4035.96Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMR-----------KKL
        EDPHLH++SFL VSDSF ++GVS +ALRL LFP+SLRD A+AWLN+  P S++ WN L EKFL KYFPPTR+AK RSEI+ F+Q+            K+L
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMR-----------KKL

Query:  LAR-----------------------------------------------------------------------------------LGKG----------
        L +                                                                                    G G          
Subjt:  LAR-----------------------------------------------------------------------------------LGKG----------

Query:  -----------NQRNNPYSNFYNP------------------GIAPQNKQAL---------------PQQNSGNSLESMMKDYMARNDVIIQSQQASLRV
                   N+ NNPYSN YNP                  G   Q KQ+                PQ +  +SLES+M+DYM +ND +IQSQ ASLR 
Subjt:  -----------NQRNNPYSNFYNP------------------GIAPQNKQAL---------------PQQNSGNSLESMMKDYMARNDVIIQSQQASLRV

Query:  LEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS
        LE Q+GQLAN+LK RPQG +PSD E+P R+ K+  +AVTLRS
Subjt:  LEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]7.3e-3934.34Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQ----------------
        EDPHLH++SFL VSDSF ++GVS +ALRL LFP+SLRD A+AWLN+  P S++ WN L EKFL KYFPPTR+AK RSEI+ F+Q                
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQ----------------

Query:  MRK--------------------------------------------KLLARL-----------------------------------------------
        +RK                                            ++L R+                                               
Subjt:  MRK--------------------------------------------KLLARL-----------------------------------------------

Query:  -------------------GKG---------------------NQRNNPYSNFYN--------------------PGIAPQNKQALPQQNSG---NSLES
                           G G                     N+ NNPYSN YN                    PG + Q +   P Q  G   +SLES
Subjt:  -------------------GKG---------------------NQRNNPYSNFYN--------------------PGIAPQNKQALPQQNSG---NSLES

Query:  MMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS
        +M+DYMA+ND +IQSQ ASLR LE Q+GQLAN+LK RPQG +PSD E+P R+GK+  +AVTLRS
Subjt:  MMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS

XP_030509064.1 uncharacterized protein LOC115723726 [Cannabis sativa]9.5e-3934.68Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQ----------------
        EDPHLH++SFL VSDSF ++GVS +ALRL LFP+SLRD A+AWLN+  P S++ WN L EKFL KYFPP R+AK +SEI+ F+Q                
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQ----------------

Query:  MRK--------------------------------------------KLLARLGKGNQR-----------------------------------------
        +RK                                            ++L R+   N +                                         
Subjt:  MRK--------------------------------------------KLLARLGKGNQR-----------------------------------------

Query:  ------------------NNPYSNFYNP------------------GIAPQNKQAL---------------PQQNSGNSLESMMKDYMARNDVIIQSQQA
                          NNPYSN YNP                  G   Q KQ+                PQ +  +SLES+M+DYMA+ND IIQSQ A
Subjt:  ------------------NNPYSNFYNP------------------GIAPQNKQAL---------------PQQNSGNSLESMMKDYMARNDVIIQSQQA

Query:  SLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS
        SL+ L+ Q+GQLAN+LK RPQG +PSD E+P R+ K+  +AVTLRS
Subjt:  SLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]4.4e-3633.06Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMR-----------KKL
        EDPHLH+ SFL VSDSF ++GVS +ALRL LFP+SLRD A+AWLN+    S++ WN L E FL KYFPPTR+AK RSEI+ F+Q+            K+L
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMR-----------KKL

Query:  LAR-------------------------------------------------------------------------------------------------
        L +                                                                                                 
Subjt:  LAR-------------------------------------------------------------------------------------------------

Query:  ------------------LGKG---------------------NQRNNPYSNFYNP------------------GIAPQNKQAL----------PQQNSG
                           G G                     N+ NNPYSN YNP                  G   Q KQ+           PQ +  
Subjt:  ------------------LGKG---------------------NQRNNPYSNFYNP------------------GIAPQNKQAL----------PQQNSG

Query:  NSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS
        +SLES+M+DYMA+ND +IQSQ ASLR LE Q+GQLAN+LK RPQG +PSD E+P R+ K+  +AVTLRS
Subjt:  NSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS

TrEMBL top hitse value%identityAlignment
A0A6J1DTD1 uncharacterized protein LOC1110241364.3e-2934.15Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMRKK---------LLA
        EDPH HLK  +GV +SF  EG+S+  +RL LFP+SLRD A+ WL S    SI++W+ L EKFL KYFPP ++AK R+EI  F+Q   +         +L 
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMRKK---------LLA

Query:  RLGKGNQR---------------------------------------------------------------------------NNPYSN----FYNPGI-
        R+   N                                                                              NP S+     +N G  
Subjt:  RLGKGNQR---------------------------------------------------------------------------NNPYSN----FYNPGI-

Query:  -APQNKQALPQQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS
         AP  +Q    + S  SLE +MK YMA ND  ++ Q + LR LE QVGQLA +L +RP G +PSD E P R+GK+Q +A+TL S
Subjt:  -APQNKQALPQQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS

A0A6J1DVZ9 uncharacterized protein LOC1110249704.1e-2740.38Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRS----------------------AKLRSE
        EDPHLHL+ FL VSDSF ++ VS++ALRL LFPY L D  + WLNS    SI++WN L EKF ++ F P +                       A L  +
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRS----------------------AKLRSE

Query:  IVGFRQMRKKLLAR--------LGKG-----NQRNNPYSN--FYNPGIAPQNKQ--ALPQQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLA
        I     M K +           LG+      NQ  + Y       PG+   N+Q    P  NS NS+E+MM++YM RND +IQSQ A  R LE Q+GQ+A
Subjt:  IVGFRQMRKKLLAR--------LGKG-----NQRNNPYSN--FYNPGIAPQNKQ--ALPQQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLA

Query:  NELKARPQ
        N+LK RP+
Subjt:  NELKARPQ

A0A6J1DW02 uncharacterized protein LOC1110248971.3e-2830.89Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQM---------------
        EDPH HLKSF+ ++++F + G++ DA  LTLFP+SL+D A+  LN+F   SI+TW  LVEKFL+K+FPPTR A +R EI+ FRQ                
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQM---------------

Query:  -------------------------RKKLLARLGKG---------------------------------------------------------NQR----
                                  K +L     G                                                         NQR    
Subjt:  -------------------------RKKLLARLGKG---------------------------------------------------------NQR----

Query:  ----NNP-----------------------------YSNF----------YNPGIAPQNKQAL----------------------PQQNSGNSLESMMKD
             NP                             + NF          +N G + QNKQ                        P QN+ ++LE+MMK+
Subjt:  ----NNP-----------------------------YSNF----------YNPGIAPQNKQAL----------------------PQQNSGNSLESMMKD

Query:  YMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSELGSGQYDG
        YMAR D +IQSQ AS+R    Q+G LANELK RPQG+ P   E P REGK+Q +AVTLRS L    YDG
Subjt:  YMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSELGSGQYDG

A0A6J1G7Q6 uncharacterized protein LOC1114515981.1e-2929.4Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMRKKLLA---------
        +DPHLHLKSFLGVSDSF  +GV +D +RL+ F YSLRDGAK+WLN  A   I +WN L EKFL KYFPPTRSA+ R+EIV F++   + L+         
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMRKKLLA---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------RLGKGNQRNNPYSNFYNP-----------GIAPQNKQALPQQN----------
                                                       +  +GN + NP SN YNP           G    N+Q  P+ N          
Subjt:  -----------------------------------------------RLGKGNQRNNPYSNFYNP-----------GIAPQNKQALPQQN----------

Query:  ---------------------SGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREG
                             SG  LES++K+YMARND +IQSQQ SLR LE QVGQLANEL+ RP G +P+D E P REG
Subjt:  ---------------------SGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREG

A0A6J1H7E4 uncharacterized protein LOC1114611689.6e-2967.03Show/hide
Query:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMRKKLLA
        EDPHLHLKSFLGVSDSF  +GV +D +RL+LFPYSLRDGAK+WLN+ AP +I +WN L EKFL KYFPPTR+A+ R+EIV F+Q   + L+
Subjt:  EDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSEIVGFRQMRKKLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCAAAATCCGCCGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGATCCTATCCTTAGATGGTTGGCCAATTCCATGGTTTACCATCGAGGACCCTCATCT
TCATCTTAAGTCTTTTCTAGGAGTTAGTGATTCATTTGTTATCGAGGGAGTGTCGAGAGATGCCCTTAGATTAACCCTATTTCCTTATTCCCTTAGAGATGGAGCAAAGG
CATGGTTGAATTCTTTTGCTCCAGCATCGATAAGTACGTGGAATGGGCTAGTAGAGAAATTTCTTAGTAAGTATTTTCCACCAACTAGGAGTGCCAAGTTGAGGAGTGAG
ATAGTGGGATTTAGGCAAATGAGGAAGAAACTTTTAGCGAGGCTTGGGAAAGGTAATCAAAGGAATAACCCATATTCAAATTTTTATAATCCAGGCATTGCCCCACAAAA
TAAGCAGGCTTTGCCCCAGCAAAATTCGGGTAATTCTCTGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCAAAGTCAGCAGGCTTCATTGAGAG
TCCTAGAGTTTCAGGTGGGCCAGCTAGCTAATGAGTTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGTGCAG
GCAGTGACTTTAAGGAGTGAATTGGGGTCTGGTCAATATGATGGAGGCAGCAGCAAAGATGCTGGAGCAATTAGTTCTGTTCCAGATGTGTTGGATGAGATTGCTGAGGA
CCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGCATAAATTTGTAGCTCATA
TTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCGTGAGCCG
GAGCCCACTGTTGATCAACTAGAATGGGAGTTGTACGCCAACATCGATGAAAATGAAGGATTCTTGATTATTGTTCGTAGAGTTGTTGTTGACTGGAGCCATGTAGTGAT
TAATTATCTATTTAGTTTGCAAGATTTCCCTCACACTGTTTTCAACGCAATATTAGTTGTTTCCTCAAACGAGCAACTAAATGCCCAGTGGAGGTTGTCTAGTAAAGGGC
AGAACATTCCAGTCAGCATACATGAAGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCAAAATCCGCCGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTAAGATCCTATCCTTAGATGGTTGGCCAATTCCATGGTTTACCATCGAGGACCCTCATCT
TCATCTTAAGTCTTTTCTAGGAGTTAGTGATTCATTTGTTATCGAGGGAGTGTCGAGAGATGCCCTTAGATTAACCCTATTTCCTTATTCCCTTAGAGATGGAGCAAAGG
CATGGTTGAATTCTTTTGCTCCAGCATCGATAAGTACGTGGAATGGGCTAGTAGAGAAATTTCTTAGTAAGTATTTTCCACCAACTAGGAGTGCCAAGTTGAGGAGTGAG
ATAGTGGGATTTAGGCAAATGAGGAAGAAACTTTTAGCGAGGCTTGGGAAAGGTAATCAAAGGAATAACCCATATTCAAATTTTTATAATCCAGGCATTGCCCCACAAAA
TAAGCAGGCTTTGCCCCAGCAAAATTCGGGTAATTCTCTGGAGTCAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCAAAGTCAGCAGGCTTCATTGAGAG
TCCTAGAGTTTCAGGTGGGCCAGCTAGCTAATGAGTTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGGTGCAG
GCAGTGACTTTAAGGAGTGAATTGGGGTCTGGTCAATATGATGGAGGCAGCAGCAAAGATGCTGGAGCAATTAGTTCTGTTCCAGATGTGTTGGATGAGATTGCTGAGGA
CCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGCATAAATTTGTAGCTCATA
TTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAGAACAAATGAAAGAATTTTTCCGTGAGCCG
GAGCCCACTGTTGATCAACTAGAATGGGAGTTGTACGCCAACATCGATGAAAATGAAGGATTCTTGATTATTGTTCGTAGAGTTGTTGTTGACTGGAGCCATGTAGTGAT
TAATTATCTATTTAGTTTGCAAGATTTCCCTCACACTGTTTTCAACGCAATATTAGTTGTTTCCTCAAACGAGCAACTAAATGCCCAGTGGAGGTTGTCTAGTAAAGGGC
AGAACATTCCAGTCAGCATACATGAAGTGTGA
Protein sequenceShow/hide protein sequence
MQQNPPLEQNGQQNNQAKILSLDGWPIPWFTIEDPHLHLKSFLGVSDSFVIEGVSRDALRLTLFPYSLRDGAKAWLNSFAPASISTWNGLVEKFLSKYFPPTRSAKLRSE
IVGFRQMRKKLLARLGKGNQRNNPYSNFYNPGIAPQNKQALPQQNSGNSLESMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQ
AVTLRSELGSGQYDGGSSKDAGAISSVPDVLDEIAEDHFEKELMEYHTQKFGEIQIEDLEIGGLEHEHKFVAHIKAVKTPWYDDFSNYLDFGNLPPGLSKEQMKEFFREP
EPTVDQLEWELYANIDENEGFLIIVRRVVVDWSHVVINYLFSLQDFPHTVFNAILVVSSNEQLNAQWRLSSKGQNIPVSIHEV