; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038825 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038825
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEnzymatic polyprotein
Genome locationchr2:28027833..28033641
RNA-Seq ExpressionLag0038825
SyntenyLag0038825
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]6.4e-2429.56Show/hide
Query:  KAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQ
        +AFL K+FPP K+ +LRT IGTF+Q  DEQL+EAWER+K+LLR+CP+HGYP WL++ +                    + SK  + A T+LE++ T SY 
Subjt:  KAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQ

Query:  WPPERSKLK-KIVVGVFEIDNVQGV------------------------------------------------------LNRSSLQLP--LHPRLR----
        WP ER+        G++E+D V  +                                                       N    QLP   HP LR    
Subjt:  WPPERSKLK-KIVVGVFEIDNVQGV------------------------------------------------------LNRSSLQLP--LHPRLR----

Query:  -----------------------------------RKILSMTSKLEEAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAV
                                           ++  S T+ LE +V AI +TV     A++N+E Q  Q+ + ++TM KGK  +  E +P E CKAV
Subjt:  -----------------------------------RKILSMTSKLEEAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAV

Query:  TVHHEEELH---IAEEDE
        T+   ++L    I +EDE
Subjt:  TVHHEEELH---IAEEDE

XP_018826494.1 uncharacterized protein LOC108995373 [Juglans regia]7.1e-2331.91Show/hide
Query:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENART-LLEEMTTNSYQW
        FL KFFPP KT +LR+ I  F+Q   E L+EAWER+K L+R CP+HG P WL++ +                    L+SKT+E A T LLEEMT+N+YQW
Subjt:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENART-LLEEMTTNSYQW

Query:  PPERSKLKK---IVVGVFEIDNVQGVLNRSSLQLPLHPRLRRKILSMTSK-----------------LEEAVIA------------------INTTVNGH
        P E++  KK   I +       +Q V   +S+ +P +   + ++  + ++                 LE+A+I+                  I+   +  
Subjt:  PPERSKLKK---IVVGVFEIDNVQGVLNRSSLQLPLHPRLRRKILSMTSK-----------------LEEAVIA------------------INTTVNGH

Query:  STAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDET
          AIKNIE Q  +L +++    +G   +  E +P E CKA+T+    EL  +   ET
Subjt:  STAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDET

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]3.5e-3035.13Show/hide
Query:  VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNS
        +++AFL  FFPP KT +LRT I +F++   EQLFE WER+KELLRKCP+HG   WL++ +                    LLS+T ENA  LL++M  NS
Subjt:  VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNS

Query:  YQWPPERSKLKKIVVGVFEID-------NVQGVLNRSS-----------------------------LQLPLHPRLRRKIL------------SMTSKLE
        +QWP ERS  KK V G++EID        VQ + N  S                              Q   HP  ++  L            S  S++E
Subjt:  YQWPPERSKLKKIVVGVFEID-------NVQGVLNRSS-----------------------------LQLPLHPRLRRKIL------------SMTSKLE

Query:  EAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDETTKPKELTGE
          V  +   + G++T+IKN+E Q  Q+   + TM KGK  ++ E  P E+CKAVT+   +EL   E+ +  +P   T E
Subjt:  EAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDETTKPKELTGE

XP_024025398.1 uncharacterized protein LOC112092750 [Morus notabilis]7.1e-2331.7Show/hide
Query:  VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLH-------------------VGLLSKTVENARTLLEEMTTNS
        +++ FL KFFPP KTVK+   I  F Q   E L+EAWER+KEL+R+C  HG P W+++H                     L+ KT + A  LLE+M TN+
Subjt:  VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLH-------------------VGLLSKTVENARTLLEEMTTNS

Query:  YQWPPERSKLKKIVVGVFEIDNVQGV---------LNRSSLQLPLHPRLRRKILSMTSK----------LEEAVIAI-----------NTTVNGHSTAIK
        YQWP ERS  KKI  G+ E++ +  +         L   S+    HP     ILS   +          LE+A+  +            T     S  I+
Subjt:  YQWPPERSKLKKIVVGVFEIDNVQGV---------LNRSSLQLPLHPRLRRKILSMTSK----------LEEAVIAI-----------NTTVNGHSTAIK

Query:  NIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEE--------DETTKPKE
        ++E Q  QL S++    +G   +    +P E CKA+T+   +EL+   E        +E + PK+
Subjt:  NIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEE--------DETTKPKE

XP_030964936.1 uncharacterized protein LOC115986224 [Quercus lobata]8.4e-2431.77Show/hide
Query:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQWP
        FL K FPP KT +LR+ IG F+Q   E L+EAWER+K+L+R+CP+HG P WL++ +                    L+SKT E A +LLEEM +N+YQW 
Subjt:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQWP

Query:  PERSKLKKIVVGVFEID--------------NVQGVLNR------SSLQLPLHPRLR----------RKILSMT-----------SKLEEAVIA------
         ER+  KK V G+ E+D               VQ + NR      + +    HPRLR          + +L  T             LE+A+I+      
Subjt:  PERSKLKKIVVGVFEID--------------NVQGVLNR------SSLQLPLHPRLR----------RKILSMT-----------SKLEEAVIA------

Query:  ------------INTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDET
                    I T  +     +KN+E Q  QL + +    +G   +  E +P E CKA+T+    E+      ET
Subjt:  ------------INTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDET

TrEMBL top hitse value%identityAlignment
A0A1U7Z6K8 uncharacterized protein LOC1045909354.7e-2029.78Show/hide
Query:  VNNQAEKERGVPSLKYFLTLLLSHSIVN-DPNSRGAIAFF-----------VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPK
        +N QA     +P++   + + +  S  N DPNS   IA F           + + FL K+F P KT KLR  I TF Q  +E L+E+WERFKE+LRK P 
Subjt:  VNNQAEKERGVPSLKYFLTLLLSHSIVN-DPNSRGAIAFF-----------VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPK

Query:  HGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQWPPERSKLKKIVVGVFEIDNVQ----------------GVLNRSSLQ
        HG P W+++                      L+ KT E A  LLEEM  NSYQW  E+S  K   VG++  D++                 GV  RS   
Subjt:  HGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQWPPERSKLKKIVVGVFEIDNVQ----------------GVLNRSSLQ

Query:  LP-----------------------LHPRLRRKILSMTSKLEEAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHH
        +P                          + ++        LE+ +      +N     I+ IETQ  QL   +    +G      EK+P E  KA+T+  
Subjt:  LP-----------------------LHPRLRRKILSMTSKLEEAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHH

Query:  EEELHIAE-EDETTKPKEL
         +EL   E +DE  K +E+
Subjt:  EEELHIAE-EDETTKPKEL

A0A2I4F4C8 uncharacterized protein LOC1089953733.4e-2331.91Show/hide
Query:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENART-LLEEMTTNSYQW
        FL KFFPP KT +LR+ I  F+Q   E L+EAWER+K L+R CP+HG P WL++ +                    L+SKT+E A T LLEEMT+N+YQW
Subjt:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENART-LLEEMTTNSYQW

Query:  PPERSKLKK---IVVGVFEIDNVQGVLNRSSLQLPLHPRLRRKILSMTSK-----------------LEEAVIA------------------INTTVNGH
        P E++  KK   I +       +Q V   +S+ +P +   + ++  + ++                 LE+A+I+                  I+   +  
Subjt:  PPERSKLKK---IVVGVFEIDNVQGVLNRSSLQLPLHPRLRRKILSMTSK-----------------LEEAVIA------------------INTTVNGH

Query:  STAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDET
          AIKNIE Q  +L +++    +G   +  E +P E CKA+T+    EL  +   ET
Subjt:  STAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDET

A0A3S3N117 Retrotrans_gag domain-containing protein3.2e-2146.77Show/hide
Query:  KAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQ
        K FL KFFPPTKTVK+R  I TF Q   E L+EAWER+KELLRKCP HG P W+++                      L+ K+ E A  L+EEM TN+YQ
Subjt:  KAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQ

Query:  WPPERSKLKKIVVGVFEIDNVQGV
        WP +  + KKI  GV E+D++  +
Subjt:  WPPERSKLKKIVVGVFEIDNVQGV

A0A6J0ZYV0 uncharacterized protein LOC1104134131.4e-1948.72Show/hide
Query:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQWP
        FL KFFPP KT K+R  I +F Q   E L+EAWERFKELLR+CP HG P WL++                      L+SK   +A  LLEEM +N+YQWP
Subjt:  FLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNSYQWP

Query:  PERSKLKKIVVGVFEID
         ERS  +K  VG +EID
Subjt:  PERSKLKKIVVGVFEID

A0A6J1DU19 uncharacterized protein LOC1110243611.7e-3035.13Show/hide
Query:  VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNS
        +++AFL  FFPP KT +LRT I +F++   EQLFE WER+KELLRKCP+HG   WL++ +                    LLS+T ENA  LL++M  NS
Subjt:  VIKAFLKKFFPPTKTVKLRTGIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHV-------------------GLLSKTVENARTLLEEMTTNS

Query:  YQWPPERSKLKKIVVGVFEID-------NVQGVLNRSS-----------------------------LQLPLHPRLRRKIL------------SMTSKLE
        +QWP ERS  KK V G++EID        VQ + N  S                              Q   HP  ++  L            S  S++E
Subjt:  YQWPPERSKLKKIVVGVFEID-------NVQGVLNRSS-----------------------------LQLPLHPRLRRKIL------------SMTSKLE

Query:  EAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDETTKPKELTGE
          V  +   + G++T+IKN+E Q  Q+   + TM KGK  ++ E  P E+CKAVT+   +EL   E+ +  +P   T E
Subjt:  EAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDETTKPKELTGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATACTAACATCTCTCCTAGAGCCTTGAGATCTTCACCTAAAGGATCTACAGTTCTCTTAGAAGCAAATCTCAATAGATCATCAATCACAGTTCCTAAAAGTTTATC
ATGGGAACAAATTACTATAAATCCAACATGGAGACTTACGGAAGCCTTCACTCCACCAAAGAAAAATTCAAACCTAGCACAAATTGTGGAATATCCAGATGAATCAGTAG
AAGTTCAATTTTCTGAAGAACCAGAGACCTCAAAGGTTAAAGACTTCATGTCTTCTCGACCTAGCACATCTGGAACAACTTCAGAAATCGAATCAAAATACTATATGGAT
AGATCTAACTCATTAAGAAATCTGAAAACGAAATCAACATGCTTCAAGAATTCTGGAAAACCACTCAAAGAGGAGTATTCATTCTACTCATCCTCCTCTAGAGAGATCGA
GTTTGATACAATTTATGGTGAGAAGGTCAAAGCTAGTCCATTGAAAGGTGATCCAGCTCAATACCAAGCCAGATCGGCAGATATCCTTTTAAATCTCAAATGTCCAACAC
TAGGAGATTTCAGATGGTGCGATCAAAATTGTATTCGAGAAGGATTAATACCTTCAATATACTTCGAAAAAACTTCTGAATCTCTTCGTGGTGCTAAGAATGATCAACTC
ATGATCAATTGTAAATTATCAAAAGTCCATGTTTGCAATGAAGGAGTTTGCTTTGTAAACTCTTTTCTTTTAGTCAAAGATTTAGGACAAGAATTGATCCTAGAAAGGAC
TCATCAGATCTTCAAAAGCCCTTGGTCATGTGCAGCATTTTATGTCAATAACCAAGCAGAGAAGGAGCGAGGTGTTCCAAGTTTGAAATATTTCCTCACTCTCCTACTTT
CTCACTCTATTGTAAACGATCCTAATTCTCGAGGAGCAATTGCATTCTTTGTCATTAAGGCATTCTTGAAGAAGTTCTTTCCTCCTACTAAGACGGTCAAGCTGAGGACC
GGGATTGGGACGTTCCAGCAGCAGTTTGATGAGCAGTTGTTCGAGGCTTGGGAGCGATTTAAAGAGCTGCTGAGGAAGTGTCCTAAGCATGGTTATCCCTACTGGCTCAA
GCTGCACGTGGGACTGTTGTCCAAAACCGTTGAGAATGCTAGGACTTTGCTAGAGGAAATGACCACCAATAGCTATCAGTGGCCACCTGAGCGGTCGAAACTGAAAAAGA
TTGTTGTTGGAGTGTTCGAGATTGACAACGTACAAGGAGTGCTCAATCGATCGAGTCTGCAGCTGCCCTTGCATCCCAGACTTAGGAGGAAAATCTTGAGCATGACCAGC
AAGCTTGAGGAGGCCGTGATTGCCATAAACACCACTGTCAATGGTCATTCTACAGCCATCAAGAACATAGAGACTCAGTTCAGACAACTGGTAAGTGTTGTCAAGACCAT
GAATAAAGGTAAAGCCTCAGCTGAACAAGAGAAGTCTCCATTGGAGTACTGCAAAGCCGTCACTGTGCATCACGAGGAGGAATTGCACATAGCTGAGGAAGATGAGACTA
CTAAACCAAAGGAACTCACTGGAGAAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATACTAACATCTCTCCTAGAGCCTTGAGATCTTCACCTAAAGGATCTACAGTTCTCTTAGAAGCAAATCTCAATAGATCATCAATCACAGTTCCTAAAAGTTTATC
ATGGGAACAAATTACTATAAATCCAACATGGAGACTTACGGAAGCCTTCACTCCACCAAAGAAAAATTCAAACCTAGCACAAATTGTGGAATATCCAGATGAATCAGTAG
AAGTTCAATTTTCTGAAGAACCAGAGACCTCAAAGGTTAAAGACTTCATGTCTTCTCGACCTAGCACATCTGGAACAACTTCAGAAATCGAATCAAAATACTATATGGAT
AGATCTAACTCATTAAGAAATCTGAAAACGAAATCAACATGCTTCAAGAATTCTGGAAAACCACTCAAAGAGGAGTATTCATTCTACTCATCCTCCTCTAGAGAGATCGA
GTTTGATACAATTTATGGTGAGAAGGTCAAAGCTAGTCCATTGAAAGGTGATCCAGCTCAATACCAAGCCAGATCGGCAGATATCCTTTTAAATCTCAAATGTCCAACAC
TAGGAGATTTCAGATGGTGCGATCAAAATTGTATTCGAGAAGGATTAATACCTTCAATATACTTCGAAAAAACTTCTGAATCTCTTCGTGGTGCTAAGAATGATCAACTC
ATGATCAATTGTAAATTATCAAAAGTCCATGTTTGCAATGAAGGAGTTTGCTTTGTAAACTCTTTTCTTTTAGTCAAAGATTTAGGACAAGAATTGATCCTAGAAAGGAC
TCATCAGATCTTCAAAAGCCCTTGGTCATGTGCAGCATTTTATGTCAATAACCAAGCAGAGAAGGAGCGAGGTGTTCCAAGTTTGAAATATTTCCTCACTCTCCTACTTT
CTCACTCTATTGTAAACGATCCTAATTCTCGAGGAGCAATTGCATTCTTTGTCATTAAGGCATTCTTGAAGAAGTTCTTTCCTCCTACTAAGACGGTCAAGCTGAGGACC
GGGATTGGGACGTTCCAGCAGCAGTTTGATGAGCAGTTGTTCGAGGCTTGGGAGCGATTTAAAGAGCTGCTGAGGAAGTGTCCTAAGCATGGTTATCCCTACTGGCTCAA
GCTGCACGTGGGACTGTTGTCCAAAACCGTTGAGAATGCTAGGACTTTGCTAGAGGAAATGACCACCAATAGCTATCAGTGGCCACCTGAGCGGTCGAAACTGAAAAAGA
TTGTTGTTGGAGTGTTCGAGATTGACAACGTACAAGGAGTGCTCAATCGATCGAGTCTGCAGCTGCCCTTGCATCCCAGACTTAGGAGGAAAATCTTGAGCATGACCAGC
AAGCTTGAGGAGGCCGTGATTGCCATAAACACCACTGTCAATGGTCATTCTACAGCCATCAAGAACATAGAGACTCAGTTCAGACAACTGGTAAGTGTTGTCAAGACCAT
GAATAAAGGTAAAGCCTCAGCTGAACAAGAGAAGTCTCCATTGGAGTACTGCAAAGCCGTCACTGTGCATCACGAGGAGGAATTGCACATAGCTGAGGAAGATGAGACTA
CTAAACCAAAGGAACTCACTGGAGAAGCTTAG
Protein sequenceShow/hide protein sequence
MNTNISPRALRSSPKGSTVLLEANLNRSSITVPKSLSWEQITINPTWRLTEAFTPPKKNSNLAQIVEYPDESVEVQFSEEPETSKVKDFMSSRPSTSGTTSEIESKYYMD
RSNSLRNLKTKSTCFKNSGKPLKEEYSFYSSSSREIEFDTIYGEKVKASPLKGDPAQYQARSADILLNLKCPTLGDFRWCDQNCIREGLIPSIYFEKTSESLRGAKNDQL
MINCKLSKVHVCNEGVCFVNSFLLVKDLGQELILERTHQIFKSPWSCAAFYVNNQAEKERGVPSLKYFLTLLLSHSIVNDPNSRGAIAFFVIKAFLKKFFPPTKTVKLRT
GIGTFQQQFDEQLFEAWERFKELLRKCPKHGYPYWLKLHVGLLSKTVENARTLLEEMTTNSYQWPPERSKLKKIVVGVFEIDNVQGVLNRSSLQLPLHPRLRRKILSMTS
KLEEAVIAINTTVNGHSTAIKNIETQFRQLVSVVKTMNKGKASAEQEKSPLEYCKAVTVHHEEELHIAEEDETTKPKELTGEA