; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014797 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014797
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr12:4734277..4736947
RNA-Seq ExpressionLag0014797
SyntenyLag0014797
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025725.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.8e-2562.73Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K++S + SRMKRK SV INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQG
        II TNP ++G
Subjt:  IILTNPTSQG

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.4e-2559.83Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+ S + SRMKRK SV INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQGPDQDHDK
        II TNP ++G ++  D+
Subjt:  IILTNPTSQGPDQDHDK

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]3.1e-2562.73Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+ S + SRMKRK SV INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQG
        II TNP ++G
Subjt:  IILTNPTSQG

KAA0063700.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.1e-2555.04Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +S A  +EENQC TST+TR SAF+RLS+STSKK++PST  FDR+K+T+ Q +R+M +L+ KLF E N D  + S + SRMKRK S+ INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQGPDQ---DHDKIRAFKISFS
        II TNPT++G +Q   +++  R F   FS
Subjt:  IILTNPTSQGPDQ---DHDKIRAFKISFS

TYK18884.1 gag protease polyprotein [Cucumis melo var. makuwa]3.1e-2558.97Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+ S + SRMKRK S+ INT+GSL VKP L
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQGPDQDHDK
        II TNP ++G ++  D+
Subjt:  IILTNPTSQGPDQDHDK

TrEMBL top hitse value%identityAlignment
A0A5A7SMQ5 Retrotransposon gag protein8.9e-2662.73Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K++S + SRMKRK SV INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQG
        II TNP ++G
Subjt:  IILTNPTSQG

A0A5A7TQ06 Retrotransposon gag protein6.8e-2659.83Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+ S + SRMKRK SV INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQGPDQDHDK
        II TNP ++G ++  D+
Subjt:  IILTNPTSQGPDQDHDK

A0A5A7V935 Ty3-gypsy retrotransposon protein2.0e-2555.04Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +S A  +EENQC TST+TR SAF+RLS+STSKK++PST  FDR+K+T+ Q +R+M +L+ KLF E N D  + S + SRMKRK S+ INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQGPDQ---DHDKIRAFKISFS
        II TNPT++G +Q   +++  R F   FS
Subjt:  IILTNPTSQGPDQ---DHDKIRAFKISFS

A0A5D3BBF9 Gag protease polyprotein1.5e-2562.73Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+ S + SRMKRK SV INTEGSL VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQG
        II TNP ++G
Subjt:  IILTNPTSQG

A0A5D3D5Q0 Gag protease polyprotein1.5e-2558.97Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+ S + SRMKRK S+ INT+GSL VKP L
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPTSQGPDQDHDK
        II TNP ++G ++  D+
Subjt:  IILTNPTSQGPDQDHDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATGGCCGCGACAAAAGAAGAAAATCAATGTTCAACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAA
GTAGCATCCTGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGA
CCTGATCAAGACCATGATAAGATAAGAGCTTTTAAAATTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTT
CTTCGTTGTATCCTGTTGCGTTGTTCATTCTCCAAGTTCGAGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTT
GCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAACTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCA
AGGTCGAAGGTTCTCACTCGCTGCGTTGAAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGC
CACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGA
ATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTT
CTCACGCGCATCGCCGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTTCCT
TCTTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCATTCCCCAAGTTCGAAGGTTCTCA
TGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTC
CAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTAGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGATTCTCACGCGCT
TCGTTGCATTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAA
GTTCGAAGGTTCTCACGCGCTTCGCTACAGTTCCTTCCTCCAAGTTCAAAGGTTATCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTG
AAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCA
GTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGG
TTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTT
CCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCT
CACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGG
CTACATTGCTGCTCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGTCCCAAAGCGAGGA
ACATGTCCTTGTACTCGTGCTGAAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGC
ACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCATGGCCGCGACAAAAGAAGAAAATCAATGTTCAACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAA
GTAGCATCCTGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGA
CCTGATCAAGACCATGATAAGATAAGAGCTTTTAAAATTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTT
CTTCGTTGTATCCTGTTGCGTTGTTCATTCTCCAAGTTCGAGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTT
GCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAACTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCA
AGGTCGAAGGTTCTCACTCGCTGCGTTGAAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGC
CACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGA
ATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTT
CTCACGCGCATCGCCGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTTCCT
TCTTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCATTCCCCAAGTTCGAAGGTTCTCA
TGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTC
CAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTAGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGATTCTCACGCGCT
TCGTTGCATTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAA
GTTCGAAGGTTCTCACGCGCTTCGCTACAGTTCCTTCCTCCAAGTTCAAAGGTTATCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTG
AAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCA
GTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGG
TTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTT
CCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCT
CACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGG
CTACATTGCTGCTCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGTCCCAAAGCGAGGA
ACATGTCCTTGTACTCGTGCTGAAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGC
ACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSILSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQG
PDQDHDKIRAFKISFSPSSRVLTLYAIALFLLQVRRFFVVSCCVVHSPSSRFSVVQLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHATLLEFLLPKFEGSHALRCSSFFP
RSKVLTRCVEVLSPQVRRFTHFAVPLPPSSKVLTRIATVPSSKFEGSHAHRHSSFLQVRRFSRASPQFLPPSSNPSSKFEGSHAHRHSSFLQVRRFSRASPQFLPPSSKV
LTRIAAVPSSKFEGSHVFHCSSFLTVRRFSCASLQFLLPSLKVLTSLRCDPSSKFEGSHALRSAIHSPSSKVLMRFVQFLPPNSKVLTRFVAFPSPKFEGSHVLRCSSFL
QVRRFSRASLCNSFPKLEGSHALRAVPSSKFEDSHALRCISFPPSSKVLTCFAAFPSPQIRRFSRASLQFLPPSSKVLTRFATVPSSKFKGYHVASLRSCASLQFLPPSL
KVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLMRFVQFLPPNSKVLTRFVAFPSPQIRRFSRASLQFLPPSSKVLTRFALQFLPPSSKVLTRFVQFLPPNSKVLTRFVAV
PSPKFEGSHALRCSSFPQVRRFSRTSLQFLPPSSKVLTRFAALQRYFLKSKDVNCPHAALLPSPSSRVLMLRSATLLLYFLKSKDVNCPCTHAEKGMAATQVQGHVPKRG
TCPCTRAERRGGGTSPRNMSQLKEHVRALVLKGVAAAQVQGTCPNSRNTSLHSC