; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000884 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000884
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:18811303..18813207
RNA-Seq ExpressionLag0000884
SyntenyLag0000884
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025725.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.0e-2557.5Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK R S S FDRLK+T+DQ QR++ +L+ K F E N D+K+ S + S MKRK SV INTEG L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKT
        II TNP ++G  +  DE+K+
Subjt:  IILTNPTSQGSDQDHDEDKT

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-2557.5Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK R S S FDRLK+T+DQ QR++ +L+ K F E N D+K+ S + S MKRK SV INTEG L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKT
        II TNP ++G ++  DE+K+
Subjt:  IILTNPTSQGSDQDHDEDKT

KAA0052018.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.4e-2658.33Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  KEENQC TST+ R SAF+RLS+STSKK R S S FDRLK+T+DQ QR++  L+ K F E N D+K+ S + SHMKRK SV INT+G L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKT
        II TNP ++G ++  DE+K+
Subjt:  IILTNPTSQGSDQDHDEDKT

KAA0063700.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.1e-2654.33Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +S A  +EENQC TST+TR SAF+RLS+STSKK+R S   FDR+K+T+ Q QR++ +L+ KLF E N DN + S + S MKRK S+ INTEG L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKTVRCCVVL
        II TNPT++G +Q  DE++T R  + +
Subjt:  IILTNPTSQGSDQDHDEDKTVRCCVVL

TYK06348.1 gag protease polyprotein [Cucumis melo var. makuwa]2.7e-2557.14Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  ++ENQC TSTFTR SAF+ LS+STSKK R S   FD LK+T+DQ QR++  L  K F E N D+K+ S + SHMKRK S+ INTEGFL +K   
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDK
        II TNPT++G +Q  DE+K
Subjt:  IILTNPTSQGSDQDHDEDK

TrEMBL top hitse value%identityAlignment
A0A5A7SMQ5 Retrotransposon gag protein9.9e-2657.5Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK R S S FDRLK+T+DQ QR++ +L+ K F E N D+K+ S + S MKRK SV INTEG L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKT
        II TNP ++G  +  DE+K+
Subjt:  IILTNPTSQGSDQDHDEDKT

A0A5A7TQ06 Retrotransposon gag protein7.6e-2657.5Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  +EENQC TST+ R SAF+RLS+STSKK R S S FDRLK+T+DQ QR++ +L+ K F E N D+K+ S + S MKRK SV INTEG L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKT
        II TNP ++G ++  DE+K+
Subjt:  IILTNPTSQGSDQDHDEDKT

A0A5A7U9V3 Retrotransposon gag protein1.2e-2658.33Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  KEENQC TST+ R SAF+RLS+STSKK R S S FDRLK+T+DQ QR++  L+ K F E N D+K+ S + SHMKRK SV INT+G L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKT
        II TNP ++G ++  DE+K+
Subjt:  IILTNPTSQGSDQDHDEDKT

A0A5A7V935 Ty3-gypsy retrotransposon protein4.4e-2654.33Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +S A  +EENQC TST+TR SAF+RLS+STSKK+R S   FDR+K+T+ Q QR++ +L+ KLF E N DN + S + S MKRK S+ INTEG L VKP  
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDKTVRCCVVL
        II TNPT++G +Q  DE++T R  + +
Subjt:  IILTNPTSQGSDQDHDEDKTVRCCVVL

A0A5D3C5C5 Gag protease polyprotein1.3e-2557.14Show/hide
Query:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL
        +SMA  ++ENQC TSTFTR SAF+ LS+STSKK R S   FD LK+T+DQ QR++  L  K F E N D+K+ S + SHMKRK S+ INTEGFL +K   
Subjt:  MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNL

Query:  IILTNPTSQGSDQDHDEDK
        II TNPT++G +Q  DE+K
Subjt:  IILTNPTSQGSDQDHDEDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGCTGCGACAAAAGAAGAAAATCAATGTTCCACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
AATATCTGTCTTTGATCGCCTCAAAGTAACAGACGATCAACCTCAAAGAAAGATAGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAGCGACAATAAGCTTCTTA
GTAGCATCCTATCACATATGAAGAGGAAGTTTTCTGTTCTCATAAATACAGAAGGTTTCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGA
TCTGATCAAGACCATGATGAAGATAAGACTGTACGTTGCTGCGTTGTTCTTTCTCCAAGTTTGAGGTTTCTTACGTGGCACGTTACTGCATTTTTCCTTCTCACCAAGTT
TGAAGGTTCTCACGTTGCTACGCTGCCAGTCCATTCCTTCACCAAGTTCAAAGGTTCTCAGTTGTATGAATGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGT
TGTGCACTGCTGCATTGTTCCCTCTTCTCTCAAGTTCGAAGTTCCTTCTCTCAAGTTTGAAGGTTCTCATGCTGCTTGACTGCAATTCCTTCTCTCCAAGTTCGAAGATT
CTCATGCTCTTCGTTGTAGTTCCTTCTCTTCAAGTTCGAAGGTTCACACGTTGCGCTGCTTCTTCACCAAGTTCGAAGTTCCTTCTCCCAAGAGTTCACCCACTGTATGC
TGCTATTTCTTCCAAGTTCGAAGGTTCTAACGCTGCTTTTGCTCCTCCGAAGGTTCTAATGTTGTTCTGCTTCCTCACGTCGCTTCGCTGCAGTTTCTTCCTTCTGCAGT
TCCTTGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGCTACGATGTTCGACGGTCCTTCCTCTTATACTGCATTGCATCACTGTTCCTTCAACAAGTTCGAAGGTTCTTA
CGTTGCAACAAGAAAGGAGCCCAAGTGGGAAGGACATCATGTGTCCTTAGGCCGAGGGACGTCGTAGCAACAAAAGTCCAAGGAGCACGTCATGTCCTTGTACTCTTGCT
AAAAGGCATGGCGGTGACACAAGTCCAAGGAAAATGTCCTTATACTCATGCTGGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTTGTACTCATGCTAAGGGGCGTG
GCGGCGACATAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGCTGCGACAAAAGAAGAAAATCAATGTTCCACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
AATATCTGTCTTTGATCGCCTCAAAGTAACAGACGATCAACCTCAAAGAAAGATAGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAGCGACAATAAGCTTCTTA
GTAGCATCCTATCACATATGAAGAGGAAGTTTTCTGTTCTCATAAATACAGAAGGTTTCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGA
TCTGATCAAGACCATGATGAAGATAAGACTGTACGTTGCTGCGTTGTTCTTTCTCCAAGTTTGAGGTTTCTTACGTGGCACGTTACTGCATTTTTCCTTCTCACCAAGTT
TGAAGGTTCTCACGTTGCTACGCTGCCAGTCCATTCCTTCACCAAGTTCAAAGGTTCTCAGTTGTATGAATGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTATGT
TGTGCACTGCTGCATTGTTCCCTCTTCTCTCAAGTTCGAAGTTCCTTCTCTCAAGTTTGAAGGTTCTCATGCTGCTTGACTGCAATTCCTTCTCTCCAAGTTCGAAGATT
CTCATGCTCTTCGTTGTAGTTCCTTCTCTTCAAGTTCGAAGGTTCACACGTTGCGCTGCTTCTTCACCAAGTTCGAAGTTCCTTCTCCCAAGAGTTCACCCACTGTATGC
TGCTATTTCTTCCAAGTTCGAAGGTTCTAACGCTGCTTTTGCTCCTCCGAAGGTTCTAATGTTGTTCTGCTTCCTCACGTCGCTTCGCTGCAGTTTCTTCCTTCTGCAGT
TCCTTGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGCTACGATGTTCGACGGTCCTTCCTCTTATACTGCATTGCATCACTGTTCCTTCAACAAGTTCGAAGGTTCTTA
CGTTGCAACAAGAAAGGAGCCCAAGTGGGAAGGACATCATGTGTCCTTAGGCCGAGGGACGTCGTAGCAACAAAAGTCCAAGGAGCACGTCATGTCCTTGTACTCTTGCT
AAAAGGCATGGCGGTGACACAAGTCCAAGGAAAATGTCCTTATACTCATGCTGGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTTGTACTCATGCTAAGGGGCGTG
GCGGCGACATAAGTTGA
Protein sequenceShow/hide protein sequence
MSMAATKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSISVFDRLKVTDDQPQRKIDNLEVKLFDEVNSDNKLLSSILSHMKRKFSVLINTEGFLKVKPNLIILTNPTSQG
SDQDHDEDKTVRCCVVLSPSLRFLTWHVTAFFLLTKFEGSHVATLPVHSFTKFKGSQLYECYVVPPPSANDLMLCTAALFPLLSSSKFLLSSLKVLMLLDCNSFSPSSKI
LMLFVVVPSLQVRRFTRCAASSPSSKFLLPRVHPLYAAISSKFEGSNAAFAPPKVLMLFCFLTSLRCSFFLLQFLVPSLQVRRFSRYDVRRSFLLYCIASLFLQQVRRFL
RCNKKGAQVGRTSCVLRPRDVVATKVQGARHVLVLLLKGMAVTQVQGKCPYTHAGRGGDTSPRNMSCTHAKGRGGDIS