; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038449 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038449
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr2:17701729..17707290
RNA-Seq ExpressionLag0038449
SyntenyLag0038449
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]3.3e-2166.67Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.6e-2166.67Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]5.6e-2166.67Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.8e-2064.58Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K++S +PSR+KRK S+ INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

TYK16519.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-2065.62Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TN+Q KR+M + + K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein1.6e-2166.67Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5A7TQ06 Retrotransposon gag protein2.7e-2166.67Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5A7U974 Retrotransposon gag protein1.4e-2064.58Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K++S +PSR+KRK S+ INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5D3BBF9 Gag protease polyprotein2.7e-2166.67Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5D3D209 Retrotransposon gag protein7.9e-2165.62Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TN+Q KR+M + + K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
GACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTCGAA
GGTTCTGACGCTGCGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCCTTCTCTCCAAACTCGAAGGTGTTCTCACGCGCGCCGCTGCAGTT
CCTTCTCTCCAAGTTTGAAGGTTCTCTCAAGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCCTTCTCTCCAAGTTCGAAG
GTGTTCTCGCGCACTTTGCTGCCGTTCCTTCCTCTCAAATTCGAAGGTTCTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCCAAATTTGAAGGTTCTCACGACGCTCCGC
TGCAGTTCCTTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCGCTTCCGCTGCAGTTCATTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCC
GCTGCAGTTCCTTCGCTTTCGCTGCAATTCCTTCTCTCCGAGTTCGAAGGTTCTCACGACGTTTCGTTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTCC
GCTATAGTTCCTTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCATCTCTCCAAGTTCGAAGGTGTTCTCGCGCGCTTCGCTGCAGTTCCTTCCTCC
CAAATTCGAAGGTTCTCTCACGCGCTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCT
CCAAGTTCGAAGGTTCTCACGTTGCTTCGTCGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCTGAAGGTGCTCACGTGCTTCGGTAAAGTTCCTTCCT
CCCAAGTTCGAAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
CCCAAATTCGAAAGTTTCAAAGGCCCTCACGCGCTGCGCTTCGTTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCATGCGTTTCGATGCTACCTTCCTCCAAGTTCGAA
GGTTCTCTCACGCGCTGCTGCAGTTCCTGCCTCCAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTCCTTCT
CCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTTGAAGGTT
CCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAA
GGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGT
TGCTCCTTTTCAAATGTTTGGCGGAGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCTGCTGCGCTTCATCTTCAAATGTTGGCAGAAAC
TACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTC
AAATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAG
CAGGAGTGCATCGCTGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGACTGAAGACTCCTTCA
AGATTTGGAAGACTTCAAGCTCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCG
AACAGATCATCAAGCTAACCGACCGATCAAGAAGATCAACAACCGACAGGCAGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCC
GATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCA
ACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCG
ATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCC
AACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
GACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTCGAA
GGTTCTGACGCTGCGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCCTTCTCTCCAAACTCGAAGGTGTTCTCACGCGCGCCGCTGCAGTT
CCTTCTCTCCAAGTTTGAAGGTTCTCTCAAGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCCTTCTCTCCAAGTTCGAAG
GTGTTCTCGCGCACTTTGCTGCCGTTCCTTCCTCTCAAATTCGAAGGTTCTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCCAAATTTGAAGGTTCTCACGACGCTCCGC
TGCAGTTCCTTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCGCTTCCGCTGCAGTTCATTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCC
GCTGCAGTTCCTTCGCTTTCGCTGCAATTCCTTCTCTCCGAGTTCGAAGGTTCTCACGACGTTTCGTTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTCC
GCTATAGTTCCTTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCATCTCTCCAAGTTCGAAGGTGTTCTCGCGCGCTTCGCTGCAGTTCCTTCCTCC
CAAATTCGAAGGTTCTCTCACGCGCTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCT
CCAAGTTCGAAGGTTCTCACGTTGCTTCGTCGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCTGAAGGTGCTCACGTGCTTCGGTAAAGTTCCTTCCT
CCCAAGTTCGAAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
CCCAAATTCGAAAGTTTCAAAGGCCCTCACGCGCTGCGCTTCGTTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCATGCGTTTCGATGCTACCTTCCTCCAAGTTCGAA
GGTTCTCTCACGCGCTGCTGCAGTTCCTGCCTCCAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTCCTTCT
CCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTTGAAGGTT
CCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAA
GGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGT
TGCTCCTTTTCAAATGTTTGGCGGAGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCTGCTGCGCTTCATCTTCAAATGTTGGCAGAAAC
TACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTC
AAATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAG
CAGGAGTGCATCGCTGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGACTGAAGACTCCTTCA
AGATTTGGAAGACTTCAAGCTCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCG
AACAGATCATCAAGCTAACCGACCGATCAAGAAGATCAACAACCGACAGGCAGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCC
GATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCA
ACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCG
ATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCC
AACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGA
Protein sequenceShow/hide protein sequence
MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKVPTLRCCAASFSKFE
GSDAALLPSSKFEGFHALCCSSFSPNSKVFSRAPLQFLLSKFEGSLKLLRCSSFLQVRRFSCALLQFLLSKFEGVLAHFAAVPSSQIRRFSHALRCSSFLPNLKVLTTLR
CSSFSLQIRRFSRAPLQFLRFRCSSFSLQIRRFSRAPLQFLRFRCNSFSPSSKVLTTFRCSSFLPNSKVLTTLRYSSFSPNLKVFSRAPLQFHLSKFEGVLARFAAVPSS
QIRRFSHALRYSSFSPSMKVLSSKSKVLTLLHCSSFLQVRRFSRCFVVVPSLQVRRFSPPSLKVLTCFGKVPSSQVRSFFSLSSKVLTRFAAVPSSLSSKVLTRFAAVPS
PKFESFKGPHALRFVAVPSSKFEGSHAFRCYLPPSSKVLSRAAAVPASKFEGSLTRFARSFSKFEGASLRCSFSKFEGASLYCSFSKFEGASLRCYLPPSSKFLPPSLKV
PSRASLAPSPSSKALLSVATSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAEVDVLVPLHLQMLVVDGVCCASSSNVGRN
YSHQSDWSRQVVKSLQLNLMTTVEGESGLVTTPAGYSNHPIKWGLGLAGVHEANLVTTPAGYSDHPIKWGLGLAGVHRCYSDHPIKWGLGLAGVHEGESGDYPCRLKTPS
RFGRLQAPRDQQANRPIKKINKSAGRSSKKINKPTDRTDHQANRPIKKINNRQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRPIIQKINKSAGRSSKRS
TSQPTDQEDQQVSRPIIQEINKPTGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRPIIQEDQQANKPIQQIIKPTG