; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000617 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000617
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:11521677..11526603
RNA-Seq ExpressionLag0000617
SyntenyLag0000617
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025725.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.5e-1962Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC  ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+ S +SSRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]2.7e-1961Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC +ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M + + K F E N D K+HS + SRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-1962Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC  ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS + SRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]1.6e-1962Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC  ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS + SRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

KAA0057874.1 gag protease polyprotein [Cucumis melo var. makuwa]1.6e-1963.27Show/hide
Query:  MVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        M   EEENQC  ST TR SAF+RLS+STSKK R STS FD LK+TNDQ +R+M  L+ K F E N D K+H+ +SSRMKRK SV IN EGSL V  RF
Subjt:  MVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

TrEMBL top hitse value%identityAlignment
A0A5A7SMQ5 Retrotransposon gag protein1.7e-1962Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC  ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+ S +SSRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

A0A5A7SZJ7 Retrotransposon gag protein1.3e-1961Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC +ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M + + K F E N D K+HS + SRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

A0A5A7TQ06 Retrotransposon gag protein7.7e-2062Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC  ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS + SRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

A0A5A7URU7 Gag protease polyprotein7.7e-2063.27Show/hide
Query:  MVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        M   EEENQC  ST TR SAF+RLS+STSKK R STS FD LK+TNDQ +R+M  L+ K F E N D K+H+ +SSRMKRK SV IN EGSL V  RF
Subjt:  MVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

A0A5D3BBF9 Gag protease polyprotein7.7e-2062Show/hide
Query:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF
        +SM   EEENQC  ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS + SRMKRK SV IN EGSL V  RF
Subjt:  MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGTCGCGACAGAGGAAGAAAATCAATGTTCGATATCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
AACATCTGTCTTTGATCACCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGCATCTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATGCGGAAGGTTCCTTGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGG
TTCTCACGTTGCTTCGCTGCAGTTTCCTTCCTCCAAGTACGAAGGCTTCCCCAAGTCGAGTCGAAGGCTCACACGTTGTTGTTTGCACGTGTTGTTGTTATGCTGTTGTT
CCTTCTCCAAAGTTCGAAGGTTCCCACGTTGTGCGTTGTCCGCTTCGCTGCAGTTCCTTCTCCAAGTTCTCCAAGTTCGAAGGTTCTCTCACGTTGCTTCGCTGCAGTTC
CTTCTCTCCAATCCGAAGGTTTGCACGTGTTAAAGGTTCCCACGTTGTGCGTTGTCGTTATGCTGCTTCACGCTGCTGTTCCTTCTCCAAAGTTTGAAGGGGTTCCCACG
TTGCGCGCGGTTGTGTTGCTTCCCTTCACCAAGTTCGAAGGTTCTGACGTTGCGCTGCTTCTGCTTCCTTCACCAAGTTCGAAGAAGGTTCTCATGCTGCGTTGTTTCGT
TGTTCCTTCTCCAAAGTTTGAAGGTCCTCATGCACTCTGCTACTGTTCCTTGCCCTCTCACGTGTTTCTTACGCTGCAGTTCCTTCCCCACAAGTTCGAATCACGCGCTT
CGCTCCTTCCTTCCTCCAAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAG
GTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACATGTGGTGAAATCACTGCAAGA
GAAGTTGATGGCGACCGTGGTGACCACCCTTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGG
TGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAGGTGAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAA
CTACAGTCATCAAAGAGGGATTCAGATCAGAGAACTCAGAGTCGAGAGAATTCTGTCAGAGTCCAGAGTCATCAGAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCTAG
ATTCAGAATTCAACCAACTCAAGACTCAGAAGATCAGCAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGGAGATCAATAAGCCAATCG
ACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACACGTTAGCAGGCCGATCATCCAAGAAGATC
AACAAGCCAACTGATCGAACAGATCATCAAACCAACAGGCCGATCAAGAAGATCAACAAGTCAGTAGGCCGATCATTCAAGAGATCAACAAGCCAACCGACCGATCAAGA
AGATAAACAAGTCAGCAAGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCTGA
TCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAATAGGTCGATCCAATAGATCATCAAGTCACCAGGTCGATCATCCAAGAGGATCAACA
AGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGTCGCGACAGAGGAAGAAAATCAATGTTCGATATCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
AACATCTGTCTTTGATCACCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGCATCTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATGCGGAAGGTTCCTTGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGG
TTCTCACGTTGCTTCGCTGCAGTTTCCTTCCTCCAAGTACGAAGGCTTCCCCAAGTCGAGTCGAAGGCTCACACGTTGTTGTTTGCACGTGTTGTTGTTATGCTGTTGTT
CCTTCTCCAAAGTTCGAAGGTTCCCACGTTGTGCGTTGTCCGCTTCGCTGCAGTTCCTTCTCCAAGTTCTCCAAGTTCGAAGGTTCTCTCACGTTGCTTCGCTGCAGTTC
CTTCTCTCCAATCCGAAGGTTTGCACGTGTTAAAGGTTCCCACGTTGTGCGTTGTCGTTATGCTGCTTCACGCTGCTGTTCCTTCTCCAAAGTTTGAAGGGGTTCCCACG
TTGCGCGCGGTTGTGTTGCTTCCCTTCACCAAGTTCGAAGGTTCTGACGTTGCGCTGCTTCTGCTTCCTTCACCAAGTTCGAAGAAGGTTCTCATGCTGCGTTGTTTCGT
TGTTCCTTCTCCAAAGTTTGAAGGTCCTCATGCACTCTGCTACTGTTCCTTGCCCTCTCACGTGTTTCTTACGCTGCAGTTCCTTCCCCACAAGTTCGAATCACGCGCTT
CGCTCCTTCCTTCCTCCAAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAG
GTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACATGTGGTGAAATCACTGCAAGA
GAAGTTGATGGCGACCGTGGTGACCACCCTTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGG
TGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAGGTGAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAA
CTACAGTCATCAAAGAGGGATTCAGATCAGAGAACTCAGAGTCGAGAGAATTCTGTCAGAGTCCAGAGTCATCAGAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCTAG
ATTCAGAATTCAACCAACTCAAGACTCAGAAGATCAGCAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGGAGATCAATAAGCCAATCG
ACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACACGTTAGCAGGCCGATCATCCAAGAAGATC
AACAAGCCAACTGATCGAACAGATCATCAAACCAACAGGCCGATCAAGAAGATCAACAAGTCAGTAGGCCGATCATTCAAGAGATCAACAAGCCAACCGACCGATCAAGA
AGATAAACAAGTCAGCAAGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCTGA
TCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAATAGGTCGATCCAATAGATCATCAAGTCACCAGGTCGATCATCCAAGAGGATCAACA
AGCTAA
Protein sequenceShow/hide protein sequence
MSMVATEEENQCSISTSTRPSAFQRLSVSTSKKSRSSTSVFDHLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINAEGSLKVLTRFAAVPSLQVRR
FSRCFAAVSFLQVRRLPQVESKAHTLLFARVVVMLLFLLQSSKVPTLCVVRFAAVPSPSSPSSKVLSRCFAAVPSLQSEGLHVLKVPTLCVVVMLLHAAVPSPKFEGVPT
LRAVVLLPFTKFEGSDVALLLLPSPSSKKVLMLRCFVVPSPKFEGPHALCYCSLPSHVFLTLQFLPHKFESRASLLPSSKVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQ
VVKSLQVKLMTTVVTTPAGNYSHQSDWSRHVVKSLQEKLMATVVTTLAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKADDDRGDHPCRK
LQSSKRDSDQRTQSRENSVRVQSHQKSESLENSEDPRFRIQPTQDSEDQQANRPIKKINKSAGRSSKEINKPIDRSRRSTSQQADHPRDQQANRPIKKINTLAGRSSKKI
NKPTDRTDHQTNRPIKKINKSVGRSFKRSTSQPTDQEDKQVSKPIIQEDQQANRSNRSSSQQADPRDHQVSRLIIQEDQQANKPIQQIIKPIGRSNRSSSHQVDHPRGST
S