; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022814 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022814
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr7:38722327..38728075
RNA-Seq ExpressionLag0022814
SyntenyLag0022814
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]1.4e-2163.81Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ  MST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK  V INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.4e-2163.81Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ   ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  V INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]2.4e-2163.81Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ   ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  V INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

TYK00108.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.4e-2162.86Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EE+NQ   ST TR SAF+RLS+STSKK R STS FD +K+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK FV INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

TYK18884.1 gag protease polyprotein [Cucumis melo var. makuwa]5.3e-2162.86Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ   ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  + INT+GSL VKP L
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein6.7e-2263.81Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ  MST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK  V INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

A0A5A7TQ06 Retrotransposon gag protein1.2e-2163.81Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ   ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  V INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

A0A5D3BBF9 Gag protease polyprotein1.2e-2163.81Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ   ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  V INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

A0A5D3BLW3 Retrotransposon gag protein6.7e-2262.86Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EE+NQ   ST TR SAF+RLS+STSKK R STS FD +K+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK FV INTEGSL VKP  
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

A0A5D3D5Q0 Gag protease polyprotein2.6e-2162.86Show/hide
Query:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL
        +SMA  EEENQ   ST  R SAF+RLS+STSKK R STS FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  + INT+GSL VKP L
Subjt:  MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNL

Query:  IILTN
        II TN
Subjt:  IILTN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAACGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
AACATCTGTCTTTGATAGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGCATCCCGTCACGTATGAAGAGGAAGTTTTTTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAATCTCATTATCTTGACCAATCTTGCAATGAAGGAT
CTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAAAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTTCTTCGT
TGTATCCTGTTGCGTTGTTCCTTCTCCAAGTTCGAGGTTCTCAGTTGTACGACTACTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGCTGTTGCATTG
TTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTCCTTCTCCCCAAGTTCGAAGGTTCTCACGCCGCTTCGCTGCAGTTCTTTCCTCCAAGTTTG
AAGGTCCTCACGCCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCACACGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGTTTCGCTG
CAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCACTTCGTTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCTGTTCCTTCCTCCAAGTTCGAGGGT
TCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTCACGTTTCACACATCGCTTCGCTG
CGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCAACAGTTCATTCTCTCCAAATTCGAGGGTGCTTACGTTGTACACTACTGGGTTGTTCCTTCTCCAAGTTT
GAGGGTTCTCCGATGCACGCTCCTGCGTTGCTTCGCTGGAGTTCATTTTCTTTTTTTCAAGTTCGAAGATTTTCATGTTGTGTGGTTTCACTTGTGTCCTTCTTTAAGTT
CGAAGGTTCTCAAGTGTTACACTTCCTTCTTTAAGTTCAAAGGATCCCACGTTGCGCTGTTGTGTTGCTTCCTTCTCCAAGTTCAAAGGTTCACGCACTTCGCTGCAGTC
CCTTCTCTCAAGTTTGAAGGTTTTCACGCTGCTTCACTGCAGTTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGT
TCTCATGCAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTGTTGCAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCT
TTGTTGCAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCAAGTCGAGTCGAAGGCTCACACGTTGCTTCGTTGTAGTTTCCTTCCTCCAAGTTCGAAAGTTCTCACGTTGC
ATCGTTGTAGTCCCTTCTCTCAAGTTTGAAAGTTCTCACACTGCTTCGAAGGTTCTCACGCGCTTCGTTACAGTTCCTTCAGGTCCTTACATTGAGTGCATCACTGAAGG
CGAATCTGGTGACTACCCCTTCAGGCTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGTAGGAGTGCATCACTGTAGGCAAATCTGGCGACTACTCCTGCAGACAG
GTGGTGAAATCACTGCAAGTGAAGCTGATGAGACCGTGGTGACCACCCCTACAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTG
AAAGCTGATGACGACCGTGGTGGTGAAATCACTGCAAGTGAAGCTGATGAGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGAAACTACAGTCATCAAAG
TGACTGGTCTAAACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCATGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGG
TGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACATTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAA
GCTGATGACGACCGTGGTGACCACCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGAC
CACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGGATTGAGATTGAGATCAGAGAACTCAGAGTCCAGAGAATTCAGCCAGAGTCCAGAGTCATCAG
AAGTCAGAGAGTCTAGAGAATTTAGAAGATCCAAGATTCAGAATTCAACCAACTCAAGACTCAGAAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCCGATCC
AAGAGATCATTAAGCCAGCAGGCCGATCATCCAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGCCGATCATCCAAGAAGCTCAACAAACCAGCCCAAGAAGA
TCAAGAAGCTAGAGACCAAAGTTTATATATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAACGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTC
AACATCTGTCTTTGATAGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGCATCCCGTCACGTATGAAGAGGAAGTTTTTTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAATCTCATTATCTTGACCAATCTTGCAATGAAGGAT
CTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAAAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTTCTTCGT
TGTATCCTGTTGCGTTGTTCCTTCTCCAAGTTCGAGGTTCTCAGTTGTACGACTACTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGCTGTTGCATTG
TTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTCCTTCTCCCCAAGTTCGAAGGTTCTCACGCCGCTTCGCTGCAGTTCTTTCCTCCAAGTTTG
AAGGTCCTCACGCCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCACACGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGTTTCGCTG
CAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCACTTCGTTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCTGTTCCTTCCTCCAAGTTCGAGGGT
TCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTCACGTTTCACACATCGCTTCGCTG
CGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCAACAGTTCATTCTCTCCAAATTCGAGGGTGCTTACGTTGTACACTACTGGGTTGTTCCTTCTCCAAGTTT
GAGGGTTCTCCGATGCACGCTCCTGCGTTGCTTCGCTGGAGTTCATTTTCTTTTTTTCAAGTTCGAAGATTTTCATGTTGTGTGGTTTCACTTGTGTCCTTCTTTAAGTT
CGAAGGTTCTCAAGTGTTACACTTCCTTCTTTAAGTTCAAAGGATCCCACGTTGCGCTGTTGTGTTGCTTCCTTCTCCAAGTTCAAAGGTTCACGCACTTCGCTGCAGTC
CCTTCTCTCAAGTTTGAAGGTTTTCACGCTGCTTCACTGCAGTTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGT
TCTCATGCAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTGTTGCAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCT
TTGTTGCAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCAAGTCGAGTCGAAGGCTCACACGTTGCTTCGTTGTAGTTTCCTTCCTCCAAGTTCGAAAGTTCTCACGTTGC
ATCGTTGTAGTCCCTTCTCTCAAGTTTGAAAGTTCTCACACTGCTTCGAAGGTTCTCACGCGCTTCGTTACAGTTCCTTCAGGTCCTTACATTGAGTGCATCACTGAAGG
CGAATCTGGTGACTACCCCTTCAGGCTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGTAGGAGTGCATCACTGTAGGCAAATCTGGCGACTACTCCTGCAGACAG
GTGGTGAAATCACTGCAAGTGAAGCTGATGAGACCGTGGTGACCACCCCTACAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTG
AAAGCTGATGACGACCGTGGTGGTGAAATCACTGCAAGTGAAGCTGATGAGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGAAACTACAGTCATCAAAG
TGACTGGTCTAAACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCATGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGG
TGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACATTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAA
GCTGATGACGACCGTGGTGACCACCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGAC
CACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGGATTGAGATTGAGATCAGAGAACTCAGAGTCCAGAGAATTCAGCCAGAGTCCAGAGTCATCAG
AAGTCAGAGAGTCTAGAGAATTTAGAAGATCCAAGATTCAGAATTCAACCAACTCAAGACTCAGAAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCCGATCC
AAGAGATCATTAAGCCAGCAGGCCGATCATCCAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGCCGATCATCCAAGAAGCTCAACAAACCAGCCCAAGAAGA
TCAAGAAGCTAGAGACCAAAGTTTATATATTTAA
Protein sequenceShow/hide protein sequence
MSMAATEEENQRSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDSLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFFVLINTEGSLKVKPNLIILTNLAMKD
LIKTMTKIRAFKCKSSLSQKPKLHDAPSPHELKRFFVVSCCVVPSPSSRFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAASLEFLLPKFEGSHAASLQFFPPSL
KVLTPLRCSSFSPSLKVLTRFAAVPSSKFEGSHAFRCSSSSKFEGSHALRCSSSSKFEGSHALRCCSFLQVRGFSRASLQFLPPSSKVLTCFAAVPSSKFEGHVSHIASL
RSFLQVRRFSCASQQFILSKFEGAYVVHYWVVPSPSLRVLRCTLLRCFAGVHFLFFKFEDFHVVWFHLCPSLSSKVLKCYTSFFKFKGSHVALLCCFLLQVQRFTHFAAV
PSLKFEGFHAASLQFLLQVRRFSRCFAAVLSSKFKGVLMQRLPPSSKDLTHLLQFLLSEFKVPSLQVRRFSRCFVAVSFLQVRRLPKSSRRLTRCFVVVSFLQVRKFSRC
IVVVPSLKFESSHTASKVLTRFVTVPSGPYIECITEGESGDYPFRLLRSPNKMGTGLVGVHHCRQIWRLLLQTGGEITASEADETVVTTPTGNYSHQSDWSRQVVKSLQV
KADDDRGGEITASEADETVVTTPAGNYSHQRNYSHQSDWSKQVVKSLQVKLMTTMVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYIHQSDWSRQVVKSLQVK
ADDDRGDHPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQGLRLRSENSESREFSQSPESSEVRESREFRRSKIQNSTNSRLRRPIIQEDQQVTTGRS
KRSLSQQADHPANRPIQEIINLASRSSKKLNKPAQEDQEARDQSLYI