; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008862 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008862
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr9:31268908..31272744
RNA-Seq ExpressionLag0008862
SyntenyLag0008862
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.8e-0864.81Show/hide
Query:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FD LK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]9.8e-0864.81Show/hide
Query:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0051997.1 hypothetical protein E6C27_scaffold60G004810 [Cucumis melo var. makuwa]2.8e-1043.52Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK--FEGSSLYPVALF-L
        M +++LH  FSF K K    ++       FD LK+T DQ +R+M +L++K F E N D K+HS +PSRMKRK SV INTEG+    F+      + LF +
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK--FEGSSLYPVALF-L

Query:  LQVRGSHC
        LQ R   C
Subjt:  LQVRGSHC

KAA0060302.1 gag protease polyprotein [Cucumis melo var. makuwa]8.9e-0966.67Show/hide
Query:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FDCLK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

TYK30948.1 hypothetical protein E5676_scaffold455G001730 [Cucumis melo var. makuwa]5.2e-0952.56Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN
        M +V+ HS FSF KAK LHIEE       FD LK+TNDQ +R+M  L+ K F E N+D K++S + S MKRK  V IN
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN

TrEMBL top hitse value%identityAlignment
A0A5A7UC34 Uncharacterized protein1.3e-1043.52Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK--FEGSSLYPVALF-L
        M +++LH  FSF K K    ++       FD LK+T DQ +R+M +L++K F E N D K+HS +PSRMKRK SV INTEG+    F+      + LF +
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK--FEGSSLYPVALF-L

Query:  LQVRGSHC
        LQ R   C
Subjt:  LQVRGSHC

A0A5A7UWS9 Gag protease polyprotein4.3e-0966.67Show/hide
Query:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FDCLK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7VRG7 Retrotransposon gag protein4.8e-0864.81Show/hide
Query:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein4.8e-0864.81Show/hide
Query:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FD LK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5D3E580 Uncharacterized protein2.5e-0952.56Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN
        M +V+ HS FSF KAK LHIEE       FD LK+TNDQ +R+M  L+ K F E N+D K++S + S MKRK  V IN
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGATGTCCACCTCCACTCGACCTTTAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTTTTGATTGCCTCAAAGTAACAAA
CGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTT
CTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGTTGCGTTGTTCCTTCTCCAAGTTCGAGGTTCTCATTGTACGACTGCTACGTTG
TTCCTCCTCCAAGTGCGAAGATCTTATGTGGTGCGATGTTGCATTGTTCCCTCTTCTTTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTCCTTCTCCCAAGTTC
GAAGGTTCTCACGCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTGAAGGTCTCACGCACTTCGCTGCAGTTCCTTCTCTCCAAGTTGAAGGTTCTCACGCGCTTCGTTGCA
GTTCCTTCTCCCCAAGTTTGAAGGTTCACGCACTTCGTTGCAGTTCTATCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTC
TCACACATTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCCTTCCTCCAAGATCGAAGGTTCTCACGCCGCTTCGCTGCAGTTCC
TTCCTCCAAGTTCGAAGGTCCTCACGCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAGGGTTCTCACGCACTTTACTGCGCGCTTCGCCGCAGTTCCTTCCTCCAAGT
TCGAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACACGCTTCGCTGCAATTCCTCCTCCAAGTTCGAGGGTTCTCACACGCTTCGCTGCA
GTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGT
CACGTTTCACACGTCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTTATGTGCTTCGCAACAGTTCATTCTCTCAAATTCGAGGGTGCTCACGTTGTACACTAC
TGGGTTGTTCCTTCTCCAAGTTTGAGGGTTCTCCGATGCACGCTCCTGCGTTCTCAAGTGTTACACTTTCTTCTTTAAGTTCGAAGGTTCCCACGTTGCGCTGTTGTGTT
GCTTCTTCTCCAAGTTCAAAGTTCCTTCTCTCCAAGTTTAAAGGTTCACGCACTTCGCTGCAGTCCCTTCTCTCAAGTTTGAAGGTTTTCACGCTGCTTCGCTGCAGTTC
TTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAGGGGTTCTCATGCAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCG
TTGCAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCAACAGTTCATTCTCTCCAAATTCGAGGGTGCTTACGTTGTACA
CTACTGGGTTGTTCCTTCTCCAAGTTTGAGGGTTCTCCGATGCACGCTCCTGCGTTCTCAAGTGTTACACTTCCTTCTTTAAGTTCGAAGGTTCCCACGTTGCGCTGTTG
TGTTGCTTCCTTCTCCAAGTTCAAAGGTTCTAACGCTGCGGTGCTTCCTTCATCAAGTTCGAAGTTCCTTATCTCCAATTTCGAAGGATCTCGCACATTTCGCTTTCCTT
CTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGTTCTCATGCAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCGT
TGCAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTTGTTGCAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCCAAGTCGAG
TCGAAGGCTCACACGTTGCTTCGCTGTAGTTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCATCGCTGCAGTCCCTTCTCTCAAGTTTGAAAGTTCTCACACTGCTT
CGAAGGTTCTCACGCGCTTCGCTACAGTTCCTTCAGGTCCTTACATTGAGTGCATCACTGAAGGCGAATCTGGTGACTACCCCTGCAGGCTACTCAGATCACCCAATAAA
ATGGGGACTGGTCTAGTAGGAGTGCATCACTGTAGGCAAATCTGGCGACTACTCCTGCAGGTTACTCAGATCACCCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGATGTCCACCTCCACTCGACCTTTAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTTTTGATTGCCTCAAAGTAACAAA
CGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTT
CTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGTTGCGTTGTTCCTTCTCCAAGTTCGAGGTTCTCATTGTACGACTGCTACGTTG
TTCCTCCTCCAAGTGCGAAGATCTTATGTGGTGCGATGTTGCATTGTTCCCTCTTCTTTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTCCTTCTCCCAAGTTC
GAAGGTTCTCACGCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTGAAGGTCTCACGCACTTCGCTGCAGTTCCTTCTCTCCAAGTTGAAGGTTCTCACGCGCTTCGTTGCA
GTTCCTTCTCCCCAAGTTTGAAGGTTCACGCACTTCGTTGCAGTTCTATCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTC
TCACACATTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCCTTCCTCCAAGATCGAAGGTTCTCACGCCGCTTCGCTGCAGTTCC
TTCCTCCAAGTTCGAAGGTCCTCACGCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAGGGTTCTCACGCACTTTACTGCGCGCTTCGCCGCAGTTCCTTCCTCCAAGT
TCGAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACACGCTTCGCTGCAATTCCTCCTCCAAGTTCGAGGGTTCTCACACGCTTCGCTGCA
GTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGT
CACGTTTCACACGTCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTTATGTGCTTCGCAACAGTTCATTCTCTCAAATTCGAGGGTGCTCACGTTGTACACTAC
TGGGTTGTTCCTTCTCCAAGTTTGAGGGTTCTCCGATGCACGCTCCTGCGTTCTCAAGTGTTACACTTTCTTCTTTAAGTTCGAAGGTTCCCACGTTGCGCTGTTGTGTT
GCTTCTTCTCCAAGTTCAAAGTTCCTTCTCTCCAAGTTTAAAGGTTCACGCACTTCGCTGCAGTCCCTTCTCTCAAGTTTGAAGGTTTTCACGCTGCTTCGCTGCAGTTC
TTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAGGGGTTCTCATGCAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCG
TTGCAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCAACAGTTCATTCTCTCCAAATTCGAGGGTGCTTACGTTGTACA
CTACTGGGTTGTTCCTTCTCCAAGTTTGAGGGTTCTCCGATGCACGCTCCTGCGTTCTCAAGTGTTACACTTCCTTCTTTAAGTTCGAAGGTTCCCACGTTGCGCTGTTG
TGTTGCTTCCTTCTCCAAGTTCAAAGGTTCTAACGCTGCGGTGCTTCCTTCATCAAGTTCGAAGTTCCTTATCTCCAATTTCGAAGGATCTCGCACATTTCGCTTTCCTT
CTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGTTCTCATGCAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCGT
TGCAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTTGTTGCAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCCAAGTCGAG
TCGAAGGCTCACACGTTGCTTCGCTGTAGTTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCATCGCTGCAGTCCCTTCTCTCAAGTTTGAAAGTTCTCACACTGCTT
CGAAGGTTCTCACGCGCTTCGCTACAGTTCCTTCAGGTCCTTACATTGAGTGCATCACTGAAGGCGAATCTGGTGACTACCCCTGCAGGCTACTCAGATCACCCAATAAA
ATGGGGACTGGTCTAGTAGGAGTGCATCACTGTAGGCAAATCTGGCGACTACTCCTGCAGGTTACTCAGATCACCCAATAA
Protein sequenceShow/hide protein sequence
MFDVHLHSTFSFPKAKCLHIEEKSIFNICFDCLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPVALFLLQVRGSHCTTATL
FLLQVRRSYVVRCCIVPSSFKFDGSHAASLEFLLPSSKVLTPLRCSSFLQVEGLTHFAAVPSLQVEGSHALRCSSFSPSLKVHALRCSSISPSSKVHALRCSSFSQIRRF
SHISLQFLPPSSKVLTSLRSSSFLQDRRFSRRFAAVPSSKFEGPHAASLQFLPPSLRVLTHFTARFAAVPSSKFEVLTRFAAVPSSKFEVLTRFAAIPPPSSRVLTRFAA
VPSSKFEGSHALRCSSFLQVRRFSRVSLQFLPPSSKVTFHTSLRCVPSSKFEGSYVLRNSSFSQIRGCSRCTLLGCSFSKFEGSPMHAPAFSSVTLSSLSSKVPTLRCCV
ASSPSSKFLLSKFKGSRTSLQSLLSSLKVFTLLRCSSSPSSKVLTLLRCSSFLQVQGVLMQRLPPSSKDLTHFVAVPSLRVQSSFSPSSKVLMRFATVHSLQIRGCLRCT
LLGCSFSKFEGSPMHAPAFSSVTLPSLSSKVPTLRCCVASFSKFKGSNAAVLPSSSSKFLISNFEGSRTFRFPSPSSKVLTLLRCSSFLQVQRGSHATPSSKFEGSHAFR
CSSFSQSSKFLLSKFEGSHVALLQFPSSKSEGSPKSSRRLTRCFAVVSFLQVRRFSRCIAAVPSLKFESSHTASKVLTRFATVPSGPYIECITEGESGDYPCRLLRSPNK
MGTGLVGVHHCRQIWRLLLQVTQITQ