CuGenDBv2

Lag0017628 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017628
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr5:6123730..6125496
RNA-Seq Expression: Lag0017628
Synteny: Lag0017628
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0008233 - peptidase activity (molecular function)
InterPro domains: NA
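
The genome location field can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr5:6123730..6125496")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr5 6123730 6125496 1767
```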


Homology
GenBank top hits
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]  e value: 9.5e-17  %identity: 67.9
Query:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]  e value: 1.2e-16  %identity: 68.75
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]  e value: 1.2e-16  %identity: 68.75
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 6.6e-18  %identity: 50.76
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK R ST VFDRLK+TNDQ +R+M  L+ K F E N D K+H+R+PSRMKRK SV IN E                       
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELKRLRCSSFLQVRRFSHAS
         EPKLH APSP ELK    +S  + R +  AS
Subjt:  QEPKLHDAPSPHELKRLRCSSFLQVRRFSHAS

KAA0065966.1 hypothetical protein E6C27_scaffold62G00430 [Cucumis melo var. makuwa]  e value: 1.9e-17  %identity: 53.91
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK R STS FDR K+TN+Q +R++ +L+ KLF E N D K+HSR+PSRMKRK SV INTE                       
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELK
         +PKLH APSP ELK
Subjt:  QEPKLHDAPSPHELK

TrEMBL top hits
A0A5A7SZJ7 Retrotransposon gag protein  e value: 4.6e-17  %identity: 67.9
Query:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5A7TQ06 Retrotransposon gag protein  e value: 6.0e-17  %identity: 68.75
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5A7UM99 Ty3-gypsy retrotransposon protein  e value: 3.2e-18  %identity: 50.76
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK R ST VFDRLK+TNDQ +R+M  L+ K F E N D K+H+R+PSRMKRK SV IN E                       
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELKRLRCSSFLQVRRFSHAS
         EPKLH APSP ELK    +S  + R +  AS
Subjt:  QEPKLHDAPSPHELKRLRCSSFLQVRRFSHAS

A0A5A7VHY3 Uncharacterized protein  e value: 9.3e-18  %identity: 53.91
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK R STS FDR K+TN+Q +R++ +L+ KLF E N D K+HSR+PSRMKRK SV INTE                       
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELK
         +PKLH APSP ELK
Subjt:  QEPKLHDAPSPHELK

A0A5D3BBF9 Gag protease polyprotein  e value: 6.0e-17  %identity: 68.75
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCT
AGCCCACACGAGCTTAAAAGGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTGCAGTTCATTCCTCAAAGTTCGAAGGTTCTCACGCGCTT
CGCTGCAGTTCCTTCCTCACAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGT
TCGAAGGTTCTCACGCGCTTCGTGTAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTG
CAGTTCCTTCCCCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAG
GTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAATT
CATTCCTCAACGCGCTTCGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTTCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCGCTTT
ACAGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCGCTTCGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCC
CACGTTGCGCTATTGTGTTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTACGGTGCTTCCTTCATCAAGTTCGAAGTTCCTTCTCTCCAATTTCGAAGGATCTCGCA
CATTTCGCTGCAGTTCCTTCTCTCCAAATTTGAAGTTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTTCTCCAAGTTCAAAGGGGTTCTCATG
CAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCATTGTAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGTTG
CAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCCAAGTCGAGTCGAAGGCTCACATGTTGCTTCGCTGTAG
mRNA sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCT
AGCCCACACGAGCTTAAAAGGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTGCAGTTCATTCCTCAAAGTTCGAAGGTTCTCACGCGCTT
CGCTGCAGTTCCTTCCTCACAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGT
TCGAAGGTTCTCACGCGCTTCGTGTAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTG
CAGTTCCTTCCCCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAG
GTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAATT
CATTCCTCAACGCGCTTCGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTTCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCGCTTT
ACAGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCGCTTCGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCC
CACGTTGCGCTATTGTGTTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTACGGTGCTTCCTTCATCAAGTTCGAAGTTCCTTCTCTCCAATTTCGAAGGATCTCGCA
CATTTCGCTGCAGTTCCTTCTCTCCAAATTTGAAGTTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTTCTCCAAGTTCAAAGGGGTTCTCATG
CAACGCCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCATTGTAGTTCCTTCTCTCAGAGTTCAAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGTTG
CAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCCAAGTCGAGTCGAAGGCTCACATGTTGCTTCGCTGTAG
Protein sequence
MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAP
SPHELKRLRCSSFLQVRRFSHASLQFIPQSSKVLTRFAAVPSSQFEGSHALRCSSFPQIRRFSRASLCNSFPKFEGSHALRVVPSSKFEGSHTLRCISFPQIRRFSRASL
QFLPPSSKVLMRFAAVPSSKFKGSHVLRCSSFLQVRRFSRASLQFLPPSSKVLTCFAAVPSSKFEGSHALRCNSFLNALRCSSFLQVRRFSRASPQFLPPSLKVLTSLRF
TAFLPPSSKVLTSLRSSFLQVRRFSSPQFLPPSSKVPTLRYCVASFSKFEGSDATVLPSSSSKFLLSNFEGSRTFRCSSFSPNLKFLLQVRRFSRCFAAVLFSKFKGVLM
QRLPPSSKDLTHFIVVPSLRVQSSFSPSSKVLTLLRCSFLPPSPKAPPSRVEGSHVASL
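
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the function name `translate` is hypothetical. The example translates only the first 36 nucleotides of the CDS listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS above.
print(translate("ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGG"))  # MSTSTRPSAFQR
```

Passing the full CDS, with or without its line breaks, should reproduce the protein sequence listed above if the standard code applies.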