; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032090 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032090
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr11:24299662..24303003
RNA-Seq ExpressionLag0032090
SyntenyLag0032090
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051228.1 hypothetical protein E6C27_scaffold1250G00350 [Cucumis melo var. makuwa]3.8e-1560.76Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR
        +SM   EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS +PSR
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR

KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.8e-1852.68Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP
        +SM   +EEN+C  ST+TR SAF+RLS+STSKK +PST VFDRLK+T+DQ +R+M  L+ K F E N D K+H+ +PSR  +    D          +EP
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP

Query:  KLHDAPSPHELK
        KLH APSP ELK
Subjt:  KLHDAPSPHELK

KAA0065966.1 hypothetical protein E6C27_scaffold62G00430 [Cucumis melo var. makuwa]3.3e-1948.18Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP
        +SM   EEENQC  ST+TR SAF+RLS+STSKK +PSTS FDR K+T++Q +R++ +L+ KLF E N D K+HS +PSR  +    D         +++P
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP

Query:  KLHDAPSPHELKRFSRTSLQFLPPSSKVLKRFTAVPS
        KLH APSP ELK F   +   +  S KVL+  +  PS
Subjt:  KLHDAPSPHELKRFSRTSLQFLPPSSKVLKRFTAVPS

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-1562.03Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR
        +SM   EEENQC  ST+TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N D K+HS IPSR
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR

TYK18884.1 gag protease polyprotein [Cucumis melo var. makuwa]5.0e-1552.94Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP
        +SM   EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS +PSR  +    D     K SL  +P
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP

Query:  KL
        +L
Subjt:  KL

TrEMBL top hitse value%identityAlignment
A0A5A7U7F9 Uncharacterized protein1.8e-1560.76Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR
        +SM   EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS +PSR
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR

A0A5A7UM99 Ty3-gypsy retrotransposon protein2.3e-1852.68Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP
        +SM   +EEN+C  ST+TR SAF+RLS+STSKK +PST VFDRLK+T+DQ +R+M  L+ K F E N D K+H+ +PSR  +    D          +EP
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP

Query:  KLHDAPSPHELK
        KLH APSP ELK
Subjt:  KLHDAPSPHELK

A0A5A7VFA5 Ty3-gypsy retrotransposon protein1.8e-1562.03Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR
        +SM   EEENQC  ST+TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E N D K+HS IPSR
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSR

A0A5A7VHY3 Uncharacterized protein1.6e-1948.18Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP
        +SM   EEENQC  ST+TR SAF+RLS+STSKK +PSTS FDR K+T++Q +R++ +L+ KLF E N D K+HS +PSR  +    D         +++P
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP

Query:  KLHDAPSPHELKRFSRTSLQFLPPSSKVLKRFTAVPS
        KLH APSP ELK F   +   +  S KVL+  +  PS
Subjt:  KLHDAPSPHELKRFSRTSLQFLPPSSKVLKRFTAVPS

A0A5D3D5Q0 Gag protease polyprotein2.4e-1552.94Show/hide
Query:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP
        +SM   EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS +PSR  +    D     K SL  +P
Subjt:  MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEP

Query:  KL
        +L
Subjt:  KL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGTCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAAGTGAAACTTTTCGATGAAGTAAACAGCGACAAGAAGCTTCATA
GTAGCATCCCGTCACGATCTGATCAAGACCATGACAAAGATAAAGCTTTTAAATGTAAAAACTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAG
CTTAAAAGGTTCTCACGCACTTCACTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCAAGCGCTTCACTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCG
CTGCAGTTCCTTCTCCAATTTCGAAGTTCGAAGGTTCTCTCACGTCGCTTCGCTGCAGTTCCTTCTCCAAGTTCTCTCCAAGTTCGAAAGTTCTCTCACGTTGCTTCACT
GCAGTTCCTTGTTAGTTCCTTGTTTCTCCAAGTTCGAAGGTTCTCTCACGTTGCTTTGCTTCACTGCAGTTCCTTGTTTCTCCAAGTTCGAAGGTTTTCTCACGTCGCTT
CGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTTTCTCACGTCGCTCCAACAAAGTTCCTTCTCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTTTCTCACGTCGCTCCA
ACAAAGTTCCTTCTCTCCAAGTTATCTTCCCTCCAAGTTCGAAGGTTTTGAAGGTGGTTCTCACGTCGCTTCACTAGCAGAGTTCCTTCCTCACGTCGCTTCGCTAGCAG
TTCCTCCCTCCAAGTCCGAAGGCTTTGAAGGTTCTCACGTCGCTTCGCTAGCAGTTCCTTCCTCCAAGTTCAAGTTCGCTGTTGCAGTTTCTCCCGCGCTTCAAAGGCTC
TCATGGGAAATATGGGGATTCAGATTCGGAGATTCAGATCAGAGAACTCAGAGTCGAGAGAATTCTGTCAGAGTCCAGAGTCATAAGTCAGAGAGTCTAGAGAATTCAGA
ATATCCTAGATTCAGAATTCAACCAACTCAAGACTCAGAAGATCATCAAGCTAATCGACCGATCAAGAAGATCAACAAGTCAGCAGACCCATCATCCAAGAAGATCAACA
AGTCAGCAGACCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGTAGACCGATCATCCAAAAAGATCAACAAGTCACAACAGGCCGATCCAAGAGATCATCAA
GCCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCTAACAGGCCGATCCAAGAGATCTCAACCTAGCAAGCTGATCATCTA
A
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGTCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAAGTGAAACTTTTCGATGAAGTAAACAGCGACAAGAAGCTTCATA
GTAGCATCCCGTCACGATCTGATCAAGACCATGACAAAGATAAAGCTTTTAAATGTAAAAACTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAG
CTTAAAAGGTTCTCACGCACTTCACTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCAAGCGCTTCACTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCG
CTGCAGTTCCTTCTCCAATTTCGAAGTTCGAAGGTTCTCTCACGTCGCTTCGCTGCAGTTCCTTCTCCAAGTTCTCTCCAAGTTCGAAAGTTCTCTCACGTTGCTTCACT
GCAGTTCCTTGTTAGTTCCTTGTTTCTCCAAGTTCGAAGGTTCTCTCACGTTGCTTTGCTTCACTGCAGTTCCTTGTTTCTCCAAGTTCGAAGGTTTTCTCACGTCGCTT
CGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTTTCTCACGTCGCTCCAACAAAGTTCCTTCTCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTTTCTCACGTCGCTCCA
ACAAAGTTCCTTCTCTCCAAGTTATCTTCCCTCCAAGTTCGAAGGTTTTGAAGGTGGTTCTCACGTCGCTTCACTAGCAGAGTTCCTTCCTCACGTCGCTTCGCTAGCAG
TTCCTCCCTCCAAGTCCGAAGGCTTTGAAGGTTCTCACGTCGCTTCGCTAGCAGTTCCTTCCTCCAAGTTCAAGTTCGCTGTTGCAGTTTCTCCCGCGCTTCAAAGGCTC
TCATGGGAAATATGGGGATTCAGATTCGGAGATTCAGATCAGAGAACTCAGAGTCGAGAGAATTCTGTCAGAGTCCAGAGTCATAAGTCAGAGAGTCTAGAGAATTCAGA
ATATCCTAGATTCAGAATTCAACCAACTCAAGACTCAGAAGATCATCAAGCTAATCGACCGATCAAGAAGATCAACAAGTCAGCAGACCCATCATCCAAGAAGATCAACA
AGTCAGCAGACCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGTAGACCGATCATCCAAAAAGATCAACAAGTCACAACAGGCCGATCCAAGAGATCATCAA
GCCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCTAACAGGCCGATCCAAGAGATCTCAACCTAGCAAGCTGATCATCTA
A
Protein sequenceShow/hide protein sequence
MSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSSIPSRSDQDHDKDKAFKCKNSLSQEPKLHDAPSPHE
LKRFSRTSLQFLPPSSKVLKRFTAVPSSKFKGSHALRCSSFSNFEVRRFSHVASLQFLLQVLSKFESSLTLLHCSSLLVPCFSKFEGSLTLLCFTAVPCFSKFEGFLTSL
RCSSFPPSSKVFSRRSNKVPSLQFLPSKFEGFLTSLQQSSFSPSYLPSKFEGFEGGSHVASLAEFLPHVASLAVPPSKSEGFEGSHVASLAVPSSKFKFAVAVSPALQRL
SWEIWGFRFGDSDQRTQSRENSVRVQSHKSESLENSEYPRFRIQPTQDSEDHQANRPIKKINKSADPSSKKINKSADRSSNRSSEKINKSVDRSSKKINKSQQADPRDHQ
ASRPIIQEDQQANKPIQEIIKLTGRSKRSQPSKLII