; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021436 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021436
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr7:7680923..7685051
RNA-Seq ExpressionLag0021436
SyntenyLag0021436
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]6.0e-1360.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC MST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.0e-1260.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

KAA0051228.1 hypothetical protein E6C27_scaffold1250G00350 [Cucumis melo var. makuwa]1.0e-1260.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

KAA0066532.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.0e-1260.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

TYK18884.1 gag protease polyprotein [Cucumis melo var. makuwa]1.0e-1260.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein2.9e-1360.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC MST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

A0A5A7TQ06 Retrotransposon gag protein5.0e-1360.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

A0A5D3BBF9 Gag protease polyprotein5.0e-1360.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

A0A5D3D5Q0 Gag protease polyprotein5.0e-1360.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

A0A5D3DQU7 Retrotransposon gag protein5.0e-1360.26Show/hide
Query:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS
        +SMA  EEENQC  ST +R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N + ++HS VPS
Subjt:  MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCAGTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACCGCGAGAAGCAGCTTCATA
GTAGCGTCCCGTCACTTCCTTTTCTCCAAGTACGAAGGTGCTCCTTCTCCAAGTTCGAAGGCTCTTACATTGCTGCGATGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTC
ACGTTGCTTCGCCGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAATGTTCTCTCACGTTGCTTCGCTGCAGTTC
CTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCTTTCTCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCA
CACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACGCGCATCGCCACGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCG
CATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTT
CGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCA
CAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAG
TTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACGCCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACG
CGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACACGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCC
AAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAGTTCGAAGGTTCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTT
CACACGCGCATCGCCACAGTTCTTTCCTCCAAGTTTCGAAGGTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACAC
GCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCAAGTTTGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTT
CGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTCCCTCCAAGTT
CGAAGGTTCATACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCTTCGCCACA
GTTCCTTCCTCCAAGTTCGAAGGTTCACACGCTTCGTTCGAAGGTTCACACGCGCATATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCGAAG
GTTCCTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGGTTCAGAAATTCTACACTCCACAAAGACAAGAGTTTAGAGTTTCA
AAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCGGAAGACCGAAGACTCTCTGCAATCCACAAGTCCAAGTGTTGAACACTTCTTGAAGACCAAACA
CTCTTCAAGACTTCAACACTCCTTGAAGACCAAACACCTTCAAGACTCAACACTCCTTGAAGACCAACACCCTTCAAGACTTCAACACTCCTTGAAGATCAAATACTCTT
CAGGACATCAACACTTCTTGAAGACCAAACGCTCTTCAAGACCTCGACACTCCTTGAAGATCAAAAACTCTTCAGGACATCAACATTTCTTGAAGACCAAGCACTCTTCA
AGATTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGATATCAACACTTCTTGAAGACTGAAGACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAGAATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCAGTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACCGCGAGAAGCAGCTTCATA
GTAGCGTCCCGTCACTTCCTTTTCTCCAAGTACGAAGGTGCTCCTTCTCCAAGTTCGAAGGCTCTTACATTGCTGCGATGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTC
ACGTTGCTTCGCCGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAATGTTCTCTCACGTTGCTTCGCTGCAGTTC
CTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCTTTCTCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCA
CACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACGCGCATCGCCACGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCG
CATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTT
CGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCA
CAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAG
TTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACGCCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTTCACACG
CGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACACGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCC
AAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAATCCTTCCTACAGTTCGAAGGTTCTCCAAGTTCGAATCCTTCCTACAAGTTCGAAGGTT
CACACGCGCATCGCCACAGTTCTTTCCTCCAAGTTTCGAAGGTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACAC
GCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCCAAGTTTGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTT
CGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTCCCTCCAAGTT
CGAAGGTTCATACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCTTCGCCACA
GTTCCTTCCTCCAAGTTCGAAGGTTCACACGCTTCGTTCGAAGGTTCACACGCGCATATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCGAAG
GTTCCTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGGTTCAGAAATTCTACACTCCACAAAGACAAGAGTTTAGAGTTTCA
AAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCGGAAGACCGAAGACTCTCTGCAATCCACAAGTCCAAGTGTTGAACACTTCTTGAAGACCAAACA
CTCTTCAAGACTTCAACACTCCTTGAAGACCAAACACCTTCAAGACTCAACACTCCTTGAAGACCAACACCCTTCAAGACTTCAACACTCCTTGAAGATCAAATACTCTT
CAGGACATCAACACTTCTTGAAGACCAAACGCTCTTCAAGACCTCGACACTCCTTGAAGATCAAAAACTCTTCAGGACATCAACATTTCTTGAAGACCAAGCACTCTTCA
AGATTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGATATCAACACTTCTTGAAGACTGAAGACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAGAATCCATTGA
Protein sequenceShow/hide protein sequence
MSMAATEEENQCSMSTSSRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNREKQLHSSVPSLPFLQVRRCSFSKFEGSYIAAMQFLPPSSKVL
TLLRRSSFSPSTKVLSSKYEGSLLQVECSLTLLRCSSFLQVRRFSCALLQFFLSKFEGSHAHRHSSFLQVRRFTRASPQFLPPSSNPSYKFEGSHAHRHVPSSKFEGSHA
HRHSSFLQVRRFLLQVRRFTRASPQFLPPSSKVHTRFATVPSSKFESFLQVRRFTRASPQFLPPSSKVHTRIATVPSSKFEGSSSKFEGSHAHRHSSFLQVRRFTRASPQ
FLPPSSNPSYKFEGSHAIATVPSSKFESFLQVRRFTRASPQFLPPSSKVHTRIATVPSSKFEGSHAHRHSSFLQVRRFTRASPQFLPPSSNPSYSSKVLQVRILPTSSKV
HTRIATVLSSKFRRFLQVRRFTRASPQFLPPSSKVHTRIATVPSSKFEGSSKFEGSHAHRHSSFLQVRRFTRASPQFLPPSSKVHTRFATVPSSKFEGSHALRHSSSLQV
RRFIRASPQFLPPSSKVHTRIATVPSSKFEGSHASPQFLPPSSKVHTLRSKVHTRISPQFLPPSSKVHTRFAEGSSSKFEGSHAHRHSSFLQVRRVQKFYTPQRQEFRVS
KLSSRTREFRETPPSRKTEDSLQSTSPSVEHFLKTKHSSRLQHSLKTKHLQDSTLLEDQHPSRLQHSLKIKYSSGHQHFLKTKRSSRPRHSLKIKNSSGHQHFLKTKHSS
RFQHSLKIKDSSGYQHFLKTEDSFKTRRLQAPRIH