; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038802 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038802
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr2:27178580..27180923
RNA-Seq ExpressionLag0038802
SyntenyLag0038802
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]2.5e-1962.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC MST  R SAF+RLS+STSKK RPSTS FDRLK+T+D+ +R+M + + K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.3e-1962.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK RPSTS FDRLK+T+D+ +R+M +L+ K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]4.3e-1962.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK RPSTS FDRLK+T+D+ +R+M +L+ K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.6e-1961.7Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC MST TR SAF+RLS+S SKK RPSTS FDRLK+T+D+ +R+M +L+ K + E N D K+ S +PSR+KRK S+ INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

TYK02797.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.2e-1862.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK RPSTS FDRLK+T+DR +R+M +L+ K + E N D K+ S +PSRMKRK SV INTE  L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein1.2e-1962.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC MST  R SAF+RLS+STSKK RPSTS FDRLK+T+D+ +R+M + + K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

A0A5A7TQ06 Retrotransposon gag protein2.1e-1962.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK RPSTS FDRLK+T+D+ +R+M +L+ K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

A0A5A7U974 Retrotransposon gag protein2.7e-1961.7Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC MST TR SAF+RLS+S SKK RPSTS FDRLK+T+D+ +R+M +L+ K + E N D K+ S +PSR+KRK S+ INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

A0A5D3BBF9 Gag protease polyprotein2.1e-1962.77Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK RPSTS FDRLK+T+D+ +R+M +L+ K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

A0A5D3D209 Retrotransposon gag protein6.0e-1961.7Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK RPSTS FDRLK+T+++ KR+M + + K + E N D K+ S +PSRMKRK SV INTEG L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCAACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGACCTTC
AACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCGACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTACGATGAAGTAAACAGCGACAAGAAACTTCAAA
GTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTTCTTGAAGTGGGGGCAACACAGCGAATGGAAGTTCATTCCTCCAAGTTTGAAG
GTTCACGCACTTCACTGTAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCACGCGCTTCGTTGCAGTTCC
TTCTCCCAAATTCGAAGGTCTCACGGGTTTCACTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCACACACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGT
GCTTCGCTGCAATTCCTTCCTCCAAGTTCGAAGGTTTTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCACTGCAGTTCCTTCCTCCA
AGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCATGTTCGAAGGTTTTCACACGCTTAGTTCGTTCTTCCAAGTTCAAAGGTTCTCACGTGTTTCGCTGCA
GTTCCTTCCTCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTC
ACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTAGAAGGTTCACGCACT
TTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTAACGTGTTTCGCTGCAGTTCCTTCCTCCAAGT
TCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCATTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTGAAGGTTCTCACGC
GCTTCATTGCAGTTCCTTCCTCCAGGTTTGAAGGTTCTCACGCGCTTCATTGCAGTTCGTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCATTGCAGTTCCTTCCTCCA
AGTTCGAAGGTCCTCACATGCTTCGCTGAGGTTCTCACGCGCTTCATTGCAGTTCCTTCATTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCATTGCAGT
TCCTTCCTCCAAGTTCGAATGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTGCAGACAGTTCCTTCCTCCAAGTTCGAAGGTTC
ACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGTGCTTCGCTGCAGTTCCTACC
TTCGAACTGTTCCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCTCACGCGCTTCATTGCAGAACTGCAGAAGGTTCTCCG
CTGCTGCCCAAGTTCGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCACAAGTTCGAAGGCTCTCATGGAAAATATGTATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCAACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGACCTTC
AACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCGACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTACGATGAAGTAAACAGCGACAAGAAACTTCAAA
GTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTTCTTGAAGTGGGGGCAACACAGCGAATGGAAGTTCATTCCTCCAAGTTTGAAG
GTTCACGCACTTCACTGTAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCACGCGCTTCGTTGCAGTTCC
TTCTCCCAAATTCGAAGGTCTCACGGGTTTCACTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCACACACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGT
GCTTCGCTGCAATTCCTTCCTCCAAGTTCGAAGGTTTTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCACTGCAGTTCCTTCCTCCA
AGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCATGTTCGAAGGTTTTCACACGCTTAGTTCGTTCTTCCAAGTTCAAAGGTTCTCACGTGTTTCGCTGCA
GTTCCTTCCTCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTC
ACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTAGAAGGTTCACGCACT
TTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTAACGTGTTTCGCTGCAGTTCCTTCCTCCAAGT
TCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCATTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTGAAGGTTCTCACGC
GCTTCATTGCAGTTCCTTCCTCCAGGTTTGAAGGTTCTCACGCGCTTCATTGCAGTTCGTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCATTGCAGTTCCTTCCTCCA
AGTTCGAAGGTCCTCACATGCTTCGCTGAGGTTCTCACGCGCTTCATTGCAGTTCCTTCATTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCATTGCAGT
TCCTTCCTCCAAGTTCGAATGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTGCAGACAGTTCCTTCCTCCAAGTTCGAAGGTTC
ACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGTGCTTCGCTGCAGTTCCTACC
TTCGAACTGTTCCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCTCACGCGCTTCATTGCAGAACTGCAGAAGGTTCTCCG
CTGCTGCCCAAGTTCGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCACAAGTTCGAAGGCTCTCATGGAAAATATGTATCTTAA
Protein sequenceShow/hide protein sequence
MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDRPKRKMDNLEVKLYDEVNSDKKLQSSIPSRMKRKFSVLINTEGFLKWGQHSEWKFIPPSLK
VHALHCSSFLQVRRFSRASLQFLPPSSKVFTRFVAVPSPKFEGLTGFTAVPSSKFEGSHTSLQFLPPSLKVLTCFAAIPSSKFEGFHALRCSSFLQVRRFSRASLQFLPP
SSKVLTCFAAVPSSMFEGFHTLSSFFQVQRFSRVSLQFLPPSSKVHALRCSSFLQVRRFTHFAAVPSSKFEGSHVLRCSSFLQVRSSFLQVRRFTHFAAVPSSKLEGSRT
LLQFLPPSSKVLTCFAAVPSSKFEGSNVFRCSSFLQVRRFSRASLQFLPPSLKVLTRFIAVPSSKFEGLKVLTRFIAVPSSRFEGSHALHCSSFLQVRRFSCASLQFLPP
SSKVLTCFAEVLTRFIAVPSLQFLPPSSKVLTRFIAVPSSKFECSHVLRCSSFLQVRRFSRRCRQFLPPSSKVHALRCSSFLQVRRFSCASLQFLPPSSKVFMCFAAVPT
FELFPSSKVLTCFAAVPSSKFEGSLTRFIAELQKVLRCCPSSFSRASLCNSFHKFEGSHGKYVS