CuGenDBv2

Lag0038656 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038656
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr2:22527522..22530110
RNA-Seq Expression: Lag0038656
Synteny: Lag0038656
Gene Ontology terms: NA
InterPro domains: NA
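
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states), the string can be parsed and the genomic span computed as follows. The helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr2:22527522..22530110")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 2589 positions; with half-open coordinates it would be one less.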


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025725.1 retrotransposon gag protein [Cucumis melo var. makuwa]; e value 2.6e-29; % identity 64
Query:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD
        EEVDNS + +QRTSVFDRIKP  TR S FQR+SMA  EEE QC  ST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ +L+ K + E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD

Query:  KKLQSSISSRIKRKFSVLINTEGSL
         K++S +SSR+KRK SV INTEGSL
Subjt:  KKLQSSISSRIKRKFSVLINTEGSL

KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]; e value 1.7e-28; % identity 60.14
Query:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID
        E +N   SY    EEVDNS + +QRTSVFDRIKP  TR  VFQR+SMA  EEE QC MST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ 
Subjt:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID

Query:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL
        + + K + E N D K+ S + SR+KRK SV INTEGSL
Subjt:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]; e value 4.4e-29; % identity 64
Query:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD
        EEVDNS + +QRTSVFDRIKP  TR SVFQR+SMA  EEE QC  ST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ +L+ K + E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD

Query:  KKLQSSISSRIKRKFSVLINTEGSL
         K+ S + SR+KRK SV INTEGSL
Subjt:  KKLQSSISSRIKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value 2.8e-28; % identity 62.4
Query:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD
        EEVDNS + +QRTS+FDRIKP  TR  VFQR+SMA  EEE QC  ST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ +L+ K + E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD

Query:  KKLQSSISSRIKRKFSVLINTEGSL
         K+ S + SR+KRK SV INTEGSL
Subjt:  KKLQSSISSRIKRKFSVLINTEGSL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]; e value 1.2e-29; % identity 61.59
Query:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID
        E +N   SY    EEVDNS + +QRTSVFDRIKP  TR SVFQR+SMA  EEE QC MST TR SAF+RLS+S SKK RPSTS F RLK+T+DQ +R++ 
Subjt:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID

Query:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL
        +L+ K + E N D K+ S + SRIKRK S+ INTEGSL
Subjt:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMQ5 Retrotransposon gag protein; e value 1.2e-29; % identity 64
Query:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD
        EEVDNS + +QRTSVFDRIKP  TR S FQR+SMA  EEE QC  ST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ +L+ K + E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD

Query:  KKLQSSISSRIKRKFSVLINTEGSL
         K++S +SSR+KRK SV INTEGSL
Subjt:  KKLQSSISSRIKRKFSVLINTEGSL

A0A5A7SZJ7 Retrotransposon gag protein; e value 8.1e-29; % identity 60.14
Query:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID
        E +N   SY    EEVDNS + +QRTSVFDRIKP  TR  VFQR+SMA  EEE QC MST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ 
Subjt:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID

Query:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL
        + + K + E N D K+ S + SR+KRK SV INTEGSL
Subjt:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL

A0A5A7TQ06 Retrotransposon gag protein; e value 2.1e-29; % identity 64
Query:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD
        EEVDNS + +QRTSVFDRIKP  TR SVFQR+SMA  EEE QC  ST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ +L+ K + E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD

Query:  KKLQSSISSRIKRKFSVLINTEGSL
         K+ S + SR+KRK SV INTEGSL
Subjt:  KKLQSSISSRIKRKFSVLINTEGSL

A0A5A7U974 Retrotransposon gag protein; e value 5.6e-30; % identity 61.59
Query:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID
        E +N   SY    EEVDNS + +QRTSVFDRIKP  TR SVFQR+SMA  EEE QC MST TR SAF+RLS+S SKK RPSTS F RLK+T+DQ +R++ 
Subjt:  EKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKID

Query:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL
        +L+ K + E N D K+ S + SRIKRK S+ INTEGSL
Subjt:  NLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein; e value 1.4e-28; % identity 62.4
Query:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD
        EEVDNS + +QRTS+FDRIKP  TR  VFQR+SMA  EEE QC  ST  R SAF+RLS+STSKK RPSTS F RLK+T+DQ +R++ +L+ K + E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKRKIDNLEVKLYDEVNSD

Query:  KKLQSSISSRIKRKFSVLINTEGSL
         K+ S + SR+KRK SV INTEGSL
Subjt:  KKLQSSISSRIKRKFSVLINTEGSL
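
The % identity column can be related to the alignment blocks above. In BLAST-style pairwise output, the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space otherwise. The sketch below counts identities from the middle line; it assumes the leading label and indentation have already been stripped from each line, and it takes the alignment length from the query segments because trailing spaces on the middle line may have been trimmed. The function name `percent_identity` is hypothetical, and the result has not been checked against the values reported on this page.

```python
def percent_identity(query_segments, midline_segments):
    """Percent identity from BLAST-style query and middle lines.

    query_segments: aligned query strings, one per alignment block.
    midline_segments: the matching middle lines, indentation removed.
    """
    aligned = sum(len(segment) for segment in query_segments)
    if aligned == 0:
        raise ValueError("empty alignment")
    # Letters on the middle line mark identical positions;
    # '+' (similar) and ' ' (mismatch or gap) do not count.
    identical = sum(
        1 for segment in midline_segments for ch in segment if ch not in " +"
    )
    return 100.0 * identical / aligned


# Toy example (not taken from this page): 6 identities over 8 columns.
print(percent_identity(["MKTAYIAK"], ["MK A+IAK"]))
```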

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGTTGAATAAATCCTTCTCCAAAATTTTCCACAAAAAGGAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACA
AAGGACTTCCGTCTTCGATCGCATCAAGCCTCCAATTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAATTCAATGTTCGATGTCCACCT
CCACTCGACCTTCAGCTTTCCAAAGACTAAGTGTCTCCACATCAAAGAAAAGTCGACCTTCAACATCTGTTTTTTATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGA
AAGATTGATAACTTGGAGGTGAAACTTTACGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATCTCGTCACGTATAAAGAGGAAGTTCTCTGTTCTCATAAATAC
AGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGTGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTT
GTTCCTTCTCCAAGTTCGAGCGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGTGTTGTTGCATTGTTCCCTCTTCTCTCA
AGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGC
TGCGTTGCAGTTCTTTCTCTCCAAGTTCGAAGGTTCAAGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAG
TTCGAAGGTTCTCCGCTGCTGCAGTTCATTCTTCAAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCA
GTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAATTCCTTCTCCCAAATTCGAAGGT
TCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTC
CTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTTCTTCCCTCCAAGTTTGAAGGTTCT
CACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCT
TCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCTTTCCCTCCAAGTTTGAAGGTTCTCA
CATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCAAGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGTGCTTCGTGCAATTCCTTCC
TCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCAGGCGCTTCGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCTCACGC
GCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGAGCTTCGTTGGTGGTGTGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTT
GCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCTCCAAG
TTCGAAGGTTCAAGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGT
GCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGCTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCTAAGTTCG
AAGGTTCTCATGCGCTTCGTTGCAGTTTCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCAATGCTACGCTCAGCTGTACTGCTGCGCTACT
TCCTAA
mRNA sequence
ATGTTGAATAAATCCTTCTCCAAAATTTTCCACAAAAAGGAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACA
AAGGACTTCCGTCTTCGATCGCATCAAGCCTCCAATTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAATTCAATGTTCGATGTCCACCT
CCACTCGACCTTCAGCTTTCCAAAGACTAAGTGTCTCCACATCAAAGAAAAGTCGACCTTCAACATCTGTTTTTTATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGA
AAGATTGATAACTTGGAGGTGAAACTTTACGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATCTCGTCACGTATAAAGAGGAAGTTCTCTGTTCTCATAAATAC
AGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGTGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTT
GTTCCTTCTCCAAGTTCGAGCGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGTGTTGTTGCATTGTTCCCTCTTCTCTCA
AGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGC
TGCGTTGCAGTTCTTTCTCTCCAAGTTCGAAGGTTCAAGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAG
TTCGAAGGTTCTCCGCTGCTGCAGTTCATTCTTCAAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCA
GTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAATTCCTTCTCCCAAATTCGAAGGT
TCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTC
CTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTTCTTCCCTCCAAGTTTGAAGGTTCT
CACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCT
TCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCTTTCCCTCCAAGTTTGAAGGTTCTCA
CATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCAAGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGTGCTTCGTGCAATTCCTTCC
TCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCAGGCGCTTCGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCTCACGC
GCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGAGCTTCGTTGGTGGTGTGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTT
GCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCTCCAAG
TTCGAAGGTTCAAGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGT
GCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGCTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCTAAGTTCG
AAGGTTCTCATGCGCTTCGTTGCAGTTTCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCAATGCTACGCTCAGCTGTACTGCTGCGCTACT
TCCTAA
Protein sequence
MLNKSFSKIFHKKEKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPITRPSVFQRMSMAATEEEIQCSMSTSTRPSAFQRLSVSTSKKSRPSTSVFYRLKVTSDQPKR
KIDNLEVKLYDEVNSDKKLQSSISSRIKRKFSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFERSQLYNCYVVPPPSAKDLMWCVVALFPLLS
SSMVLTQLCWSFFSPSSKVLTRSVAVPSFQGRRFSLAALQFFLSKFEGSSTSLQFLLPNSKVLTRFALQFLPQVRRFSAAAVHSSKFEGSHVASLQFLPPSSKVLTCFAA
VPSLQGRRFSRAALQFFLPKFEGSRTSLQFLLPNSKVLTRFALQFLPPSSKVLTRFVQFLPPNSKVLTRFAAVPSSKFEGSHALRCSSFLTIRRFSRASLQFLPSKFEGS
HIASLRSFLQVRRFSRASLCNSFPQVRRFSRASLQFLPPSLKVLTRFVAVPSSQFEGSHALRCSSFPPSLKVLTSLRCDPSSKFEGSQALRSAIPSPKFEGSHVLRAIPS
SKFEGSHALRCISFPPNSKVLRRFAAAPSSKFEGSHALRAVPSSKFEGSHELRWWCVVALFPLLSSSMVLTQLCWSFFSPSSKVLTRSVAVPSFQGRRFSLAALQFFLSK
FEGSSTSLQFLLPNSKVLTRFALQFLPPSSKVLTRFVQFLPPNSKVLTRFAAALSSKFEGSHALRSAIPSPKFEGSHALRCSFFLQVRRFSRASLQFLPQCYAQLYCCAT
S
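
The CDS and protein sequences above can be cross-checked by length. This is a minimal sketch, assuming the CDS is given 5' to 3', starts with ATG, and ends with exactly one of the standard stop codons (TAA, TAG, TGA), so that the protein should have one residue per codon minus the stop. The function name `expected_protein_length` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds):
    """Number of residues implied by a complete CDS (stop codon excluded)."""
    # Join wrapped lines and normalize case before checking.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        raise ValueError("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("CDS does not end with a stop codon")
    return len(cds) // 3 - 1


# Toy example (not taken from this page): ATG AAA TAA encodes 2 residues.
print(expected_protein_length("ATGAAA\nTAA"))
```

Comparing this value with the length of the listed protein sequence is a quick way to spot a truncated copy of either sequence.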