; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015593 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015593
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr12:17156581..17157413
RNA-Seq ExpressionLag0015593
SyntenyLag0015593
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.0e-7253.66Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        MAR+ +F+G  +EDPH HLRSFLEIC T+KMNGV  DAI+LRLFPFSLQD+AK WLE++   SI+TW+ LAQAFL K+FPP K+ +LRTEI TFRQL+DE
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM----------------------------------M
        QLYEAWERYK++LRRCPQHGY DWLQ+QLFYNGL  STK++LD +AGG+  SK   EA  +LE++                                  M
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM----------------------------------M

Query:  ASLTNALNKLTSSEVVK----SISTLAEGYSKK-ENQDVEEVQYVGNKAF----NQGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF
        ASLTNAL+KLT+    +    SI++LA   S+   + D E   YV    +    +Q +P  YH +LRNHENF Y+N KNVLQ P GF
Subjt:  ASLTNALNKLTSSEVVK----SISTLAEGYSKK-ENQDVEEVQYVGNKAF----NQGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.2e-6748.75Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M +   F G P +DP+ HL  FLEIC TIKMNGV  D IRLRLFPFSL+DKA+ WL+S++ GSI++W ++A+ FL KFFPP KT +LR+EI  FRQ D E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS
         LYEAWERYK+++R CPQHG  DWLQVQ+FYNGL   T+T++D ++GGT +SKT   A  LLEEM                                +AS
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS

Query:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF
        L++ ++ LT+  + +    +A         +   E+VQY+ N+ +N     +PN+YH  LRNHENF Y NTKNVLQPPPGF
Subjt:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]2.9e-6748.4Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M +   F G P +DP+ HL  FLEIC T+KMNGV  D IRLRLFPFSL+DKA+ WL+S++ GSI++W ++A+ FL KFFPP KT +LR+EI  FRQ D E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS
         LYEAWERYK+++R CPQHG  DWLQVQ+FYNGL   T+T++D ++GGT +SKT   A  LLEEM                                +AS
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS

Query:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF
        L++ ++ L++  + +S   +A         +   E+VQY+ N+ +N     +PN+YH  LRNHENF Y NTKNVLQPPPGF
Subjt:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber]4.4e-6848.75Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M +   F G P +DP+ HL  FLEIC  +KMNGV  D IRLRLFPFSL+DKA+ WL+S++ GSI++W ++A+ FL KFFPP KT +LR+EI  FRQ D E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS
         LYEAWERYK+++R CPQHG LDWLQVQ+FYNGL   T+T++D ++GGT +SKT   A  LLEEM                                +AS
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS

Query:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF
        L++ ++ LT+  + +S+  +A         +   E VQY+ N+ +N     +PN+YH  LRNHENF Y NTKNVLQPPPGF
Subjt:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF

XP_030970157.1 uncharacterized protein LOC115990464 [Quercus lobata]1.7e-6748.4Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M     F G P  DP+ HL  FLEIC T+KMNGV  D IRLRLFPFSL+DKA+ WL+S++ GSI++W ++A+ FL KFFPP KT +LR+EI  FRQ D E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS
         LYEAWERYK+++RRCPQHG+ +WLQ+Q+FYNGL   T+T++D ++ GT +SKTV     LLEEM                                +AS
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS

Query:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF
        L++ ++ LT+  + +S   +A         +V  E+VQY+ N+ +N     +PN+YH  LRNHENF Y NTKNVLQPPPGF
Subjt:  LTNALNKLTSSEVVKSISTLAEGYSKKENQDV--EEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVLQPPPGF

TrEMBL top hitse value%identityAlignment
A0A2I4G4Q3 uncharacterized protein LOC1090047121.5e-6145.61Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M +   F G P +DP+ HL  FLEIC T+K+NGV  D IRLRLFPFSL+D+A+ WL+S++  SI++W ++A+ F  KFFPP KTT+LR+EI  F+Q D E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS
         LYEAWE YK+++RRCPQHG  DWLQVQ+FYNGL   T+T++D ++GGT + KT+  A  LLEEM                                +A+
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEM--------------------------------MAS

Query:  LTNALNKLTSSEVVKS----ISTLAEGYSKKENQDVEEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVL--QPPPGF
        L++ ++ LT+  + +S    ++T     S + +Q  E+VQY+ N+ +N     +PN+YH   +NHEN  Y NTKNVL  QPPPGF
Subjt:  LTNALNKLTSSEVVKS----ISTLAEGYSKKENQDVEEVQYVGNKAFN---QGVPNFYHSSLRNHENFLYSNTKNVL--QPPPGF

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.0e-5748.93Show/hide
Query:  FKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDEQLYEAW
        F G PS+DP+SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+  GSI+TW++LAQ FL KFFPP KT K+R +I +F Q D E LYEAW
Subjt:  FKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDEQLYEAW

Query:  ERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMM------------------ASLTNALNKLTS--SEVVKSIS
        ER+KE+LRRCP HG  DWLQVQ FYNGL  S KT++D +AGG  +SK   +A +LLEEM                   A   +AL  LT+  + + K + 
Subjt:  ERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMM------------------ASLTNALNKLTS--SEVVKSIS

Query:  TLAE-------------GYSKKENQ---DVEEVQYVGNKAFNQGVP--NFYHSSLRNHENFLYSNTKNVLQP----PPGF
        TL               G S   +Q   + E VQ+VGN    Q  P  N Y+   RNH NF +SN      P    PPGF
Subjt:  TLAE-------------GYSKKENQ---DVEEVQYVGNKAFNQGVP--NFYHSSLRNHENFLYSNTKNVLQP----PPGF

A0A6J0ZYV0 uncharacterized protein LOC1104134131.0e-5748.93Show/hide
Query:  FKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDEQLYEAW
        F G PS+DP+SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+  GSI+TW++LAQ FL KFFPP KT K+R +I +F Q D E LYEAW
Subjt:  FKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDEQLYEAW

Query:  ERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMM------------------ASLTNALNKLTS--SEVVKSIS
        ER+KE+LRRCP HG  DWLQVQ FYNGL  S KT++D +AGG  +SK   +A +LLEEM                   A   +AL  LT+  + + K + 
Subjt:  ERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMM------------------ASLTNALNKLTS--SEVVKSIS

Query:  TLAE-------------GYSKKENQ---DVEEVQYVGNKAFNQGVP--NFYHSSLRNHENFLYSNTKNVLQP----PPGF
        TL               G S   +Q   + E VQ+VGN    Q  P  N Y+   RNH NF +SN      P    PPGF
Subjt:  TLAE-------------GYSKKENQ---DVEEVQYVGNKAFNQGVP--NFYHSSLRNHENFLYSNTKNVLQP----PPGF

A0A6P6XAQ1 Reverse transcriptase1.4e-5444.41Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M + + + G+ +EDP+SHL +FLEIC TIK NGV  DAI+LRLFPFSL+DKAK WL+S    + +TWDELA+AFL KFFPPGKT KLR +I +F Q + E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMMASLTNALNK-----------------LTSSEVVK
         LYEAWERY+E+ RRCP HG  DWL VQ FYNGL   TKT +D +AGG  + KT  EA  L+EEM A+     N+                 + S+++  
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMMASLTNALNK-----------------LTSSEVVK

Query:  SISTLAEGYSKKENQDV--------------------EEVQYVGN---KAFNQGVPNFYHSSLRNHENFLYSNTKNVLQP--PPGF
         +  L        NQ V                    E+VQY+ N      N    N Y+   RNH NF + +  N  +P  PPGF
Subjt:  SISTLAEGYSKKENQDV--------------------EEVQYVGN---KAFNQGVPNFYHSSLRNHENFLYSNTKNVLQP--PPGF

A0A803PT47 Uncharacterized protein8.2e-6045.74Show/hide
Query:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE
        M + N F    +EDP+ HL  FLE+C  +KMNGV  DAIRLRLFP SL+D+ + WL+S++  SISTWDE+A+ F+ KFFPP K+ +LR+EI  FR LD E
Subjt:  MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDE

Query:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMMASLTNALN------KLTSSEVVKSISTLAEGYSK
          YEAWER K++LR  PQHGY  W+QV +FYNGL   T+T++D + GG  LSK +TEA +LLEEM  +  N  N      KL     V  ++ +A   S 
Subjt:  QLYEAWERYKEMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMMASLTNALN------KLTSSEVVKSISTLAEGYSK

Query:  KENQD---------------------------VEEVQYVGNKAFNQG-----VPNFYHSSLRNHENFLYSNTKNVLQPPPGF
          NQ+                           +E+ QY+  K +N       +PN+YH  LRNHEN  Y NTKNVLQ P GF
Subjt:  KENQD---------------------------VEEVQYVGNKAFNQG-----VPNFYHSSLRNHENFLYSNTKNVLQPPPGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACGAGACAATTCTTTTAAGGGACACCCCTCGGAGGATCCACATTCCCATCTTCGATCATTCTTGGAGATATGCAGGACGATAAAGATGAATGGAGTTCCAGCCGA
CGCGATAAGATTAAGGTTATTCCCATTTTCTCTTCAGGATAAGGCGAAATATTGGCTTGAATCAGTCGAGACGGGTAGTATCAGCACATGGGACGAGCTTGCCCAGGCAT
TCCTGACAAAATTTTTCCCGCCTGGAAAGACCACGAAGCTCCGGACTGAGATCAGAACATTCAGACAATTGGACGATGAGCAACTTTACGAGGCATGGGAGAGATATAAA
GAAATGCTTAGGAGGTGCCCGCAGCATGGCTATCTAGATTGGCTCCAGGTGCAGTTATTTTACAATGGATTGTATCCTTCCACCAAGACGGTTCTAGACACATCAGCAGG
TGGGACTTTTCTTTCCAAGACAGTGACAGAAGCCAATGATCTTTTAGAGGAAATGATGGCATCATTGACTAATGCTCTGAATAAGCTTACTTCATCTGAGGTGGTCAAAT
CTATCTCCACCTTGGCTGAAGGATATTCAAAGAAGGAAAATCAAGATGTCGAAGAAGTTCAGTATGTGGGAAACAAAGCATTCAATCAAGGAGTACCAAACTTCTACCAT
TCCAGTCTGCGCAATCATGAGAACTTTTTGTACTCCAACACCAAGAACGTATTGCAGCCACCGCCAGGATTTGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACGAGACAATTCTTTTAAGGGACACCCCTCGGAGGATCCACATTCCCATCTTCGATCATTCTTGGAGATATGCAGGACGATAAAGATGAATGGAGTTCCAGCCGA
CGCGATAAGATTAAGGTTATTCCCATTTTCTCTTCAGGATAAGGCGAAATATTGGCTTGAATCAGTCGAGACGGGTAGTATCAGCACATGGGACGAGCTTGCCCAGGCAT
TCCTGACAAAATTTTTCCCGCCTGGAAAGACCACGAAGCTCCGGACTGAGATCAGAACATTCAGACAATTGGACGATGAGCAACTTTACGAGGCATGGGAGAGATATAAA
GAAATGCTTAGGAGGTGCCCGCAGCATGGCTATCTAGATTGGCTCCAGGTGCAGTTATTTTACAATGGATTGTATCCTTCCACCAAGACGGTTCTAGACACATCAGCAGG
TGGGACTTTTCTTTCCAAGACAGTGACAGAAGCCAATGATCTTTTAGAGGAAATGATGGCATCATTGACTAATGCTCTGAATAAGCTTACTTCATCTGAGGTGGTCAAAT
CTATCTCCACCTTGGCTGAAGGATATTCAAAGAAGGAAAATCAAGATGTCGAAGAAGTTCAGTATGTGGGAAACAAAGCATTCAATCAAGGAGTACCAAACTTCTACCAT
TCCAGTCTGCGCAATCATGAGAACTTTTTGTACTCCAACACCAAGAACGTATTGCAGCCACCGCCAGGATTTGCGTGA
Protein sequenceShow/hide protein sequence
MARDNSFKGHPSEDPHSHLRSFLEICRTIKMNGVPADAIRLRLFPFSLQDKAKYWLESVETGSISTWDELAQAFLTKFFPPGKTTKLRTEIRTFRQLDDEQLYEAWERYK
EMLRRCPQHGYLDWLQVQLFYNGLYPSTKTVLDTSAGGTFLSKTVTEANDLLEEMMASLTNALNKLTSSEVVKSISTLAEGYSKKENQDVEEVQYVGNKAFNQGVPNFYH
SSLRNHENFLYSNTKNVLQPPPGFA