; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022020 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022020
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr7:16022140..16022885
RNA-Seq ExpressionLag0022020
SyntenyLag0022020
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]2.7e-4852.57Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------
        +A D ALMT+INATLSPEALAY+VG  SSK  W+VLA+ YSS  RSN+VNLK+DLQ I KK DESID+YIKR   IKDKLANVS                
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------

Query:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMF-ASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTS
                TSMRTRS  VTF+ELHVLL++EES L KQSK +D   QPT +  +SQS    +P  N       NNF RG   G+N G    + +F   T  
Subjt:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMF-ASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTS

Query:  VPNRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
                   D   +CQI +R GH ALDC+NRMNY++QGRHPP QLAAMVA+
Subjt:  VPNRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]1.1e-4953.67Show/hide
Query:  ANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS-----------------
        A D ALMT+INATLSPEALAY+VG  +SK  W VLA+ YSSS RSN+VNLK+DLQ ISKK DESID+YIKR   IKDKLANVS                 
Subjt:  ANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS-----------------

Query:  -------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTSVP
               TSMRTRS  VTF+ELHVLLK+EES L KQSKR+DL  QPTA+ A      SS    S  S  +NNF RG+ RGR  G            +S  
Subjt:  -------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTN--------FDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
         + RGG +         +   SCQI  R GH ALDC+NRMNY++ GRHPP  LAAMVA+
Subjt:  NRSRGGTN--------FDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-4953.67Show/hide
Query:  ANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS-----------------
        A D ALMT+INATLSPEALAY+VG  +SK  W VLA+ YSSS RSN+VNLK+DLQ ISKK DESID+YIKR   IKDKLANVS                 
Subjt:  ANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS-----------------

Query:  -------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTSVP
               TSMRTRS  VTF+ELHVLLK+EES L KQSKR+DL  QPTA+ A      SS    S  S  +NNF RG+ RGR  G            +S  
Subjt:  -------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTN--------FDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
         + RGG +         +   SCQI  R GH ALDC+NRMNY++ GRHPP  LAAMVA+
Subjt:  NRSRGGTN--------FDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]2.7e-4852.57Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------
        +A D ALMT+INATLSPEALAY+VG  SSK  W+VLA+ YSS  RSN+VNLK+DLQ I KK DESID+YIKR   IKDKLANVS                
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------

Query:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMF-ASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTS
                TSMRTRS  VTF+ELHVLL++EES L KQSK +D   QPT +  +SQS    +P  N       NNF RG   G+N G    + +F   T  
Subjt:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMF-ASQSYAQSSPRPNSTPSPPSNNF-RGQERGRNSGRRGRKNNFSLPTTS

Query:  VPNRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
                   D   +CQI +R GH ALDC+NRMNY++QGRHPP QLAAMVA+
Subjt:  VPNRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]2.2e-5353.12Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANV-----------------
        +A D ALMTLINATLS EALAY+V   +SK  WEVL +HYSS+ R+N+VNLK+DLQ+I KK +ESID+Y+KR   IKDK ANV                 
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANV-----------------

Query:  -------STSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP
               STSMRTR+ +V+F+ELHV +KSEES +EKQ KREDL  QP A+FAS   +Q     N T +   N    + RG+N+GR   K NF+ PT +  
Subjt:  -------STSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTNF------DTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVA
         R R   NF      D R  CQI  + GH ALDCYNRMN+H+QGRHPPPQLAAMVA
Subjt:  NRSRGGTNF------DTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVA

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X25.0e-4851Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------
        +A D ALMT+INATLSPEALAY+VG  SSK  W+VLA+ YSS  RSN+VNLK+DLQ I KK DESID+YIKR   IKDKLANVS                
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------

Query:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP
                TSMRTRS  VTF+ELHVLL++EES L KQSK +D   QPT + +S     S    +  P+  +N  RG   G++ G    + +F   T    
Subjt:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
        +     +  D   +CQI +R GH ALDC+NRMNY++QGRHPP QLAAMVA+
Subjt:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X35.0e-4851Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------
        +A D ALMT+INATLSPEALAY+VG  SSK  W+VLA+ YSS  RSN+VNLK+DLQ I KK DESID+YIKR   IKDKLANVS                
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------

Query:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP
                TSMRTRS  VTF+ELHVLL++EES L KQSK +D   QPT + +S     S    +  P+  +N  RG   G++ G    + +F   T    
Subjt:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
        +     +  D   +CQI +R GH ALDC+NRMNY++QGRHPP QLAAMVA+
Subjt:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X15.0e-4851Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------
        +A D ALMT+INATLSPEALAY+VG  SSK  W+VLA+ YSS  RSN+VNLK+DLQ I KK DESID+YIKR   IKDKLANVS                
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------

Query:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP
                TSMRTRS  VTF+ELHVLL++EES L KQSK +D   QPT + +S     S    +  P+  +N  RG   G++ G    + +F   T    
Subjt:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
        +     +  D   +CQI +R GH ALDC+NRMNY++QGRHPP QLAAMVA+
Subjt:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

A0A5D3CLI6 T4.55.0e-4851Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------
        +A D ALMT+INATLSPEALAY+VG  SSK  W+VLA+ YSS  RSN+VNLK+DLQ I KK DESID+YIKR   IKDKLANVS                
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANVS----------------

Query:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP
                TSMRTRS  VTF+ELHVLL++EES L KQSK +D   QPT + +S     S    +  P+  +N  RG   G++ G    + +F   T    
Subjt:  --------TSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT
        +     +  D   +CQI +R GH ALDC+NRMNY++QGRHPP QLAAMVA+
Subjt:  NRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVAT

A0A6J1D9L6 uncharacterized protein LOC1110188921.0e-5353.12Show/hide
Query:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANV-----------------
        +A D ALMTLINATLS EALAY+V   +SK  WEVL +HYSS+ R+N+VNLK+DLQ+I KK +ESID+Y+KR   IKDK ANV                 
Subjt:  MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKR---IKDKLANV-----------------

Query:  -------STSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP
               STSMRTR+ +V+F+ELHV +KSEES +EKQ KREDL  QP A+FAS   +Q     N T +   N    + RG+N+GR   K NF+ PT +  
Subjt:  -------STSMRTRSGTVTFDELHVLLKSEESTLEKQSKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVP

Query:  NRSRGGTNF------DTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVA
         R R   NF      D R  CQI  + GH ALDCYNRMN+H+QGRHPPPQLAAMVA
Subjt:  NRSRGGTNF------DTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAAMVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACGACCACGCGTTGATGACACTCATCAATGCAACACTTTCTCCTGAAGCACTTGCATACATTGTTGGCTGCAAATCTTCGAAAGATGAATGGGAGGTACTTGC
TCGACATTACTCTTCTAGTTATCGATCCAACATTGTGAATCTCAAAACGGATCTGCAAGCTATTTCGAAGAAACAGGATGAATCGATAGATTCCTATATCAAACGAATCA
AAGATAAGTTGGCTAATGTTTCGACTTCGATGCGCACACGATCTGGCACAGTCACGTTTGATGAACTCCATGTTCTTCTGAAATCCGAAGAATCAACCCTTGAAAAACAG
TCGAAACGTGAGGATCTTACAATTCAACCAACTGCCATGTTTGCTTCTCAGAGTTATGCTCAGAGTTCTCCTCGACCGAATTCGACTCCATCACCTCCTTCAAATAATTT
TCGTGGCCAAGAAAGAGGCAGGAACTCTGGCCGTAGAGGTAGAAAAAATAATTTCTCACTTCCTACTACTTCAGTTCCAAATCGCAGTCGTGGTGGTACCAACTTTGATA
CTCGATTCAGCTGCCAAATTTTTAATCGCCCTGGACATCAAGCTCTTGACTGCTACAATCGCATGAATTATCACTACCAAGGGCGTCATCCTCCTCCACAGCTGGCTGCA
ATGGTTGCCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAAACGACCACGCGTTGATGACACTCATCAATGCAACACTTTCTCCTGAAGCACTTGCATACATTGTTGGCTGCAAATCTTCGAAAGATGAATGGGAGGTACTTGC
TCGACATTACTCTTCTAGTTATCGATCCAACATTGTGAATCTCAAAACGGATCTGCAAGCTATTTCGAAGAAACAGGATGAATCGATAGATTCCTATATCAAACGAATCA
AAGATAAGTTGGCTAATGTTTCGACTTCGATGCGCACACGATCTGGCACAGTCACGTTTGATGAACTCCATGTTCTTCTGAAATCCGAAGAATCAACCCTTGAAAAACAG
TCGAAACGTGAGGATCTTACAATTCAACCAACTGCCATGTTTGCTTCTCAGAGTTATGCTCAGAGTTCTCCTCGACCGAATTCGACTCCATCACCTCCTTCAAATAATTT
TCGTGGCCAAGAAAGAGGCAGGAACTCTGGCCGTAGAGGTAGAAAAAATAATTTCTCACTTCCTACTACTTCAGTTCCAAATCGCAGTCGTGGTGGTACCAACTTTGATA
CTCGATTCAGCTGCCAAATTTTTAATCGCCCTGGACATCAAGCTCTTGACTGCTACAATCGCATGAATTATCACTACCAAGGGCGTCATCCTCCTCCACAGCTGGCTGCA
ATGGTTGCCACTTAG
Protein sequenceShow/hide protein sequence
MANDHALMTLINATLSPEALAYIVGCKSSKDEWEVLARHYSSSYRSNIVNLKTDLQAISKKQDESIDSYIKRIKDKLANVSTSMRTRSGTVTFDELHVLLKSEESTLEKQ
SKREDLTIQPTAMFASQSYAQSSPRPNSTPSPPSNNFRGQERGRNSGRRGRKNNFSLPTTSVPNRSRGGTNFDTRFSCQIFNRPGHQALDCYNRMNYHYQGRHPPPQLAA
MVAT