; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027810 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027810
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:5364373..5365413
RNA-Seq ExpressionLag0027810
SyntenyLag0027810
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8517066.1 hypothetical protein F0562_017116 [Nyssa sinensis]4.2e-3539.92Show/hide
Query:  STSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------
        +T + +S + S   ++  S++IE  ++PY+LHH+D+     V++ LT  NY +WSR+MLIALSVKNKLGF+DGSI +P                      
Subjt:  STSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------

Query:  ---------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIPI-
                        LKT+W+E   YRP C+CG+C+CG  + L+   Q EY+MSFLM L++SF+  R Q+LLMDP P  +RVFSL+ Q+EQQ RT P  
Subjt:  ---------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIPI-

Query:  -FATPPPAVAFA----------APSRPQNQSTSRSRK-DRPICTHYAI
          +     +AFA          + S+  N S S+++K DRP CTH  I
Subjt:  -FATPPPAVAFA----------APSRPQNQSTSRSRK-DRPICTHYAI

KAA8550199.1 hypothetical protein F0562_001883 [Nyssa sinensis]3.1e-3842.86Show/hide
Query:  STLSSSTSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPT---------GSLKTLW
        +TLSS + +      +S  S     +IE  ++PY+LHH+D+    LV++ LT  NY +WSR+MLIALSVK KLGF+DGSI +P            LKT+W
Subjt:  STLSSSTSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPT---------GSLKTLW

Query:  DEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIP-------------IFATPPPAVA
        +E   YRP C+CG+C+CG  + +++  Q EY+MSFLMGL++SF+  R Q+LLMDP P  +RVFSL+ Q+EQQ RT P             +  T      
Subjt:  DEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIP-------------IFATPPPAVA

Query:  FAAPSRPQNQSTSRS---RKDRPICTHYAIQ
         + P   QN ++S S   +KD+P CTH  I+
Subjt:  FAAPSRPQNQSTSRS---RKDRPICTHYAIQ

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]4.9e-3636.78Show/hide
Query:  IEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGS----------------------------------------
        +E   +PYFLHH+D T+L LV++LLTD NY SWSRS++IAL+VKNK+GF+DGSI +PT                                          
Subjt:  IEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGS----------------------------------------

Query:  ------------------------------------LKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSA
                                            LKTLW E   YRP C+CGRC+ G  + ++  +Q EY+M+FLMGLN SF+  RAQ+LLM+P P+ 
Subjt:  ------------------------------------LKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSA

Query:  SRVFSLLSQKEQQRTIPI-FATPPPAVAFAAPSRPQNQSTSRS------RKDRPICTHYAI
        +R F+L++Q+ QQR+I +   T P A A  A S   N   + S      RKD+ +CTH  I
Subjt:  SRVFSLLSQKEQQRTIPI-FATPPPAVAFAAPSRPQNQSTSRS------RKDRPICTHYAI

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia]5.2e-4644.4Show/hide
Query:  LHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------------------------------------
        +HHNDT+NL LV++ LT+ NYVSWSRSM IALS+KNKLGFI+GS+ KP G                                                  
Subjt:  LHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------------------------------------

Query:  --------------------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQ
                                   LK LWDEY++YRPGCTCG C+CG  R++++F QFE+LM FLMGLNESFA  RAQILLMDPPPS  + FSL+SQ
Subjt:  --------------------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQ

Query:  KEQQRTIPIFATPPPAVAFAA-PSRPQNQSTSRSRKDR---PICTHYAIQ
        +EQQR IP+F+TP PAV  A   SR  + S S SR+     P CT+  I+
Subjt:  KEQQRTIPIFATPPPAVAFAA-PSRPQNQSTSRSRKDR---PICTHYAIQ

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida]2.2e-3638.7Show/hide
Query:  PYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGSLKTLW-----------------------------------------
        PY LHH+DT+NL LV+ELLTD NYVSWSRSM++ L ++NKLGFIDGS+ +PTG L  LW                                         
Subjt:  PYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGSLKTLW-----------------------------------------

Query:  -----------------------------------DEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSL
                                           DEY++YRPGCTCG+CTCG  + +++F QFEYL+ F MGLN+SF  TR+Q+LLMDPPP  ++ FS 
Subjt:  -----------------------------------DEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSL

Query:  LSQKEQQRTIPIFATPPPAVAFAAPSRPQN
        + Q+EQ +++   A PP +V      +  N
Subjt:  LSQKEQQRTIPIFATPPPAVAFAAPSRPQN

TrEMBL top hitse value%identityAlignment
A0A5J4ZHF9 Uncharacterized protein2.0e-3539.92Show/hide
Query:  STSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------
        +T + +S + S   ++  S++IE  ++PY+LHH+D+     V++ LT  NY +WSR+MLIALSVKNKLGF+DGSI +P                      
Subjt:  STSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------

Query:  ---------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIPI-
                        LKT+W+E   YRP C+CG+C+CG  + L+   Q EY+MSFLM L++SF+  R Q+LLMDP P  +RVFSL+ Q+EQQ RT P  
Subjt:  ---------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIPI-

Query:  -FATPPPAVAFA----------APSRPQNQSTSRSRK-DRPICTHYAI
          +     +AFA          + S+  N S S+++K DRP CTH  I
Subjt:  -FATPPPAVAFA----------APSRPQNQSTSRSRK-DRPICTHYAI

A0A5J5C4X6 Retrotran_gag_3 domain-containing protein1.5e-3842.86Show/hide
Query:  STLSSSTSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPT---------GSLKTLW
        +TLSS + +      +S  S     +IE  ++PY+LHH+D+    LV++ LT  NY +WSR+MLIALSVK KLGF+DGSI +P            LKT+W
Subjt:  STLSSSTSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPT---------GSLKTLW

Query:  DEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIP-------------IFATPPPAVA
        +E   YRP C+CG+C+CG  + +++  Q EY+MSFLMGL++SF+  R Q+LLMDP P  +RVFSL+ Q+EQQ RT P             +  T      
Subjt:  DEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQ-RTIP-------------IFATPPPAVA

Query:  FAAPSRPQNQSTSRS---RKDRPICTHYAIQ
         + P   QN ++S S   +KD+P CTH  I+
Subjt:  FAAPSRPQNQSTSRS---RKDRPICTHYAIQ

A0A6J1DIP8 uncharacterized protein LOC1110203996.5e-3438.15Show/hide
Query:  SSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGS-----------------------------
        +STS+ +   +IE  T+PYFLHH+D T+L LV++ LT+ NY SWSRSMLIAL+VKNK+GF+DGSI++PTG                              
Subjt:  SSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGS-----------------------------

Query:  -----------------------------------------------LKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRA
                                                       LKTLW E  +Y P CT GRC+CG  + +  F Q E++M FLMGLNESF+  R 
Subjt:  -----------------------------------------------LKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRA

Query:  QILLMDPPPSASRVFSLLSQKEQQRTIPIFATPPPAVAFAAPSRPQNQS
        Q+LLM+P P+ +RVFSL+SQ+ QQR I + +T P  +  A  +R  + S
Subjt:  QILLMDPPPSASRVFSLLSQKEQQRTIPIFATPPPAVAFAAPSRPQNQS

A0A6J1DLQ9 uncharacterized protein LOC1110221172.5e-4644.4Show/hide
Query:  LHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------------------------------------
        +HHNDT+NL LV++ LT+ NYVSWSRSM IALS+KNKLGFI+GS+ KP G                                                  
Subjt:  LHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTG--------------------------------------------------

Query:  --------------------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQ
                                   LK LWDEY++YRPGCTCG C+CG  R++++F QFE+LM FLMGLNESFA  RAQILLMDPPPS  + FSL+SQ
Subjt:  --------------------------SLKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQ

Query:  KEQQRTIPIFATPPPAVAFAA-PSRPQNQSTSRSRKDR---PICTHYAIQ
        +EQQR IP+F+TP PAV  A   SR  + S S SR+     P CT+  I+
Subjt:  KEQQRTIPIFATPPPAVAFAA-PSRPQNQSTSRSRKDR---PICTHYAIQ

A0A6J1DNP7 uncharacterized protein LOC1110220652.4e-3636.78Show/hide
Query:  IEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGS----------------------------------------
        +E   +PYFLHH+D T+L LV++LLTD NY SWSRS++IAL+VKNK+GF+DGSI +PT                                          
Subjt:  IEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGS----------------------------------------

Query:  ------------------------------------LKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSA
                                            LKTLW E   YRP C+CGRC+ G  + ++  +Q EY+M+FLMGLN SF+  RAQ+LLM+P P+ 
Subjt:  ------------------------------------LKTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSA

Query:  SRVFSLLSQKEQQRTIPI-FATPPPAVAFAAPSRPQNQSTSRS------RKDRPICTHYAI
        +R F+L++Q+ QQR+I +   T P A A  A S   N   + S      RKD+ +CTH  I
Subjt:  SRVFSLLSQKEQQRTIPI-FATPPPAVAFAAPSRPQNQSTSRS------RKDRPICTHYAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCCTACAGATTTTCTTGTCACAACGTCTGGTTCTACTTGGATCGTCATTCCGCCGGCTATATCTTCTACTTTGTCTAGTTCTACTTCGACCGTCATTTCACCCAC
TATATCTTCTACTTCGTCTTCGCTGATTTCAACGTCGATTGAAGCCGCTACAGATCCATATTTTCTTCACCACAATGATACCACCAACCTTTTTCTCGTCACAGAGCTTC
TTACTGATGGCAACTATGTTTCATGGAGTCGTTCAATGCTTATCGCTCTCTCTGTTAAGAATAAACTTGGATTCATTGATGGCTCGATTTTGAAACCCACTGGTAGCTTG
AAGACTCTTTGGGATGAATACATCACATATCGTCCTGGTTGTACTTGTGGTCGATGTACTTGTGGTGATCAACGTGTTCTTGATGAATTTTTCCAATTTGAGTATCTTAT
GAGTTTTTTGATGGGGTTGAATGAGTCTTTTGCTCCCACAAGAGCTCAAATTTTACTCATGGATCCACCTCCTTCTGCCAGCAGAGTTTTTTCCCTTTTATCTCAGAAAG
AACAACAACGTACCATTCCTATTTTTGCAACACCTCCACCTGCTGTTGCCTTTGCTGCACCTTCACGACCGCAAAATCAATCGACTTCTCGTTCACGGAAAGATCGCCCT
ATTTGTACCCATTATGCTATTCAAAAGGACATACTGTTGATAGATGTTATAAACTTCACGGTTATCCTCTGGGTTATAAACAACGGGGAGTTCAACGTTCTGTTAATGCT
CCAGCGCCACATATATCATCCTCCAGTTCTGTCTCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACCCTACAGATTTTCTTGTCACAACGTCTGGTTCTACTTGGATCGTCATTCCGCCGGCTATATCTTCTACTTTGTCTAGTTCTACTTCGACCGTCATTTCACCCAC
TATATCTTCTACTTCGTCTTCGCTGATTTCAACGTCGATTGAAGCCGCTACAGATCCATATTTTCTTCACCACAATGATACCACCAACCTTTTTCTCGTCACAGAGCTTC
TTACTGATGGCAACTATGTTTCATGGAGTCGTTCAATGCTTATCGCTCTCTCTGTTAAGAATAAACTTGGATTCATTGATGGCTCGATTTTGAAACCCACTGGTAGCTTG
AAGACTCTTTGGGATGAATACATCACATATCGTCCTGGTTGTACTTGTGGTCGATGTACTTGTGGTGATCAACGTGTTCTTGATGAATTTTTCCAATTTGAGTATCTTAT
GAGTTTTTTGATGGGGTTGAATGAGTCTTTTGCTCCCACAAGAGCTCAAATTTTACTCATGGATCCACCTCCTTCTGCCAGCAGAGTTTTTTCCCTTTTATCTCAGAAAG
AACAACAACGTACCATTCCTATTTTTGCAACACCTCCACCTGCTGTTGCCTTTGCTGCACCTTCACGACCGCAAAATCAATCGACTTCTCGTTCACGGAAAGATCGCCCT
ATTTGTACCCATTATGCTATTCAAAAGGACATACTGTTGATAGATGTTATAAACTTCACGGTTATCCTCTGGGTTATAAACAACGGGGAGTTCAACGTTCTGTTAATGCT
CCAGCGCCACATATATCATCCTCCAGTTCTGTCTCTATGA
Protein sequenceShow/hide protein sequence
MNPTDFLVTTSGSTWIVIPPAISSTLSSSTSTVISPTISSTSSSLISTSIEAATDPYFLHHNDTTNLFLVTELLTDGNYVSWSRSMLIALSVKNKLGFIDGSILKPTGSL
KTLWDEYITYRPGCTCGRCTCGDQRVLDEFFQFEYLMSFLMGLNESFAPTRAQILLMDPPPSASRVFSLLSQKEQQRTIPIFATPPPAVAFAAPSRPQNQSTSRSRKDRP
ICTHYAIQKDILLIDVINFTVILWVINNGEFNVLLMLQRHIYHPPVLSL