; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001754 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001754
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr4:35037900..35042057
RNA-Seq ExpressionLag0001754
SyntenyLag0001754
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035612.1 No apical meristem (NAM) protein [Cucumis melo var. makuwa]1.0e-2230.51Show/hide
Query:  INPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSSKKSHSGQLVDYL-----------
        INPY +HHS   T A+V + L+GA NYISWS AML+A+S +NK GF+ G I+K + G L  AW         ++L+S        + Y            
Subjt:  INPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSSKKSHSGQLVDYL-----------

Query:  ---ELCLAV----------------AESVALAVATDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQ-------------
           + C+++                 + VAL   T N        +  +RK+E  RP C +CG+KGH+ D+CYK HGYPPGY   N              
Subjt:  ---ELCLAV----------------AESVALAVATDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQ-------------

Query:  QGNSFIA------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLFIIYDLLTTYLFTYNLL
          NS  A      FS L+ E                                           +D W++ SGAS H+CHDKSLF        +  T N+ 
Subjt:  QGNSFIA------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLFIIYDLLTTYLFTYNLL

Query:  SISALLKDSRYSVDFTGDSCCIQDKLLLRKI
         +  L    R SVD  GD   I   LLL+ +
Subjt:  SISALLKDSRYSVDFTGDSCCIQDKLLLRKI

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]7.2e-2124.84Show/hide
Query:  SSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSS-----------------
        S  +AQ+NPY +HHS   T A+VT+ L+GA NY SWS AML+A+S +NK GF+ G I+K + G L  AW         ++L+S                 
Subjt:  SSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSS-----------------

Query:  ---------KKSHSGQL------------------------------------------------VDYLE------LCLAVAESVA--------------
                 K+S+   +                                                +D+LE        + + +S A              
Subjt:  ---------KKSHSGQL------------------------------------------------VDYLE------LCLAVAESVA--------------

Query:  ------LAVATDNSKK---------------GSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQG----------------NSFI
              L +  +  +                 S       R R+ +RP CS+CG+KGH+ D+CYK HGYPPGY   N                   NS  
Subjt:  ------LAVATDNSKK---------------GSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQG----------------NSFI

Query:  A------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLF----------------------
        A      FS L+ E                                           +D W++DSGAS HICHDKSLF                      
Subjt:  A------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLF----------------------

Query:  -----------IIYDLLTTYLFTYNLLSISALLKDSRYSVDFTGDSCCIQDKLLLRKIGKVDC
                    + D+L    F YNL+S+S LL     S+DF    C IQD      IGK  C
Subjt:  -----------IIYDLLTTYLFTYNLLSISALLKDSRYSVDFTGDSCCIQDKLLLRKIGKVDC

KAA8535721.1 hypothetical protein F0562_030716 [Nyssa sinensis]3.9e-1939.25Show/hide
Query:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S
        M+DS +    N   + S+++   NPY LHHS +    LV++QL+G ENY +WS AMLIALS KNKLGFV+G+I   +++S +L      + S        
Subjt:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S

Query:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS
        K+++           +A A    +A +  +S + SQ+  S  S+ ++  RP C HC + GH VDR YKIHGYPPGY  R N   N+
Subjt:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS

KAA8535725.1 hypothetical protein F0562_030773 [Nyssa sinensis]1.1e-1838.71Show/hide
Query:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S
        M+DS +    N   + S+++   NPY LHHS +    LV++QL+G ENY +WS AMLIAL  KNKLGFV+G+I   +++S +L      + S        
Subjt:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S

Query:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS
        K+++           +A A    +A +  +S + SQ+  S  S+ ++  RP C+HC + GH VDR YKIHGYPPGY  R N   N+
Subjt:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS

XP_022154608.1 uncharacterized protein LOC111021831 [Momordica charantia]3.7e-1729.55Show/hide
Query:  PLLDESS----------LEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKK---------LASGNLSTAWCFLLSSK
        P+LD+SS          +++ +NPY LHH+  T   LVT+ L+  ENY SWS +MLIALS KNKLGF++G+I +         + + ++  AW     SK
Subjt:  PLLDESS----------LEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKK---------LASGNLSTAWCFLLSSK

Query:  KSHSG-------------------------------------------------------QLVDYLELC------LAVAESVALAVATDNSKKGSQDRSQ
        +  S                                                        +L+ Y  LC       + + + +LA+A  +S   S+  ++
Subjt:  KSHSG-------------------------------------------------------QLVDYLELC------LAVAESVALAVATDNSKKGSQDRSQ

Query:  ---MSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQGNS
            + K + +RP C+HCG+ GH +DRCYK+HGYPPGY   N + +S
Subjt:  ---MSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQGNS

TrEMBL top hitse value%identityAlignment
A0A2N9J0I6 Integrase catalytic domain-containing protein1.4e-1737.84Show/hide
Query:  NPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI-----KKLASGNLSTAWCFLLSSKKSHSGQLVDYLELCLAVAESVALAVA
        N + LHH  +    LV++ LSG +NY +WS ++++AL+ KNK+GF+NGTI     + L S NL T  C  +  ++S     +          +S+AL   
Subjt:  NPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI-----KKLASGNLSTAWCFLLSSKKSHSGQLVDYLELCLAVAESVALAVA

Query:  TDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY
        ++  +     RS   +K   +RP+CSHCG+ GH+VD+CYK+HG+PPG+
Subjt:  TDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 83.5e-2124.84Show/hide
Query:  SSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSS-----------------
        S  +AQ+NPY +HHS   T A+VT+ L+GA NY SWS AML+A+S +NK GF+ G I+K + G L  AW         ++L+S                 
Subjt:  SSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSS-----------------

Query:  ---------KKSHSGQL------------------------------------------------VDYLE------LCLAVAESVA--------------
                 K+S+   +                                                +D+LE        + + +S A              
Subjt:  ---------KKSHSGQL------------------------------------------------VDYLE------LCLAVAESVA--------------

Query:  ------LAVATDNSKK---------------GSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQG----------------NSFI
              L +  +  +                 S       R R+ +RP CS+CG+KGH+ D+CYK HGYPPGY   N                   NS  
Subjt:  ------LAVATDNSKK---------------GSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQG----------------NSFI

Query:  A------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLF----------------------
        A      FS L+ E                                           +D W++DSGAS HICHDKSLF                      
Subjt:  A------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLF----------------------

Query:  -----------IIYDLLTTYLFTYNLLSISALLKDSRYSVDFTGDSCCIQDKLLLRKIGKVDC
                    + D+L    F YNL+S+S LL     S+DF    C IQD      IGK  C
Subjt:  -----------IIYDLLTTYLFTYNLLSISALLKDSRYSVDFTGDSCCIQDKLLLRKIGKVDC

A0A5D3E5P0 No apical meristem (NAM) protein4.9e-2330.51Show/hide
Query:  INPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSSKKSHSGQLVDYL-----------
        INPY +HHS   T A+V + L+GA NYISWS AML+A+S +NK GF+ G I+K + G L  AW         ++L+S        + Y            
Subjt:  INPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAW--------CFLLSSKKSHSGQLVDYL-----------

Query:  ---ELCLAV----------------AESVALAVATDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQ-------------
           + C+++                 + VAL   T N        +  +RK+E  RP C +CG+KGH+ D+CYK HGYPPGY   N              
Subjt:  ---ELCLAV----------------AESVALAVATDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQ-------------

Query:  QGNSFIA------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLFIIYDLLTTYLFTYNLL
          NS  A      FS L+ E                                           +D W++ SGAS H+CHDKSLF        +  T N+ 
Subjt:  QGNSFIA------FSYLHEE------------------------------------------VNDSWVVDSGASSHICHDKSLFIIYDLLTTYLFTYNLL

Query:  SISALLKDSRYSVDFTGDSCCIQDKLLLRKI
         +  L    R SVD  GD   I   LLL+ +
Subjt:  SISALLKDSRYSVDFTGDSCCIQDKLLLRKI

A0A5J5AX79 Retrotran_gag_3 domain-containing protein5.6e-1938.71Show/hide
Query:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S
        M+DS +    N   + S+++   NPY LHHS +    LV++QL+G ENY +WS AMLIAL  KNKLGFV+G+I   +++S +L      + S        
Subjt:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S

Query:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS
        K+++           +A A    +A +  +S + SQ+  S  S+ ++  RP C+HC + GH VDR YKIHGYPPGY  R N   N+
Subjt:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS

A0A5J5B3G7 Retrotran_gag_3 domain-containing protein1.9e-1939.25Show/hide
Query:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S
        M+DS +    N   + S+++   NPY LHHS +    LV++QL+G ENY +WS AMLIALS KNKLGFV+G+I   +++S +L      + S        
Subjt:  MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTI--KKLASGNLSTAWCFLLS-------S

Query:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS
        K+++           +A A    +A +  +S + SQ+  S  S+ ++  RP C HC + GH VDR YKIHGYPPGY  R N   N+
Subjt:  KKSHSGQLVDYLELCLAVAESVALAVATDNSKKGSQDR-SQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGY-MRPNQQGNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACTCAACTGCAGTCGTCGCCGCTAATCCCCTTCTTGATGAATCTTCCCTTGAAGCCCAGATCAATCCCTACATCTTGCATCACTCGACTGCCACCACCACTGC
TCTTGTGACTGAGCAACTTAGTGGCGCAGAGAATTACATTTCTTGGAGCATGGCTATGCTGATTGCTCTTTCCGAGAAGAATAAACTGGGATTTGTGAACGGAACTATCA
AGAAACTTGCTAGCGGGAACTTGTCGACAGCATGGTGTTTTCTCTTGTCATCCAAGAAGAGTCATAGCGGGCAGTTGGTGGATTACCTGGAACTTTGCCTGGCGGTTGCG
GAGTCAGTAGCCTTAGCAGTTGCAACTGACAATTCGAAGAAAGGCTCGCAGGACAGATCTCAAATGTCGCGCAAAAGGGAAGGACAACGCCCTATATGCTCACATTGTGG
AGTTAAAGGCCATGTTGTCGATCGTTGTTATAAGATACATGGTTATCCACCGGGTTACATGCGACCTAATCAACAAGGTAATTCCTTTATCGCTTTCTCATACTTACATG
AAGAAGTCAATGATTCTTGGGTAGTAGATTCAGGTGCATCCAGCCATATTTGTCATGACAAATCCCTATTCATAATCTACGACCTGCTCACAACATATCTGTTCACTTAC
AACCTCCTGTCGATCAGTGCTTTATTAAAGGATAGTAGATACTCTGTTGATTTTACTGGTGATTCTTGTTGCATCCAGGACAAGTTGCTATTGAGGAAGATTGGCAAGGT
TGATTGTGTTAAAGATTCTATCAGTGATCGAAAAAATTTCTATCGACCTCGACTCCGCTCTCCATTTAACGCCCCCGACTCTGTCGACCTTGGGATTGTCCGTCGGAAAC
TTGATCTCGAGGCGGAAGATCCAGGCGAACTGGCAATAAGACCATGGAGGCTGCTTCACCATCGGGAAGAAGAGATGGCAATAAGTCTAGAGGAATCGAAGTTCGAGCGA
CGATAA
mRNA sequenceShow/hide mRNA sequence
ATGACCGACTCAACTGCAGTCGTCGCCGCTAATCCCCTTCTTGATGAATCTTCCCTTGAAGCCCAGATCAATCCCTACATCTTGCATCACTCGACTGCCACCACCACTGC
TCTTGTGACTGAGCAACTTAGTGGCGCAGAGAATTACATTTCTTGGAGCATGGCTATGCTGATTGCTCTTTCCGAGAAGAATAAACTGGGATTTGTGAACGGAACTATCA
AGAAACTTGCTAGCGGGAACTTGTCGACAGCATGGTGTTTTCTCTTGTCATCCAAGAAGAGTCATAGCGGGCAGTTGGTGGATTACCTGGAACTTTGCCTGGCGGTTGCG
GAGTCAGTAGCCTTAGCAGTTGCAACTGACAATTCGAAGAAAGGCTCGCAGGACAGATCTCAAATGTCGCGCAAAAGGGAAGGACAACGCCCTATATGCTCACATTGTGG
AGTTAAAGGCCATGTTGTCGATCGTTGTTATAAGATACATGGTTATCCACCGGGTTACATGCGACCTAATCAACAAGGTAATTCCTTTATCGCTTTCTCATACTTACATG
AAGAAGTCAATGATTCTTGGGTAGTAGATTCAGGTGCATCCAGCCATATTTGTCATGACAAATCCCTATTCATAATCTACGACCTGCTCACAACATATCTGTTCACTTAC
AACCTCCTGTCGATCAGTGCTTTATTAAAGGATAGTAGATACTCTGTTGATTTTACTGGTGATTCTTGTTGCATCCAGGACAAGTTGCTATTGAGGAAGATTGGCAAGGT
TGATTGTGTTAAAGATTCTATCAGTGATCGAAAAAATTTCTATCGACCTCGACTCCGCTCTCCATTTAACGCCCCCGACTCTGTCGACCTTGGGATTGTCCGTCGGAAAC
TTGATCTCGAGGCGGAAGATCCAGGCGAACTGGCAATAAGACCATGGAGGCTGCTTCACCATCGGGAAGAAGAGATGGCAATAAGTCTAGAGGAATCGAAGTTCGAGCGA
CGATAA
Protein sequenceShow/hide protein sequence
MTDSTAVVAANPLLDESSLEAQINPYILHHSTATTTALVTEQLSGAENYISWSMAMLIALSEKNKLGFVNGTIKKLASGNLSTAWCFLLSSKKSHSGQLVDYLELCLAVA
ESVALAVATDNSKKGSQDRSQMSRKREGQRPICSHCGVKGHVVDRCYKIHGYPPGYMRPNQQGNSFIAFSYLHEEVNDSWVVDSGASSHICHDKSLFIIYDLLTTYLFTY
NLLSISALLKDSRYSVDFTGDSCCIQDKLLLRKIGKVDCVKDSISDRKNFYRPRLRSPFNAPDSVDLGIVRRKLDLEAEDPGELAIRPWRLLHHREEEMAISLEESKFER
R