CuGenDBv2

Lag0005417 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005417
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr6:17302409..17303309
RNA-Seq Expression: Lag0005417
Synteny: Lag0005417
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
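
The genome location above can be split into its parts and turned into a span length. This is a minimal sketch: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:17302409..17303309")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 901 bp, which is longer than the 753-nt CDS listed under Sequences.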


Homology
GenBank top hits | e value | % identity
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata] | 1.2e-63 | 46.33
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN L EKFL KYFPP RNA+ R+EIV F+Q E++T SEA ERFKE+L+KCP+H LPHCIQME FYNGLN+AT+ +VDASA GA+L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P
        + +CQW+DVR +  +K + ++EVD +S+I A +AS+ N L+++AL  +      V ++A +N  AAESCVYCGEEH F+ CP+NPAS+F++GN      P
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P

Query:  KN-------------------------KQALPHK--------------------NSE------------SSLETLMKEYMTRTDATIQSNQASFRALELQ
        KN                          Q +P K                    N++            +S+E+L+KEYM + D  IQ+ QAS R LE+Q
Subjt:  KN-------------------------KQALPHK--------------------NSE------------SSLETLMKEYMTRTDATIQSNQASFRALELQ

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata] | 3.6e-65 | 46.36
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAE FL KYFPP RNA+ ++EIV F+Q E+ET SEA ERFKE+L+KCP+H LPHCIQME FYNGLN+ T+ +VDASA GA+L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPK----
        + +CQW+DVR +  +K + ++EVD +S+I A +AS+ N L+++AL  +      V + A +N  AAESCVYCGEEH F+ CP+NPAS+F++GN       
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPK----

Query:  ---------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ
                                   N+Q  P  N                               SE+S+E+L+KEYM + DA IQS QAS R LE+Q
Subjt:  ---------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ

Query:  MG
        +G
Subjt:  MG

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata] | 2.6e-63 | 45.83
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAEKFL KYFPP R+A+ R+EIV F++ E ET SEA ERFKE L+KCP+H LPHCIQ+E FYNGLN AT+ +VDASA G +L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P
        + +CQW DVR +  KK + ++EVD +S+I A +AS+ N L+++A            +   +   A ESCVYCGE+H F+ CP+NPAS+F++GN      P
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P

Query:  K--------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ
        K                          N+Q  P  N                               S + LE+L+KEYM R DA IQS Q S R LE+Q
Subjt:  K--------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ

Query:  MGQLANELKARP
        +GQLANEL+ RP
Subjt:  MGQLANELKARP

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa] | 1.2e-63 | 49.82
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAEKFL KYFPP RNAK RSEI+ F+Q+E+ET S+A ERFKE+L+KCP+H +PHCIQ+E FYNGLN A++ ++DASA GA+L+K++NEA+E LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRG-SSKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIG------NA
        + + QWS  R  +S+KV  ++EVD ++ + A +AS+ N LK+M +  +VQ +  ++        A  SCVYCG+ H FE CP+NPASV ++G        
Subjt:  TYSCQWSDVRG-SSKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIG------NA

Query:  PKNKQALPHKNSE---------------SSLETLMKEYMTRTDATIQSNQASFRALELQMGQLANELKARP
         + KQ+ P   S+               SSLE+LM++YM + DA IQS  AS + LE+Q+GQLAN+LK RP
Subjt:  PKNKQALPHKNSE---------------SSLETLMKEYMTRTDATIQSNQASFRALELQMGQLANELKARP

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa] | 1.3e-62 | 46.74
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAEKFL KYFPP RNAK RSEI+ F+Q+E+ET S+A ERFKELL+KCP+H +PHCIQ+E FYNGLN A++ ++DASA GA+L+K++NEA+E LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRG-SSKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPKNKQA
        + + QWS  R  +S+KV  ++EVD ++ + A +AS+ N LK+M +  +VQ +  ++        A  SCVYCG+ H FE CP+N ASV ++GN   N+  
Subjt:  TYSCQWSDVRG-SSKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPKNKQA

Query:  LPHKNS-----------------------------------------ESSLETLMKEYMTRTDATIQSNQASFRALELQMGQLANELKARP
         P+ NS                                          SSLE+LM++YM + D  IQS  AS R LE+Q+GQLAN+LK RP
Subjt:  LPHKNS-----------------------------------------ESSLETLMKEYMTRTDATIQSNQASFRALELQMGQLANELKARP
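
The alignment rows above use the usual BLAST pairwise layout: the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities and positives for a single row. The function name is hypothetical, and because the rows shown here are excerpts, the result is not expected to reproduce the % identity reported for the whole hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_columns) for one alignment row."""
    # Trailing spaces in the midline are often lost when text is copied,
    # so pad it back to the width of the query row.
    midline = midline.ljust(len(query))
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    conservative = sum(1 for m in midline[: len(query)] if m == "+")
    return identities, identities + conservative, len(query)


# First 20 columns of the first GenBank hit (XP_022926214.1).
row_query = "WNKLAEKFLSKYFPPIRNAK"
row_mid = "WN L EKFL KYFPP RNA+"
ident, pos, cols = midline_stats(row_query, row_mid)
print(f"identities {ident}/{cols}, positives {pos}/{cols}")
```

Summing the three counts over every row of a hit, then dividing identities by aligned columns, gives a per-hit identity for whatever portion of the alignment is available.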

TrEMBL top hits | e value | % identity
A0A6J1EEI2 uncharacterized protein LOC111433394 | 5.6e-64 | 46.33
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN L EKFL KYFPP RNA+ R+EIV F+Q E++T SEA ERFKE+L+KCP+H LPHCIQME FYNGLN+AT+ +VDASA GA+L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P
        + +CQW+DVR +  +K + ++EVD +S+I A +AS+ N L+++AL  +      V ++A +N  AAESCVYCGEEH F+ CP+NPAS+F++GN      P
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P

Query:  KN-------------------------KQALPHK--------------------NSE------------SSLETLMKEYMTRTDATIQSNQASFRALELQ
        KN                          Q +P K                    N++            +S+E+L+KEYM + D  IQ+ QAS R LE+Q
Subjt:  KN-------------------------KQALPHK--------------------NSE------------SSLETLMKEYMTRTDATIQSNQASFRALELQ

A0A6J1EQ90 uncharacterized protein LOC111436411 | 1.7e-65 | 46.36
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAE FL KYFPP RNA+ ++EIV F+Q E+ET SEA ERFKE+L+KCP+H LPHCIQME FYNGLN+ T+ +VDASA GA+L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPK----
        + +CQW+DVR +  +K + ++EVD +S+I A +AS+ N L+++AL  +      V + A +N  AAESCVYCGEEH F+ CP+NPAS+F++GN       
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPK----

Query:  ---------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ
                                   N+Q  P  N                               SE+S+E+L+KEYM + DA IQS QAS R LE+Q
Subjt:  ---------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ

Query:  MG
        +G
Subjt:  MG

A0A6J1G7Q6 uncharacterized protein LOC111451598 | 1.2e-63 | 45.83
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAEKFL KYFPP R+A+ R+EIV F++ E ET SEA ERFKE L+KCP+H LPHCIQ+E FYNGLN AT+ +VDASA G +L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P
        + +CQW DVR +  KK + ++EVD +S+I A +AS+ N L+++A            +   +   A ESCVYCGE+H F+ CP+NPAS+F++GN      P
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNA-----P

Query:  K--------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ
        K                          N+Q  P  N                               S + LE+L+KEYM R DA IQS Q S R LE+Q
Subjt:  K--------------------------NKQALPHKN-------------------------------SESSLETLMKEYMTRTDATIQSNQASFRALELQ

Query:  MGQLANELKARP
        +GQLANEL+ RP
Subjt:  MGQLANELKARP

A0A6J1H7E4 uncharacterized protein LOC111461168 | 7.1e-59 | 55.5
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAEKFL KYFPP RNA+ R+EIV F+Q E+ET SEA ERFKE+L+KCP+H LPHCIQME FYNGLN+AT+ +VDASA GA+L+KT+NEAYE LERI+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPKNKQA
        + +CQW+DVR +  KK + ++EVD +S+I A +AS+ N L+++A            + A +   A ESCVYCGEEH F+ CP NPAS+ ++ N       
Subjt:  TYSCQWSDVRGS-SKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPKNKQA

Query:  LPHKNSESS
           KN+ SS
Subjt:  LPHKNSESS

U5CUI2 Retrotrans_gag domain-containing protein | 1.6e-50 | 52.82
Query:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS
        WN LAEKFL KYFPP RNAK RSEI+ F+Q E+E+ S+A ERFKELL+KCP+H +PHCIQME FYNGLN A++ ++DASA GA+L+K++NEA+E LE I+
Subjt:  WNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERIS

Query:  TYSCQWSDVRG-SSKKVKAIMEVDDVSTIRADIASLANTLKSMAL--VSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGN
        + + QWS+ R  +S+KV  ++EVD ++ + A +AS+ N LK++++    N+Q +  ++S          SCV+CGE H FE CP+NP SV ++GN
Subjt:  TYSCQWSDVRG-SSKKVKAIMEVDDVSTIRADIASLANTLKSMAL--VSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGN

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTGGAACAAGTTAGCAGAGAAATTTCTTAGTAAGTATTTTCCACCAATTAGAAATGCCAAGTTAAGGAGTGAGATAGTGAGATTTAGGCAAAATGAGGAAGAAACTTT
TAGTGAGGCTAGGGAAAGGTTTAAGGAGCTTTTGCAAAAGTGTCCCTACCACTGTTTACCACATTGTATTCAAATGGAAATATTTTACAATGGGTTAAACTTAGCAACCC
AGTGTATTGTTGATGCTTCTGCGGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAGGCTTATGAGACTTTAGAGAGAATATCAACCTACAGTTGTCAGTGGTCAGACGTG
AGAGGCTCTAGTAAGAAAGTTAAAGCAATAATGGAAGTTGATGATGTGTCAACCATTAGGGCTGATATTGCATCATTGGCTAATACTCTTAAAAGTATGGCACTTGTTAG
CAATGTTCAGCAGTCGCCAGTGGTGGAATCTATTGCATTTCTGAATCACGTAGCAGCTGAATCTTGTGTCTATTGTGGTGAAGAGCATAATTTTGAATTTTGCCCCAACA
ACCCAGCTTCTGTGTTTTTCATAGGGAATGCCCCAAAAAATAAGCAGGCATTGCCCCATAAAAATTCAGAGAGTTCTCTGGAGACTTTGATGAAAGAATATATGACTCGT
ACTGATGCCACAATTCAGAGTAATCAAGCTTCATTTAGAGCCCTAGAATTGCAAATGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTTAA
mRNA sequence
ATGTGGAACAAGTTAGCAGAGAAATTTCTTAGTAAGTATTTTCCACCAATTAGAAATGCCAAGTTAAGGAGTGAGATAGTGAGATTTAGGCAAAATGAGGAAGAAACTTT
TAGTGAGGCTAGGGAAAGGTTTAAGGAGCTTTTGCAAAAGTGTCCCTACCACTGTTTACCACATTGTATTCAAATGGAAATATTTTACAATGGGTTAAACTTAGCAACCC
AGTGTATTGTTGATGCTTCTGCGGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAGGCTTATGAGACTTTAGAGAGAATATCAACCTACAGTTGTCAGTGGTCAGACGTG
AGAGGCTCTAGTAAGAAAGTTAAAGCAATAATGGAAGTTGATGATGTGTCAACCATTAGGGCTGATATTGCATCATTGGCTAATACTCTTAAAAGTATGGCACTTGTTAG
CAATGTTCAGCAGTCGCCAGTGGTGGAATCTATTGCATTTCTGAATCACGTAGCAGCTGAATCTTGTGTCTATTGTGGTGAAGAGCATAATTTTGAATTTTGCCCCAACA
ACCCAGCTTCTGTGTTTTTCATAGGGAATGCCCCAAAAAATAAGCAGGCATTGCCCCATAAAAATTCAGAGAGTTCTCTGGAGACTTTGATGAAAGAATATATGACTCGT
ACTGATGCCACAATTCAGAGTAATCAAGCTTCATTTAGAGCCCTAGAATTGCAAATGGGTCAGCTAGCTAATGAGCTGAAGGCACGACCTTAA
Protein sequence
MWNKLAEKFLSKYFPPIRNAKLRSEIVRFRQNEEETFSEARERFKELLQKCPYHCLPHCIQMEIFYNGLNLATQCIVDASAGGALLAKTFNEAYETLERISTYSCQWSDV
RGSSKKVKAIMEVDDVSTIRADIASLANTLKSMALVSNVQQSPVVESIAFLNHVAAESCVYCGEEHNFEFCPNNPASVFFIGNAPKNKQALPHKNSESSLETLMKEYMTR
TDATIQSNQASFRALELQMGQLANELKARP
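
The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which the page does not state explicitly; the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code in NCBI order: first base varies slowest,
# third base fastest, over the base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) and drop one terminal stop."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    # A complete CDS ends in a stop codon, shown as '*'; remove only that one.
    return protein[:-1] if protein.endswith("*") else protein


# First eight and last four codons of the CDS listed above.
print(translate("ATGTGGAACAAGTTAGCAGAGAAA"))
print(translate("GCACGACCTTAA"))
```

Passing the full 753-nt CDS to `translate` should give a 250-residue string that can be compared directly with the protein sequence listed here.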