; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035336 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035336
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:19585568..19588314
RNA-Seq ExpressionLag0035336
SyntenyLag0035336
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.7e-2241.21Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIG--------------RSSSSSMSSCSR---------------LGSDLK-----TAGGT
        D+A+DWL++IPP SITTW+ L QAFL K+FPPAK+ +LRTEIG                    +  C +               L S  K     TAGG+
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIG--------------RSSSSSMSSCSR---------------LGSDLK-----TAGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPK-KIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQ----SIESAAVLAS
        + SK  + A  +LED+AT SY W  ER++P     AG++E+D V++L+ QM+SL NA  K +  G AQ    SI S A LAS
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPK-KIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQ----SIESAAVLAS

XP_015387963.1 uncharacterized protein LOC107177920 [Citrus sinensis]5.6e-2143.87Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSC-----------------------------SRLGSDLKT-----AGGT
        DKA++WL S+  G+ITTWD L Q FL K+FPPAKT KLR +I   +   M S                              + LGS+ +T     AGGT
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSC-----------------------------SRLGSDLKT-----AGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLAN
        L+ KT E A  LLE+MA+N+YQW S+RS P+KIV G   +D V+AL TQM++L+N
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLAN

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]3.6e-2039.23Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSS--------------SSSMSSCSR---------------LGSDLKT-----AGGT
        DKA+ W QS+P GSITTWD L Q FL K+FPP+K+ +LR EI +                   +  C +               L    +T     AGG 
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSS--------------SSSMSSCSR---------------LGSDLKT-----AGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAAVLASRSQE
        L++KT E A  LL+D+ATNSYQW SERS  KK VAG+ E+D ++AL  Q++SL N  +  +  G+ Q+++S    +S  QE
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAAVLASRSQE

XP_023881727.1 uncharacterized protein LOC111994101 [Quercus suber]2.8e-2044.37Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSCSRLGSDL---KTAGGTLLSKTVENARILLEDMATNSYQWASERSTPK
        DKAR WLQS+ PGSIT+W  + +  L KFFP AKT +LR+EIG+   +   S  +          +GGTL+SKT E A  LLE+MA+N+YQW +ER+  K
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSCSRLGSDL---KTAGGTLLSKTVENARILLEDMATNSYQWASERSTPK

Query:  KIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAA
        K VAG+ E++  +AL  Q++SL++     +     QS E  A
Subjt:  KIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAA

XP_030923419.1 uncharacterized protein LOC115950351 [Quercus lobata]8.6e-2248.76Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSCSRLGSDLKTAGGTLLSKTVENARILLEDMATNSYQWASERSTPKKIV
        DKAR WLQS+ PGSIT+W  + + FL KFFPPAKT +LR+EIG+   +       + +    +G TL+SKT E    LLE+MA+N+YQW +ER+  KK V
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSCSRLGSDLKTAGGTLLSKTVENARILLEDMATNSYQWASERSTPKKIV

Query:  AGVFEIDNVSALQTQMSSLAN
        AG+ E+D  +AL  Q++SL++
Subjt:  AGVFEIDNVSALQTQMSSLAN

TrEMBL top hitse value%identityAlignment
A0A3S3N117 Retrotrans_gag domain-containing protein5.3e-1738.96Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSS--------------CSRLGSDL--------------------KTAGGT
        DKA+ WL S+P  +ITTWD L + FL KFFPP KTVK+R +I   + + M S              C   G  L                       GGT
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSS--------------CSRLGSDL--------------------KTAGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLA
        L+ K+ E A  L+E+MATN+YQW S+    KKI  GV E+D++SAL  Q+++L+
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLA

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129454.0e-1741.56Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI-------GRSSSSS-------MSSCSRLG---------------SDLKT-----AGGT
        DKA+ WL S+P GSITTW+ L Q FL KFFPPAKT K+R +I       G S   +       +  C   G                 +KT     AGG 
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI-------GRSSSSS-------MSSCSRLG---------------SDLKT-----AGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLA
        L+SK   +A  LLE+MA+N+YQW SERS  +K V G +EID +  L TQ+++L+
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLA

A0A6J0ZYV0 uncharacterized protein LOC1104134134.0e-1741.56Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI-------GRSSSSS-------MSSCSRLG---------------SDLKT-----AGGT
        DKA+ WL S+P GSITTW+ L Q FL KFFPPAKT K+R +I       G S   +       +  C   G                 +KT     AGG 
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI-------GRSSSSS-------MSSCSRLG---------------SDLKT-----AGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLA
        L+SK   +A  LLE+MA+N+YQW SERS  +K V G +EID +  L TQ+++L+
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLA

Q2AA09 Retrotransposon gag protein1.8e-1737.27Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI---GRSSSSSMSSCSRLGSDL-------------------------------KTAGGT
        DKAR WLQS+PPGSITTWD L +AFL K+FPP+KT +LR +I    +    S+        DL                                 AGG 
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI---GRSSSSSMSSCSRLGSDL-------------------------------KTAGGT

Query:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFS
        L++K+V +A+ L+EDMA N +QW+ ERS PKK  +G +++D +  + +++ +L   F K S
Subjt:  LLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFS

U5CUI2 Retrotrans_gag domain-containing protein2.6e-1633.15Show/hide
Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI-------GRSSS----------------------------SSMSSCSRLGSDLKTAGG
        D+AR WL ++PP S+T W+ L + FL+K+FPP +  K R+EI         S+S                            + +++ SR+  D  +A G
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEI-------GRSSS----------------------------SSMSSCSRLGSDLKTAGG

Query:  TLLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAAVLAS
         +LSK+   A  +LE +A+N+YQW++ R+   + VAGV E+D ++AL  QM+S+ N     S  G+A++I+ AA + S
Subjt:  TLLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAAVLAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACTCGAATCCGTCGACACCGCAAGTCTCCGTCCCGATCTACACGTCTTCTCACTTGATTCTCGCTCCTTGGCCTTGCTTCAGCTCCCACCAAAGGCGTGCTTA
CTTGAGCTTGGCTATAGGGAAATCTGATGCTTACGATGATAAAGCACGAGATTGGTTGCAGTCTATTCCCCCTGGGAGCATCACCACCTGGGATGCTTTAGTCCAGGCAT
TTCTGAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGCGTTCCAGTAGCAGTTCGATGAGCAGTTGTTCGAGGCTTGGGAGCGATTTAAAG
ACTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGGCATCGGAGCGGTCTACACCTAAAAA
GATTGTTGCTGGAGTGTTCGAGATTGACAATGTAAGTGCACTTCAGACCCAGATGTCTTCCCTGGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAA
TTGAATCAGCTGCCGTTTTAGCATCTAGATCTCAGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACTCGAATCCGTCGACACCGCAAGTCTCCGTCCCGATCTACACGTCTTCTCACTTGATTCTCGCTCCTTGGCCTTGCTTCAGCTCCCACCAAAGGCGTGCTTA
CTTGAGCTTGGCTATAGGGAAATCTGATGCTTACGATGATAAAGCACGAGATTGGTTGCAGTCTATTCCCCCTGGGAGCATCACCACCTGGGATGCTTTAGTCCAGGCAT
TTCTGAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGCGTTCCAGTAGCAGTTCGATGAGCAGTTGTTCGAGGCTTGGGAGCGATTTAAAG
ACTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGGCATCGGAGCGGTCTACACCTAAAAA
GATTGTTGCTGGAGTGTTCGAGATTGACAATGTAAGTGCACTTCAGACCCAGATGTCTTCCCTGGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAA
TTGAATCAGCTGCCGTTTTAGCATCTAGATCTCAGGAGTAG
Protein sequenceShow/hide protein sequence
MENSNPSTPQVSVPIYTSSHLILAPWPCFSSHQRRAYLSLAIGKSDAYDDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGRSSSSSMSSCSRLGSDLK
TAGGTLLSKTVENARILLEDMATNSYQWASERSTPKKIVAGVFEIDNVSALQTQMSSLANAFMKFSGTGSAQSIESAAVLASRSQE