CuGenDBv2

Lag0038983 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038983
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:32577454..32578227
RNA-Seq Expression: Lag0038983
Synteny: Lag0038983
Gene Ontology terms: GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 5.4e-48 | % identity: 42.53
Query:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW
        SST S     +T  S+ I +    G+K ++VKL+++ F+ WK Q+ T L  + L +F++ E+  P+K ++S   SS      PNP Y+ W RQD LISSW
Subjt:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW

Query:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI
         LGSMS +IL+++L C++A +IW+ L   FSSR LAQ +Q + KL  IKKG++ LK+YFLKI   VD+LA+  K  S DDH+++IL GLGS+Y S + VI
Subjt:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI

Query:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH
        + +  +P +Q+V SLLLTQE++ E  + + ++ +LPSVN          +    TN   +H
Subjt:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e value: 3.6e-44 | % identity: 41.45
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTIL---SGTDSSVKIPNPDYEAWVRQDNLISSWFLGS
        SS++    + +T++      G+K ++VKL ++NF+ WK Q+ T L  + L +F + E   P+K +    S + S+ + PNP+Y+ W R + LIS W LGS
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTIL---SGTDSSVKIPNPDYEAWVRQDNLISSWFLGS

Query:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE
        MS +IL++++ C++A +IW  L   FSSR LAQ +Q + KL  IKKG++SLK+YFLKI+  VD+LA+  K  S DDH+++IL GLG +Y S + +I+ + 
Subjt:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE

Query:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSV
         +P +Q+V SLLLTQE++ E  + + ++ +LP V
Subjt:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSV

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 5.4e-48 | % identity: 42.53
Query:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW
        SST S     +T  S+ I +    G+K ++VKL+++ F+ WK Q+ T L  + L +F++ E+  P+K ++S   SS      PNP Y+ W RQD LISSW
Subjt:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW

Query:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI
         LGSMS +IL+++L C++A +IW+ L   FSSR LAQ +Q + KL  IKKG++ LK+YFLKI   VD+LA+  K  S DDH+++IL GLGS+Y S + VI
Subjt:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI

Query:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH
        + +  +P +Q+V SLLLTQE++ E  + + ++ +LPSVN          +    TN   +H
Subjt:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 1.7e-62 | % identity: 54.22
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTD---SSVKIPNPDYEAWVRQDNLISSWFLGS
        SS   SD     Q  KTINPGSK +IV+L+++N + WK Q+RT L G+GL  +ID     P + + +  D   SS    NP Y  W++QD LIS+W LGS
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTD---SSVKIPNPDYEAWVRQDNLISSWFLGS

Query:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE
        M+ DILS++LDC++A +IW VL   F+SR LA+V+QL+ KLE  KKGNLSLKDYFLKIKNLVDSLA AGKK S +DH++HIL GLG E+D+ + VIT + 
Subjt:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE

Query:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQ
            LQ+V SLLL QE R ERN  IN+DGSLPSVN T   +S      Q
Subjt:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQ

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | e value: 1.2e-42 | % identity: 45.12
Query:  KIQVRTTLHGHGLGHFIDDEATIPTKTILSG---TDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQL
        K QV T + GHGL  +ID +   P++ I +G   T S+ + PNP+Y  W++QD LIS W LGSMS +ILS++LDC    +IW +L   F+SRNLA+V+QL
Subjt:  KIQVRTTLHGHGLGHFIDDEATIPTKTILSG---TDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQL

Query:  RTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQ--ENRIERNTAINTDGSLPSVN
        ++KLE +KKG+++LK+YFLKIKNLVDSLA AGK+   DDH++HIL  LG E+DS V VI+ ++S   +Q+  S   +     +++ +T  ++  S P+ +
Subjt:  RTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQ--ENRIERNTAINTDGSLPSVN

Query:  YTNVQASNTKQQQQL
           V   +T Q Q +
Subjt:  YTNVQASNTKQQQQL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.6e-48 | % identity: 42.53
Query:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW
        SST S     +T  S+ I +    G+K ++VKL+++ F+ WK Q+ T L  + L +F++ E+  P+K ++S   SS      PNP Y+ W RQD LISSW
Subjt:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW

Query:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI
         LGSMS +IL+++L C++A +IW+ L   FSSR LAQ +Q + KL  IKKG++ LK+YFLKI   VD+LA+  K  S DDH+++IL GLGS+Y S + VI
Subjt:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI

Query:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH
        + +  +P +Q+V SLLLTQE++ E  + + ++ +LPSVN          +    TN   +H
Subjt:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH

A0A5A7UB21 Keratin, type II cytoskeletal 1-like | e value: 1.7e-44 | % identity: 41.45
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTIL---SGTDSSVKIPNPDYEAWVRQDNLISSWFLGS
        SS++    + +T++      G+K ++VKL ++NF+ WK Q+ T L  + L +F + E   P+K +    S + S+ + PNP+Y+ W R + LIS W LGS
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTIL---SGTDSSVKIPNPDYEAWVRQDNLISSWFLGS

Query:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE
        MS +IL++++ C++A +IW  L   FSSR LAQ +Q + KL  IKKG++SLK+YFLKI+  VD+LA+  K  S DDH+++IL GLG +Y S + +I+ + 
Subjt:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE

Query:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSV
         +P +Q+V SLLLTQE++ E  + + ++ +LP V
Subjt:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.6e-48 | % identity: 42.53
Query:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW
        SST S     +T  S+ I +    G+K ++VKL+++ F+ WK Q+ T L  + L +F++ E+  P+K ++S   SS      PNP Y+ W RQD LISSW
Subjt:  SSTES-----DTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVK---IPNPDYEAWVRQDNLISSW

Query:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI
         LGSMS +IL+++L C++A +IW+ L   FSSR LAQ +Q + KL  IKKG++ LK+YFLKI   VD+LA+  K  S DDH+++IL GLGS+Y S + VI
Subjt:  FLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI

Query:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH
        + +  +P +Q+V SLLLTQE++ E  + + ++ +LPSVN          +    TN   +H
Subjt:  TEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFH

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 8.3e-63 | % identity: 54.22
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTD---SSVKIPNPDYEAWVRQDNLISSWFLGS
        SS   SD     Q  KTINPGSK +IV+L+++N + WK Q+RT L G+GL  +ID     P + + +  D   SS    NP Y  W++QD LIS+W LGS
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTD---SSVKIPNPDYEAWVRQDNLISSWFLGS

Query:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE
        M+ DILS++LDC++A +IW VL   F+SR LA+V+QL+ KLE  KKGNLSLKDYFLKIKNLVDSLA AGKK S +DH++HIL GLG E+D+ + VIT + 
Subjt:  MSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKE

Query:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQ
            LQ+V SLLL QE R ERN  IN+DGSLPSVN T   +S      Q
Subjt:  STPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNYTNVQASNTKQQQQ

A0A6J1DSS1 uncharacterized protein LOC111023586 | e value: 5.6e-43 | % identity: 45.12
Query:  KIQVRTTLHGHGLGHFIDDEATIPTKTILSG---TDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQL
        K QV T + GHGL  +ID +   P++ I +G   T S+ + PNP+Y  W++QD LIS W LGSMS +ILS++LDC    +IW +L   F+SRNLA+V+QL
Subjt:  KIQVRTTLHGHGLGHFIDDEATIPTKTILSG---TDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQL

Query:  RTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQ--ENRIERNTAINTDGSLPSVN
        ++KLE +KKG+++LK+YFLKIKNLVDSLA AGK+   DDH++HIL  LG E+DS V VI+ ++S   +Q+  S   +     +++ +T  ++  S P+ +
Subjt:  RTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQ--ENRIERNTAINTDGSLPSVN

Query:  YTNVQASNTKQQQQL
           V   +T Q Q +
Subjt:  YTNVQASNTKQQQQL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 1.3e-09 | % identity: 27.5
Query:  SKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVLNA
        +K NI   D E +  WK ++R               A +  + +L   D    +PN   ++W + +    S  +  +S+  L+      TA  I + L+A
Subjt:  SKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVLNA

Query:  RFSSRNLAQVLQLRTKLETIK-KGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI-TEKESTPPLQKVYSLLLTQENRIERN
         +  ++LA  L LR +L ++K    +SL  +F     L+  L AAG K  + D + H+L  L S YD  +  I T  E    L  V + LL QE +I+ +
Subjt:  RFSSRNLAQVLQLRTKLETIK-KGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVI-TEKESTPPLQKVYSLLLTQENRIERN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.6e-09 | % identity: 25.64
Query:  GSKTNIVKLDEEN-FMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVL
        G K  + K + +N F  W+ ++R  L   GL   +D ++  P          ++K      E W   D   +S     +S+D+++ ++D +TA  IW  L
Subjt:  GSKTNIVKLDEEN-FMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLDCETALDIWKVL

Query:  NARFSSRNLAQVLQLRTKLETIKKG-NLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQE
         + + S+ L   L L+ +L  +      +   +      L+  LA  G K  ++D  I +L  L S YD+    I   ++T  L+ V S LL  E
Subjt:  NARFSSRNLAQVLQLRTKLETIKKG-NLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.4e-27 | % identity: 31.37
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSN
        ++  E   LN+T IL      + +N+ KL   N++ W  QV     G+ L  F+D   T+P  TI  GTD++ ++ NPDY  W RQD LI S  LG++S 
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSN

Query:  DILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTP
         +   V    TA  IW+ L   +++ +   V QLRT+L+   KG  ++ DY   +    D LA  GK    D+ V  +L+ L  EY   +  I  K++ P
Subjt:  DILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTP

Query:  PLQKVYSLLLTQENRIERNTAINTDGSLP----SVNYTNVQASNTKQQQQLTNVY
         L +++  LL  E++I    A+++   +P    +V++ N   +N        N Y
Subjt:  PLQKVYSLLLTQENRIERNTAINTDGSLP----SVNYTNVQASNTKQQQQLTNVY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.1e-18 | % identity: 28.05
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSN
        ++  E   L +T IL      + +N+ KL   N++ W  QV     G+ L  F+D    +P  TI  GTD+  ++ NPDY  W RQD LI S  LG++S 
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSN

Query:  DILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTP
         +   V    TA  IW+ L   +++ +   V QLR                        D LA  GK    D+ V  +L+ L  +Y   +  I  K++ P
Subjt:  DILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTP

Query:  PLQKVYSLLLTQENRIERNTAINTDGSLP-SVNYTNVQASNTKQQQ
         L +++  L+ +E+++    A+N+   +P + N    + +NT + Q
Subjt:  PLQKVYSLLLTQENRIERNTAINTDGSLP-SVNYTNVQASNTKQQQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 1.1e-09 | % identity: 29.38
Query:  SSSTESDTLNSTQILKTINPGSKTNIVKL--DEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSM
        S S  SD  +   +   I+  S  +I KL  DE+N++ WKI+ R+ L       FID   T+P     S          P Y+ W + + ++  W + SM
Subjt:  SSSTESDTLNSTQILKTINPGSKTNIVKL--DEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSM

Query:  SNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNL
        ++ +L  V+  ETA  +W+ L   F      ++ QLR +L T+++G  S+++YF K+  +
Subjt:  SNDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.5e-11 | % identity: 23.23
Query:  ILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMS-NDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLA
        ++   D ++   N +   W ++D ++     G+++        +   T+ DIW  +  +F +   A+ L+L ++L T   G++ + DY+ K+K L DSL 
Subjt:  ILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMS-NDILSEVLDCETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLA

Query:  AAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQENRIER
              +  + V+++L GL  ++D+ + VI  ++  P      ++L  +E+R++R
Subjt:  AAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQENRIER

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 2.4e-14 | % identity: 29.17
Query:  ILSGTDSSVKIPNPDYE-AWVRQDNLISSWFLGSMSNDILSEVLDCE-TALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSL
        +L   D S   P P  E  W  +D L+  W  G++++ +L  ++    TA D+W  L   F     A+ LQ   +L T    +LS+ +Y  K+K+L D L
Subjt:  ILSGTDSSVKIPNPDYE-AWVRQDNLISSWFLGSMSNDILSEVLDCE-TALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSL

Query:  AAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNY---TNVQASNTKQQQQLTNVY
               S    V+H+L GL  +YD  + VI  K   P   +  S+LL +E+R+       +  SL   N+   +NV  +  +QQ++    Y
Subjt:  AAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQENRIERNTAINTDGSLPSVNY---TNVQASNTKQQQQLTNVY


Sequences
CDS sequence
ATGGATTCTTCTTCCACAGAATCCGATACTCTGAATTCAACACAAATCTTGAAGACGATAAATCCTGGCAGCAAGACTAACATCGTCAAGTTGGACGAAGAGAATTTTAT
GCATTGGAAAATTCAAGTTCGAACAACACTGCATGGTCATGGACTAGGGCACTTTATTGATGATGAAGCTACAATCCCAACGAAGACGATTCTGTCTGGTACCGATTCCT
CAGTCAAAATTCCTAATCCTGATTATGAAGCTTGGGTGCGACAAGACAATCTCATCTCTTCTTGGTTCTTGGGGTCAATGTCTAATGATATACTCTCAGAAGTGTTAGAT
TGTGAAACAGCCCTAGATATCTGGAAAGTGTTGAATGCAAGATTTTCATCTCGGAATCTTGCTCAAGTTTTGCAACTCAGAACCAAATTGGAAACCATAAAGAAAGGTAA
TCTTTCTCTCAAAGATTACTTTCTTAAAATCAAGAATCTTGTTGATTCTTTAGCAGCTGCTGGGAAGAAATTTTCTCAGGATGATCATGTCATACATATCCTCAAAGGCC
TAGGTTCAGAATACGACTCGACGGTCAAAGTAATTACTGAGAAAGAAAGCACACCTCCTTTACAGAAGGTCTACTCTCTTCTTCTTACTCAAGAAAACAGAATTGAAAGG
AATACAGCTATAAACACAGATGGCTCTCTCCCCTCTGTTAATTACACCAACGTTCAGGCTTCAAATACCAAACAACAACAACAGCTCACAAATGTATATCTTTTCCATAT
CTGA
mRNA sequence
ATGGATTCTTCTTCCACAGAATCCGATACTCTGAATTCAACACAAATCTTGAAGACGATAAATCCTGGCAGCAAGACTAACATCGTCAAGTTGGACGAAGAGAATTTTAT
GCATTGGAAAATTCAAGTTCGAACAACACTGCATGGTCATGGACTAGGGCACTTTATTGATGATGAAGCTACAATCCCAACGAAGACGATTCTGTCTGGTACCGATTCCT
CAGTCAAAATTCCTAATCCTGATTATGAAGCTTGGGTGCGACAAGACAATCTCATCTCTTCTTGGTTCTTGGGGTCAATGTCTAATGATATACTCTCAGAAGTGTTAGAT
TGTGAAACAGCCCTAGATATCTGGAAAGTGTTGAATGCAAGATTTTCATCTCGGAATCTTGCTCAAGTTTTGCAACTCAGAACCAAATTGGAAACCATAAAGAAAGGTAA
TCTTTCTCTCAAAGATTACTTTCTTAAAATCAAGAATCTTGTTGATTCTTTAGCAGCTGCTGGGAAGAAATTTTCTCAGGATGATCATGTCATACATATCCTCAAAGGCC
TAGGTTCAGAATACGACTCGACGGTCAAAGTAATTACTGAGAAAGAAAGCACACCTCCTTTACAGAAGGTCTACTCTCTTCTTCTTACTCAAGAAAACAGAATTGAAAGG
AATACAGCTATAAACACAGATGGCTCTCTCCCCTCTGTTAATTACACCAACGTTCAGGCTTCAAATACCAAACAACAACAACAGCTCACAAATGTATATCTTTTCCATAT
CTGA
Protein sequence
MDSSSTESDTLNSTQILKTINPGSKTNIVKLDEENFMHWKIQVRTTLHGHGLGHFIDDEATIPTKTILSGTDSSVKIPNPDYEAWVRQDNLISSWFLGSMSNDILSEVLD
CETALDIWKVLNARFSSRNLAQVLQLRTKLETIKKGNLSLKDYFLKIKNLVDSLAAAGKKFSQDDHVIHILKGLGSEYDSTVKVITEKESTPPLQKVYSLLLTQENRIER
NTAINTDGSLPSVNYTNVQASNTKQQQQLTNVYLFHI
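
The Genome location field of this record can be cross-checked against the sequences above. The sketch below is illustrative only and is not part of CuGenDB: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (GFF-style), which the page does not state. Under that assumption the locus spans 774 bp, which divides evenly into 258 codons; that is consistent with the 257-residue protein sequence plus a stop codon, and it suggests, without confirming, a single uninterrupted coding exon.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper; the format is inferred from this record only.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Genome location exactly as given in the record above.
chrom, start, end = parse_location("chr2:32577454..32578227")
length = span_length(start, end)

# Whole codons in the span, plus any leftover bases.
codons, remainder = divmod(length, 3)
print(chrom, length, codons, remainder)
```

If the database used 0-based or half-open coordinates instead, the computed span would differ by one base and would no longer be a multiple of three, so the remainder is a quick sanity check on the coordinate convention.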