; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005679 (gene) of Snake gourd v1 genome

Gene IDTan0005679
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
Genome locationLG10:54764945..54767688
RNA-Seq ExpressionTan0005679
SyntenyTan0005679
Gene Ontology termsNA
InterPro domainsIPR009027 - Ribosomal protein L9/RNase H1, N-terminal
IPR011320 - Ribonuclease H1, N-terminal
IPR024752 - Myb/SANT-like domain
IPR037056 - Ribonuclease H1, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa]3.3e-6948.11Show/hide
Query:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV RR FARSGETVSR+FN VLN VL+LH++LLK Q  S TH+   +  +   + +  S+ +KH WT
Subjt:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT

Query:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK
         +ED  LVE L+ LV  G WR DNG F+PGYL  +QK++ EK+  S ++    ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK
Subjt:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK

Query:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS
         H NA  + NKPFP++ DL  VFG+DRATG   +TP+EM S  A   EE ++ +  +DF       +E         +D+P  PTSM +  G        
Subjt:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS

Query:  KRKRSSFQTELIDVVRTT
         +KR S+  +L+D  R T
Subjt:  KRKRSSFQTELIDVVRTT

KAG6523854.1 hypothetical protein ZIOFF_013741 [Zingiber officinale]1.2e-6049.25Show/hide
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHA---------LMSD
        MDRR+   LC +L + G L   +N+ I E+V+ FLHI+AH+VKNR+++RQ ARSGETVSR F+ VLN+VL+LH++LLK     P +          L  +
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHA---------LMSD

Query:  GKVLSIMAGT-----------SRNSKHTWTKVEDARLVESLVSL-VHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNT-IDCKVRTLKKQYNAIAE
        GK+  I+              ++N+KH WTK EDA LV+ LV L   + W+S+NG FR GYL HL+K++A KLP+S L+    I+ + + LK+Q++AI +
Subjt:  GKVLSIMAGT-----------SRNSKHTWTKVEDARLVESLVSL-VHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNT-IDCKVRTLKKQYNAIAE

Query:  MLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPME
        ML N  SGFGWN+  KC+   K+VFD WVKSH  A+G+RNK FPH DDL FV+GKDRATG  AETP +
Subjt:  MLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPME

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]3.1e-6744.86Show/hide
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSDGKVLSI---
        MDRR FTILCTML++ GGL  TQ VD++EMVVIFLHI+AHDVKNRV RR  ARSGETVSR+FN+VLNAVL+LH++LLK Q    TH+   DG  + +   
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSDGKVLSI---

Query:  --------------------------------------MAGT-SRNSKHTWTKVEDARLVESLVSLV-HNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCL
                                              MA T S+ +KH WT +ED  LVE L+ LV   GWR+DNG F+ GYL                
Subjt:  --------------------------------------MAGT-SRNSKHTWTKVEDARLVESLVSLV-HNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCL

Query:  EQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE
                     KQY AIAEM+  ACSGFGWNE  KC+E EK VFD WVK H NA G+ NKPFP++ DL  VFG+DRATG   +TP+EM+S  A   EE
Subjt:  EQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE

Query:  -EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTM
         ++ +  +DF       +E         +D+P  PTSM +  G SSR     +KR S+  +L+D  R +M
Subjt:  -EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTM

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]1.1e-6949.06Show/hide
Query:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV RR FARSGETVSR+FN VLN VL+LH++LLK Q  S TH+   +  +   + +  S+ +KH WT
Subjt:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT

Query:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK
         +ED  LVE L+ LV  G WR DNG F+PGYL  +QK++ EK+  S ++    ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK
Subjt:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK

Query:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS
         H NA  + NKPFP++ DL  VFG+DRATG   +TP+EM S  A   EE ++ +  +DF       +E         +D+P  PTSM +  G SSR    
Subjt:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS

Query:  KRKRSSFQTELIDVVRTT
         +KR S+  +L+D  R T
Subjt:  KRKRSSFQTELIDVVRTT

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida]2.8e-6056.68Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVE
        M    + SKH W+KVEDA+LVE+L+ LV  GWRSDNG FRPGYLQHL+++L EK+P   L QNTI+CKVR+LKKQYN ++EMLS   SGF WNEEFKCV+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVE

Query:  AEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNT
         E+E+FD WV SH NA  M NKPFPHYDD + VFGKDR  G  +E P  MA++A  + E+EIRLGSQD    E R  E+    D  +++  +  T   + 
Subjt:  AEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNT

Query:  FGMSSRCIGSKRKRSSF
           SSR  GSKRKR SF
Subjt:  FGMSSRCIGSKRKRSSF

TrEMBL top hitse value%identityAlignment
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein1.6e-6948.11Show/hide
Query:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV RR FARSGETVSR+FN VLN VL+LH++LLK Q  S TH+   +  +   + +  S+ +KH WT
Subjt:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT

Query:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK
         +ED  LVE L+ LV  G WR DNG F+PGYL  +QK++ EK+  S ++    ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK
Subjt:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK

Query:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS
         H NA  + NKPFP++ DL  VFG+DRATG   +TP+EM S  A   EE ++ +  +DF       +E         +D+P  PTSM +  G        
Subjt:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS

Query:  KRKRSSFQTELIDVVRTT
         +KR S+  +L+D  R T
Subjt:  KRKRSSFQTELIDVVRTT

A0A5A7SWD8 Retrotransposon protein2.5e-5932.42Show/hide
Query:  NTIQRLEHRSPYDRHQIRHLNFFRLIYEIDLCCRESTRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNF
        N  +R+ H     RH+IR L +FR+I+  DL CR+STRMDRR F ILC +L++  GL  T+ VD+EEMV +FLHILAHDVKNRVI+R+F RSGET+SR+F
Subjt:  NTIQRLEHRSPYDRHQIRHLNFFRLIYEIDLCCRESTRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNF

Query:  NSVLNAVLQLHDLLLKNQNQSP-------------------------------------------THAL-------------------MSDGKVLS----
        N VL AV++LHD LLK     P                                           T+ L                    +D ++L     
Subjt:  NSVLNAVLQLHDLLLKNQNQSP-------------------------------------------THAL-------------------MSDGKVLS----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------IMAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEM
                          M  +SR  KHTWTK E+A LVE LV+    GWRSDNG FRPGYL  L +M+A K+P   +  +TID +++ +K+ ++A+AEM
Subjt:  -----------------IMAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEM

Query:  LSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEEEIRLGSQDFMGVEQRTMENLRI
            CSGFGWN+E KC+ AEKEVFD W  SH  A G+ NK F HYD+L++VFGKDRATG  AE+  ++ S+     +      +      +   M +L +
Subjt:  LSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEEEIRLGSQDFMGVEQRTMENLRI

Query:  GDIGEDDLPDPPTS-MRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTMD
         ++  DDL +  T+ +     +SS   GSKRKR    T+  D+VRT ++
Subjt:  GDIGEDDLPDPPTS-MRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTMD

A0A5D3C7T4 Uncharacterized protein1.5e-6744.86Show/hide
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSDGKVLSI---
        MDRR FTILCTML++ GGL  TQ VD++EMVVIFLHI+AHDVKNRV RR  ARSGETVSR+FN+VLNAVL+LH++LLK Q    TH+   DG  + +   
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSDGKVLSI---

Query:  --------------------------------------MAGT-SRNSKHTWTKVEDARLVESLVSLV-HNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCL
                                              MA T S+ +KH WT +ED  LVE L+ LV   GWR+DNG F+ GYL                
Subjt:  --------------------------------------MAGT-SRNSKHTWTKVEDARLVESLVSLV-HNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCL

Query:  EQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE
                     KQY AIAEM+  ACSGFGWNE  KC+E EK VFD WVK H NA G+ NKPFP++ DL  VFG+DRATG   +TP+EM+S  A   EE
Subjt:  EQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE

Query:  -EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTM
         ++ +  +DF       +E         +D+P  PTSM +  G SSR     +KR S+  +L+D  R +M
Subjt:  -EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTM

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein5.5e-7049.06Show/hide
Query:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV RR FARSGETVSR+FN VLN VL+LH++LLK Q  S TH+   +  +   + +  S+ +KH WT
Subjt:  MLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSD-GKVLSIMAGTSRNSKHTWT

Query:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK
         +ED  LVE L+ LV  G WR DNG F+PGYL  +QK++ EK+  S ++    ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK
Subjt:  KVEDARLVESLVSLVHNG-WRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQN-TIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK

Query:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS
         H NA  + NKPFP++ DL  VFG+DRATG   +TP+EM S  A   EE ++ +  +DF       +E         +D+P  PTSM +  G SSR    
Subjt:  SHTNAMGMRNKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEE-EIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGS

Query:  KRKRSSFQTELIDVVRTT
         +KR S+  +L+D  R T
Subjt:  KRKRSSFQTELIDVVRTT

A0A803QNC5 Uncharacterized protein1.3e-5836.05Show/hide
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLK----------------------
        MDRRTF ILC  LK+TGGL  ++NVD+EEMV IFLHI+AHDVKNR++RRQFARSGETVSR+FN VLNA+L LHDLLLK                      
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLK----------------------

Query:  ---------------------NQNQSPTHAL-------------------MSDGKVL-------------------------------------------
                              +N+  T+ L                    +D +VL                                           
Subjt:  ---------------------NQNQSPTHAL-------------------MSDGKVL-------------------------------------------

Query:  -----------------------------------------------------SIMAGTSRNS-----KHTWTKVEDARLVESLVSLVHNG-WRSDNGIF
                                                              IM  TS+++     KH WT ++D++LVE LV + ++G W++DNG F
Subjt:  -----------------------------------------------------SIMAGTSRNS-----KHTWTKVEDARLVESLVSLVHNG-WRSDNGIF

Query:  RPGYLQHLQKMLAEKLPNSCLE-QNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDR
        +PGYLQ L+KM+ +++PNS ++ Q  ID +++ LK+QY AI++ML  + SGFGWNE+ KCV A+K VFD WVKSH  A G+ +KPFP+YD+LA V+GKDR
Subjt:  RPGYLQHLQKMLAEKLPNSCLE-QNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDR

Query:  ATGIGAETPMEMASSAAEQMEEEIRLGSQD
        ATG GA       S   +++ EEI  G  D
Subjt:  ATGIGAETPMEMASSAAEQMEEEIRLGSQD

SwissProt top hitse value%identityAlignment
Q9KEI9 Ribonuclease H1.3e-0451.02Show/hide
Query:  MGKAKFYVVFIDRNPGIYKTWHEYHRQVNEYRGAIHQSYASFAEAEYAF
        M K+K+YVV+  R PGIY +W     QV  Y GA  +SY S  EAE AF
Subjt:  MGKAKFYVVFIDRNPGIYKTWHEYHRQVNEYRGAIHQSYASFAEAEYAF

Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein1.4e-0929.58Show/hide
Query:  GTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLA---EKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCV
        G  +   + WT  E     + L+ L+   WR  +GI   G L    K+L    ++L  +   +N +  +++ LK  Y +  + L    SGFGW+ E K  
Subjt:  GTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLA---EKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATG
         A  EV+  ++K+H N   M+ +   H++DL  +FG   ATG
Subjt:  EAEKEVFDAWVKSHTNAMGMRNKPFPHYDDLAFVFGKDRATG

AT1G43722.1 unknown protein1.1e-0936.89Show/hide
Query:  NFFRLIYEIDLCCRESTRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQ
        N +R + +    C +  RM    FT LC ML++   L PT N+ IEE V +FL I  H+   R +  +F R+ ETV R F  VL A     +LL  +  +
Subjt:  NFFRLIYEIDLCCRESTRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQ

Query:  SPT
        +PT
Subjt:  SPT

AT4G02210.1 unknown protein1.2e-0824.8Show/hide
Query:  TWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV
        TW    D   ++ ++     G     G+FR      +  +   K   S  + + +  + ++L++Q+NAI  +L +   GF W+ E + V A+  V+  ++
Subjt:  TWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV

Query:  KSHTNAMGMRNKPFPHYDDLAFVFG
        K+H +A     +P P+Y DL  + G
Subjt:  KSHTNAMGMRNKPFPHYDDLAFVFG

AT4G02210.2 unknown protein1.2e-0824.8Show/hide
Query:  TWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV
        TW    D   ++ ++     G     G+FR      +  +   K   S  + + +  + ++L++Q+NAI  +L +   GF W+ E + V A+  V+  ++
Subjt:  TWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV

Query:  KSHTNAMGMRNKPFPHYDDLAFVFG
        K+H +A     +P P+Y DL  + G
Subjt:  KSHTNAMGMRNKPFPHYDDLAFVFG

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.8e-1242.05Show/hide
Query:  CRESTRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQL-HDLLLKNQN
        C E+ RMD+  F  LC +L++ G L  T  + IE  + IFL I+ H+++ R ++  F  SGET+SR+FN+VLNAV+ +  D    N N
Subjt:  CRESTRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQL-HDLLLKNQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAAAGCGAAGTTCTACGTGGTCTTCATCGATCGTAACCCAGGGATTTACAAGACATGGCATGAATATCATAGACAAGTAAATGAATATAGGGGAGCAATCCACCA
ATCCTATGCATCTTTTGCAGAGGCAGAGTATGCATTTACTGAATTTATCATTCGGAACAATGATCATTCCACTACGATCGTTCCTCTTGGACATCTGTGTAATGTAGCTC
CCATTAATACTATTCAACGATTAGAGCACCGATCACCATACGACCGACATCAAATTAGGCATTTGAATTTCTTTCGCCTCATTTACGAAATTGACCTATGTTGTCGCGAA
AGCACAAGAATGGATAGAAGGACCTTTACCATCTTGTGTACTATGCTGAAGTCAACTGGCGGTTTAGTACCGACACAGAATGTTGATATCGAAGAAATGGTTGTTATATT
CTTGCACATCTTAGCACACGATGTTAAGAATCGGGTGATTCGCAGGCAATTTGCCCGGTCCGGTGAGACGGTTTCTAGAAACTTCAACTCGGTGCTAAATGCAGTTTTAC
AGCTACATGATTTGTTATTAAAAAACCAGAACCAATCACCAACACATGCACTGATGAGCGATGGAAAAGTTTTGAGCATCATGGCAGGTACTTCGCGAAACTCCAAGCAT
ACGTGGACGAAGGTGGAGGATGCGAGGTTGGTGGAGTCACTTGTATCTTTAGTACACAATGGGTGGCGATCTGACAACGGGATCTTCAGGCCTGGCTATTTACAACATCT
CCAGAAGATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGATTGCAAGGTCAGAACTCTCAAAAAACAATACAATGCTATTGCAGAGATGCTTA
GTAATGCATGTAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGAGCCATACAAACGCAATGGGGATGAGG
AATAAGCCATTTCCGCACTATGATGACCTCGCATTTGTCTTTGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGCAGAACA
AATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACCCTC
CTACTAGCATGCGTAATACATTTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAACGATCATCCTTCCAAACTGAATTAATTGATGTAGTGCGCACAACAATGGAT
ATGCATACCAATCACATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAAAGCGAAGTTCTACGTGGTCTTCATCGATCGTAACCCAGGGATTTACAAGACATGGCATGAATATCATAGACAAGTAAATGAATATAGGGGAGCAATCCACCA
ATCCTATGCATCTTTTGCAGAGGCAGAGTATGCATTTACTGAATTTATCATTCGGAACAATGATCATTCCACTACGATCGTTCCTCTTGGACATCTGTGTAATGTAGCTC
CCATTAATACTATTCAACGATTAGAGCACCGATCACCATACGACCGACATCAAATTAGGCATTTGAATTTCTTTCGCCTCATTTACGAAATTGACCTATGTTGTCGCGAA
AGCACAAGAATGGATAGAAGGACCTTTACCATCTTGTGTACTATGCTGAAGTCAACTGGCGGTTTAGTACCGACACAGAATGTTGATATCGAAGAAATGGTTGTTATATT
CTTGCACATCTTAGCACACGATGTTAAGAATCGGGTGATTCGCAGGCAATTTGCCCGGTCCGGTGAGACGGTTTCTAGAAACTTCAACTCGGTGCTAAATGCAGTTTTAC
AGCTACATGATTTGTTATTAAAAAACCAGAACCAATCACCAACACATGCACTGATGAGCGATGGAAAAGTTTTGAGCATCATGGCAGGTACTTCGCGAAACTCCAAGCAT
ACGTGGACGAAGGTGGAGGATGCGAGGTTGGTGGAGTCACTTGTATCTTTAGTACACAATGGGTGGCGATCTGACAACGGGATCTTCAGGCCTGGCTATTTACAACATCT
CCAGAAGATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGATTGCAAGGTCAGAACTCTCAAAAAACAATACAATGCTATTGCAGAGATGCTTA
GTAATGCATGTAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGAGCCATACAAACGCAATGGGGATGAGG
AATAAGCCATTTCCGCACTATGATGACCTCGCATTTGTCTTTGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGCAGAACA
AATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACCCTC
CTACTAGCATGCGTAATACATTTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAACGATCATCCTTCCAAACTGAATTAATTGATGTAGTGCGCACAACAATGGAT
ATGCATACCAATCACATGTAA
Protein sequenceShow/hide protein sequence
MGKAKFYVVFIDRNPGIYKTWHEYHRQVNEYRGAIHQSYASFAEAEYAFTEFIIRNNDHSTTIVPLGHLCNVAPINTIQRLEHRSPYDRHQIRHLNFFRLIYEIDLCCRE
STRMDRRTFTILCTMLKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIRRQFARSGETVSRNFNSVLNAVLQLHDLLLKNQNQSPTHALMSDGKVLSIMAGTSRNSKH
TWTKVEDARLVESLVSLVHNGWRSDNGIFRPGYLQHLQKMLAEKLPNSCLEQNTIDCKVRTLKKQYNAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAMGMR
NKPFPHYDDLAFVFGKDRATGIGAETPMEMASSAAEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDPPTSMRNTFGMSSRCIGSKRKRSSFQTELIDVVRTTMD
MHTNHM