CuGenDBv2

Tan0016947 (gene) of Snake gourd v1 genome

Gene ID: Tan0016947
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ribosomal RNA-processing protein 8
Genome location: LG11:5787079..5805330
RNA-Seq Expression: Tan0016947
Synteny: Tan0016947
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR001245 - Serine-threonine/tyrosine-protein kinase, catalytic domain
IPR044799 - NAC domain-containing protein SOG1-like
IPR042036 - Ribosomal RNA-processing protein 8, N-terminal domain
IPR036093 - NAC domain superfamily
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
IPR011009 - Protein kinase-like domain superfamily
IPR008271 - Serine/threonine-protein kinase, active site
IPR007823 - Ribosomal RNA processing protein 8
IPR003441 - NAC domain
IPR000719 - Protein kinase domain
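
The genome location in this record is written as a sequence name, a colon, and a start..end range. A minimal sketch of parsing that string is given here, assuming the coordinates are 1-based and inclusive (the page does not state the convention, so the computed span is only valid under that assumption). The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a string such as 'LG11:5787079..5805330' into (sequence id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seq_id, start, end


seq_id, start, end = parse_location("LG11:5787079..5805330")
# Span length, ASSUMING 1-based inclusive coordinates.
span = end - start + 1
print(seq_id, start, end, span)
```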


Homology
GenBank top hits (e value, % identity, alignment)
KAF2292276.1 hypothetical protein GH714_018368 [Hevea brasiliensis] (e value: 0.0e+00, % identity: 59.74)
Query:  TGEEALNYFKEDQVLFDVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAI
        +GEEALNYFKED  LFD+YH+GYQEQMSHWPE PVN+II WLK+   S +VADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDM+NTPLD+SSVDVA+
Subjt:  TGEEALNYFKEDQVLFDVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAI

Query:  FCLSLMGVNYESYLAEAQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR-
        FCLSLMG N+ SYL EA R LKP GWLLIAEVKSRFDP+ GGADP KF KAV +LGF S LKDFSNKMFILLYF+KK+++SSK K+I+WP+LKPCLYKR 
Subjt:  FCLSLMGVNYESYLAEAQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR-

Query:  ----------------------------------------------------------HPVPSPAFL------------------------FAGACKEPI
                                                                  H   +P  +                        F GACKEP+
Subjt:  ----------------------------------------------------------HPVPSPAFL------------------------FAGACKEPI

Query:  MVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGTYRWMAPELY
        MVIVTELLLGGTLRKYLL++RPRCL++H A GFALDIA AMEC+H+HGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGTYRWMAPELY
Subjt:  MVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGTYRWMAPELY

Query:  STVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLRYLSTIPQPE
        STVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+AENLPE+LA I+TSCW EDPNARPNF QIIQMLL YLST+   E
Subjt:  STVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLRYLSTIPQPE

Query:  YVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLRSWILDSRTI
         V    +   ENAVLPPESPGTSSLM A R  +GE P  +ME+K   F S                       W+ G              +W++DSRTI
Subjt:  YVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLRSWILDSRTI

Query:  ARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVS
        ARKVRN +++S+  IKD  A  ECPNCH  IDN+D+  +WPGLPAGVKFDP+D +I++HLAAKC VG+ KP A IDEFIPTL  D+GICYTHPENLPG  
Subjt:  ARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVS

Query:  KDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSK
        KDG+ VHFFH+T NAYATGQRKRRK+ +   S+ +H RWHKTGKTK+V+ENGV KG KKI  LY+S+K GSKP+KS+WV+HQYHLGT+EDEK+GEYV+SK
Subjt:  KDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSK

Query:  ISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDD
        I +QQPKQ   ++D + ++D D L    SPRTP +NPP PPR  KS   +  AD         EE++   A  V  P++  E+DVGC AWLAGESQ +++
Subjt:  ISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDD

Query:  VELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSF--------PLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI
         +   + D+LLC E+ D  SS F++ S LN +S+         ++G+N  ++  N N PCGIADLENL+LDTPPD  LADL F SQ+  F   D I
Subjt:  VELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSF--------PLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI

KAF2308452.1 hypothetical protein GH714_009732 [Hevea brasiliensis] (e value: 6.9e-245, % identity: 63.38)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEPIMVIVTELLLGGTLRKYLL++RPRCLD+H A GFALDIA AMECLHSHGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN RP+AENLPE+LA I+ SCW EDPNARPNFSQIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR
        YLSTI  P  V P  +   ENAVLPPESPGTSSLM A R  +GE    +ME+K  +     S   F                  TG            LR
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR

Query:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT
        +W++D RTIARKV+N +++S+  IKD  A R CPNCH  IDN+D+  +WPGLPAGVKFDP+D E+++HLAAKC  G+ KPHA IDEFIPTL  D+GICYT
Subjt:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT

Query:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE
        HPENLPG  KDG+++HFFHKT NAYATGQRKRRK+ +   S+ EH RWHKTGKTK+V+ENGV KG KKI  LY+SSK GSKP+KS+WV+HQYHLGT+EDE
Subjt:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE

Query:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWL
        KEGEYV+SKI +Q  KQ   ++D + I+D D L   TSPRTP +NPP PPR  KS   + VAD+       KE+++   AS V  P +  E+DV  PAWL
Subjt:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWL

Query:  AGESQGIDDVELHYLEDNLLCNE-----VLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWL
        AGESQ +++ E   + D+LLC E      L +D+S  +S+  +      ++G+N  +   N N PCGIADLENLELDTPPDF LADL F SQ+SI DW+
Subjt:  AGESQGIDDVELHYLEDNLLCNE-----VLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWL

KAF5740414.1 serine/threonine-protein kinase HT1-like [Tripterygium wilfordii] (e value: 2.6e-236, % identity: 58.76)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYL+++RP  +++H A GFALDIA AMECLH+HGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+AENLPE+LA I+TSCW EDPNARPNFSQIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEY-VTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAI-----FFRLLLESLVVAFESSIIWVTGRAE-----
        +L+TI  P + V P  +   EN+VLPPESPGTSSLM   R  + E  N EME++     + FS+           ++S++    S  + V   +E     
Subjt:  YLSTIPQPEY-VTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAI-----FFRLLLESLVVAFESSIIWVTGRAE-----

Query:  -----HAPPTQ------------QDPLRSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAK
             + PP+             +D  R+W++D RTIA+KV+N +  S+  I  C A RECPNCH+ IDN+D+  EWPGLPAGVKF+P+D E+++HLAAK
Subjt:  -----HAPPTQ------------QDPLRSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAK

Query:  CTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSL
        C VGN KPH LIDEFIPTLE D+GICYTHP+ LPG  KDG+ V+FFH+T NAYATGQRKRR++    +S+  H RWHKTGKTK V+ENG+  GWKKI  L
Subjt:  CTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSL

Query:  YRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAK
        Y+ +K  SKP+KS+WV+HQYHLG EEDE+ GEYV+SKI HQQPKQ  K++   +I+  D+L  QTSPRTP  NPP PPR G   + + V D+  P    +
Subjt:  YRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAK

Query:  EEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELD
        E + T  AS V  P++ LE+D+G P  LAGESQ ++  +L+Y++++  C E+L   S++ ++N  LN  S      N   E  N N   G  DLENLELD
Subjt:  EEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELD

Query:  TPPDFHLADLHFGSQESIFDWLDRI
        TPPDF L DL+F SQ+S F WLDR+
Subjt:  TPPDFHLADLHFGSQESIFDWLDRI

KAG5624447.1 hypothetical protein H5410_009665 [Solanum commersonii] (e value: 6.3e-222, % identity: 57.88)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYLL++RPRCLD   A  FALDIA AMECLHSHGIIHRDLKPENL+LT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE+V NKLPFEGMSNLQAAYAAAFKN+RP+A++LPE+LA I+TSCW EDPN RPNF+QIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR
        +LS++  PE V P  +   EN VLPPESPGTSSLM + R  +GE P   ME +                                 R + A   Q    R
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR

Query:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT
        SW+++ R +A KV+N S  ++  IKDC AKR+CPNC++ IDN D+  EWPGLP GVKFDP+D E+++HL AKC VGN + H  IDEFIPTL+  +GICYT
Subjt:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT

Query:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE
        HPENLPG  KDG+ +HFF++ +NAYATG+RKRRKI   ++   EH RWHKTGKTK V+ENG  KG KK+  LY++ K G K EK++WV+HQYHLG +EDE
Subjt:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE

Query:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWL
        KEGEYV+SKI +QQ KQ  K+ND+   ++     +QT P TPK+  P PPR G++   + + D+ LP S  +E E+  E  + S   I  E +      L
Subjt:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWL

Query:  AGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACH-EIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI
        AGESQ  D    + ++++LLC+E  D    +F+ +   N         + CH    NGN  CGI++L+NLELD+PPDF LADL FGSQE+IF WLDR+
Subjt:  AGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACH-EIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI

TKY50484.1 NAC domain-containing protein 8 [Spatholobus suberectus] (e value: 3.3e-223, % identity: 58.55)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYLL++RP+CLD+  A GFALDIA AMECLHSHGIIHRDLKP+NLILTGDHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR GEKKHYN+KVD YSF IVFWE++ NKLPFEGMSNLQAAYAAAFKN RP+AE+LPE+LA I+TSCW EDPN R NFSQIIQMLLR
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR
        YLST+  PE V P  M   ENAVLPPESPGTS+LM   R  +GE P  ++E+                                               R
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR

Query:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT
        SW++D    +RKV N + S++  IKDC    ECP CH+ IDN D+  EWPG P GVKFDP+D E+++HLAAKC +GN KPH  I EFIPTLE +QGICYT
Subjt:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT

Query:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE
         PENLPG  KDGN VHFFHKT NAYATGQRKRRKI      + EH RWHKTG+TKAVIE+GVHKG+KKI  LY  SK GSKP K++WV+HQYHLG+EEDE
Subjt:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE

Query:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGK---SFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCP
        K+GEYV+SKI +QQ KQ +K+ +N +++D  ++  Q  P TPK NPP  P +GK   +F  N +       SF ++ +  P  S+    ++   ++   P
Subjt:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGK---SFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCP

Query:  AWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGN--VPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLD
        AWLAGESQ   D +   L+D LLC E+L   SSA ++NS L          +  ++ AN N  V   ++ L+ LELDTPPDF L++L FGSQ+S   W+D
Subjt:  AWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGN--VPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLD

Query:  RI
        ++
Subjt:  RI
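
In the alignment blocks above, the unlabeled line between Query and Subjt follows the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts these for a single block. The function name `midline_stats` is hypothetical, and the per-block figure it returns will generally differ from the % identity listed for a hit, which is reported for the alignment as a whole.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical and positive columns in one alignment block."""
    # Pad the midline: trailing blanks are often stripped from copied text.
    midline = midline.ljust(len(query))
    length = len(query)
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    pct_identity = round(100.0 * identical / length, 2) if length else 0.0
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        "pct_identity": pct_identity,
    }


# Final block of the KAF5740414.1 alignment, with leading padding removed.
stats = midline_stats(
    "TPPDFHLADLHFGSQESIFDWLDRI",
    "TPPDF L DL+F SQ+S F WLDR+",
)
print(stats)
```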

TrEMBL top hits (e value, % identity, alignment)
A0A1R3GMR3 Uncharacterized protein (e value: 2.1e-215, % identity: 56.98)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYLL++RP+ LD+  A GFALDIA AMECLHSHGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+A++LPE+LA I+TSCW EDPNARPNF+QIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR
         LS+I  PE V PP     ENAVLPPESPG + +            N  + +   + F+ FS  F  L L  +                           
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR

Query:  SWILDSRTIARKVRNISQSSSQVIK-DCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICY
        +W++D R +A+K++N S SSS     D  A RECPNC + IDN+D+ P WPG P GVKFDP+D  I++HLAAKC VGN K HA +D FIPTLE + GICY
Subjt:  SWILDSRTIARKVRNISQSSSQVIK-DCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICY

Query:  THPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEED
        THPENLPG   DG+ ++FF +T NAYATGQRKRRKI+ N  S+ EH RWHKTGK+K++IENGVHKGWKKI  LY+ SK  SKPEK++WV+HQYHLG  E+
Subjt:  THPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEED

Query:  EKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAW
        E++G+YV+SKI++Q  KQ  K++ + LI++ + L  QT+PRTPK+ PP PPR  KS   +        +S  +E E + EAS      ++L + V    W
Subjt:  EKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAW

Query:  LAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIA-----NGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWL
        LAGESQ  ++  L  L+ +LLCN+++       IS  Q N+        N C +I      N +   GI +LENL+ DTPPDF +ADL FGSQ+S+  W+
Subjt:  LAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIA-----NGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWL

Query:  DR
        DR
Subjt:  DR

A0A3Q7FZU2 Uncharacterized protein (e value: 3.5e-218, % identity: 57.39)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYLL++RPRCLD   A  FALDIA AMECLHSHGIIHRDLKPENL+LT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE+V NKLPFEGMSNLQAAYAAAFKN+RP+A++LPE+LA I+TSCW EDPN RPNF+QIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFE-SSIIWVTGRAEHAPPTQQDPL
        +LS++  PE V P  +   EN VLPPESPG                         S F+C +  +  L+L+ + +  +  S+I   G+           L
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFE-SSIIWVTGRAEHAPPTQQDPL

Query:  RSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICY
        RSW+++ R +A KV+N S  ++  IKDC AKR+CPNC++ IDN D+  EWPGLP GVKFDP+D E+++HL AKC VGN + H  IDEFIPTLE  +GICY
Subjt:  RSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICY

Query:  THPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEED
        THPENLPG  KDG+ +HFF++ +NAYATG+RKRRKI   ++   EH RWHKTGKTK V+ENG  KG KK+  LY++ K GSK EK++WV+HQYHLG +ED
Subjt:  THPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEED

Query:  EKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAW
        EKEGEYV+SKI +QQ KQ  K+ND    ++     +QT P TPK+  P PPR G++   + + D  LP S  +E E+  E  + S   I  E +      
Subjt:  EKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAW

Query:  LAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGS-----NACH-EIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDW
        LAGESQ  D  ++H   ++LLC+E  D         S L+     L  S     + CH    NGN  CGI++L+NLELD+PPDF LADL FGSQE+IF W
Subjt:  LAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGS-----NACH-EIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDW

Query:  LDRI
        LDR+
Subjt:  LDRI

A0A6A6KW68 Ribosomal RNA-processing protein 8 (e value: 0.0e+00, % identity: 59.74)
Query:  TGEEALNYFKEDQVLFDVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAI
        +GEEALNYFKED  LFD+YH+GYQEQMSHWPE PVN+II WLK+   S +VADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDM+NTPLD+SSVDVA+
Subjt:  TGEEALNYFKEDQVLFDVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAI

Query:  FCLSLMGVNYESYLAEAQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR-
        FCLSLMG N+ SYL EA R LKP GWLLIAEVKSRFDP+ GGADP KF KAV +LGF S LKDFSNKMFILLYF+KK+++SSK K+I+WP+LKPCLYKR 
Subjt:  FCLSLMGVNYESYLAEAQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR-

Query:  ----------------------------------------------------------HPVPSPAFL------------------------FAGACKEPI
                                                                  H   +P  +                        F GACKEP+
Subjt:  ----------------------------------------------------------HPVPSPAFL------------------------FAGACKEPI

Query:  MVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGTYRWMAPELY
        MVIVTELLLGGTLRKYLL++RPRCL++H A GFALDIA AMEC+H+HGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGTYRWMAPELY
Subjt:  MVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGTYRWMAPELY

Query:  STVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLRYLSTIPQPE
        STVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+AENLPE+LA I+TSCW EDPNARPNF QIIQMLL YLST+   E
Subjt:  STVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLRYLSTIPQPE

Query:  YVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLRSWILDSRTI
         V    +   ENAVLPPESPGTSSLM A R  +GE P  +ME+K   F S                       W+ G              +W++DSRTI
Subjt:  YVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLRSWILDSRTI

Query:  ARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVS
        ARKVRN +++S+  IKD  A  ECPNCH  IDN+D+  +WPGLPAGVKFDP+D +I++HLAAKC VG+ KP A IDEFIPTL  D+GICYTHPENLPG  
Subjt:  ARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVS

Query:  KDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSK
        KDG+ VHFFH+T NAYATGQRKRRK+ +   S+ +H RWHKTGKTK+V+ENGV KG KKI  LY+S+K GSKP+KS+WV+HQYHLGT+EDEK+GEYV+SK
Subjt:  KDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSK

Query:  ISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDD
        I +QQPKQ   ++D + ++D D L    SPRTP +NPP PPR  KS   +  AD         EE++   A  V  P++  E+DVGC AWLAGESQ +++
Subjt:  ISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDD

Query:  VELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSF--------PLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI
         +   + D+LLC E+ D  SS F++ S LN +S+         ++G+N  ++  N N PCGIADLENL+LDTPPD  LADL F SQ+  F   D I
Subjt:  VELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSF--------PLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI

A0A6A6M487 Uncharacterized protein (e value: 3.3e-245, % identity: 63.38)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEPIMVIVTELLLGGTLRKYLL++RPRCLD+H A GFALDIA AMECLHSHGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN RP+AENLPE+LA I+ SCW EDPNARPNFSQIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR
        YLSTI  P  V P  +   ENAVLPPESPGTSSLM A R  +GE    +ME+K  +     S   F                  TG            LR
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLR

Query:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT
        +W++D RTIARKV+N +++S+  IKD  A R CPNCH  IDN+D+  +WPGLPAGVKFDP+D E+++HLAAKC  G+ KPHA IDEFIPTL  D+GICYT
Subjt:  SWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYT

Query:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE
        HPENLPG  KDG+++HFFHKT NAYATGQRKRRK+ +   S+ EH RWHKTGKTK+V+ENGV KG KKI  LY+SSK GSKP+KS+WV+HQYHLGT+EDE
Subjt:  HPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDE

Query:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWL
        KEGEYV+SKI +Q  KQ   ++D + I+D D L   TSPRTP +NPP PPR  KS   + VAD+       KE+++   AS V  P +  E+DV  PAWL
Subjt:  KEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNISLENDVGCPAWL

Query:  AGESQGIDDVELHYLEDNLLCNE-----VLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWL
        AGESQ +++ E   + D+LLC E      L +D+S  +S+  +      ++G+N  +   N N PCGIADLENLELDTPPDF LADL F SQ+SI DW+
Subjt:  AGESQGIDDVELHYLEDNLLCNE-----VLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWL

A0A7J7D258 Serine/threonine-protein kinase HT1-like (e value: 1.3e-236, % identity: 58.76)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYL+++RP  +++H A GFALDIA AMECLH+HGIIHRDLKPENLILT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+AENLPE+LA I+TSCW EDPNARPNFSQIIQMLL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEY-VTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAI-----FFRLLLESLVVAFESSIIWVTGRAE-----
        +L+TI  P + V P  +   EN+VLPPESPGTSSLM   R  + E  N EME++     + FS+           ++S++    S  + V   +E     
Subjt:  YLSTIPQPEY-VTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAI-----FFRLLLESLVVAFESSIIWVTGRAE-----

Query:  -----HAPPTQ------------QDPLRSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAK
             + PP+             +D  R+W++D RTIA+KV+N +  S+  I  C A RECPNCH+ IDN+D+  EWPGLPAGVKF+P+D E+++HLAAK
Subjt:  -----HAPPTQ------------QDPLRSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAK

Query:  CTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSL
        C VGN KPH LIDEFIPTLE D+GICYTHP+ LPG  KDG+ V+FFH+T NAYATGQRKRR++    +S+  H RWHKTGKTK V+ENG+  GWKKI  L
Subjt:  CTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSL

Query:  YRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAK
        Y+ +K  SKP+KS+WV+HQYHLG EEDE+ GEYV+SKI HQQPKQ  K++   +I+  D+L  QTSPRTP  NPP PPR G   + + V D+  P    +
Subjt:  YRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAK

Query:  EEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELD
        E + T  AS V  P++ LE+D+G P  LAGESQ ++  +L+Y++++  C E+L   S++ ++N  LN  S      N   E  N N   G  DLENLELD
Subjt:  EEELTPEASRVSYPNISLENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELD

Query:  TPPDFHLADLHFGSQESIFDWLDRI
        TPPDF L DL+F SQ+S F WLDR+
Subjt:  TPPDFHLADLHFGSQESIFDWLDRI

SwissProt top hits (e value, % identity, alignment)
F4HY61 NAC domain-containing protein 10 (e value: 7.6e-53, % identity: 50.88)
Query:  SRTIARKVRNISQSSSQVIKDCDAK-------RECPNC-----HFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLET
        S T    V+   QS S V    D+K         CP+C     H   D      + P LPAGVKFDP+D+EI+ HL AK +    K H LIDEFIPTLE 
Subjt:  SRTIARKVRNISQSSSQVIKDCDAK-------RECPNC-----HFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLET

Query:  DQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYH
        + GICYTHPE LPGVSKDG   HFFH+ S AY TG RKRRK+ +++       RWHKTGKT+ V+      G+KKI  LY +     KPEK++WV+HQYH
Subjt:  DQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYH

Query:  LGTEEDEKEGEYVLSKISHQ-QPKQC
        LG+ EDEK+GE VLSK+ +Q QP+QC
Subjt:  LGTEEDEKEGEYVLSKISHQ-QPKQC

O49459 NAC domain-containing protein 73 (e value: 1.3e-52, % identity: 49.77)
Query:  DSRTIARKVRNISQSSSQVIK-DCDAKRECPNC--HFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTH
        D +T+ R + +   + S V        + CP+C  +F         + PGLPAGVKFDPTD+E+++HL  K      K H LIDEFI T++ + GICYTH
Subjt:  DSRTIARKVRNISQSSSQVIK-DCDAKRECPNC--HFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTH

Query:  PENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEK
        PE LPGV+KDG   HFFH+ S AY TG RKRRK+ + DS      RWHKTGKT+ V+  G  +G+KKI  LY +     KPEK++WV+HQYHLGT E+EK
Subjt:  PENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEK

Query:  EGEYVLSKISHQ-QPKQCSKS
        EGE V+SK+ +Q QP+QC  S
Subjt:  EGEYVLSKISHQ-QPKQCSKS

Q6NQK2 SUPPRESSOR OF GAMMA RESPONSE 1 (e value: 1.9e-72, % identity: 39.42)
Query:  RSWILDSRTIARKVRNISQSSS--QVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGI
        RSW++DS  IA K+ + S SS   QV+   +  R CP C  +IDN+D+  +WPGLP GVKFDP+D EI+ HL AK  +  L  H  IDEFIPT+  D GI
Subjt:  RSWILDSRTIARKVRNISQSSS--QVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGI

Query:  CYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTE
        CYTHP+NLPGV  DG   HFFHK   AY+TG RKRRKI  +D   +   RWHKTG+TK V+ +GV +G KKI  LY     G K  K++WV+HQYHLG E
Subjt:  CYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTE

Query:  EDEKEGEYVLSKISHQQPKQC-----SKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNA--VADNGLPQSFAKEEEL----TPEASRVSYP
        EDEKEG+YV+SKI +QQP+Q       K+   +  D F A+     P TPK   P  PR+     S++   +D   P  +    E+    T E   +   
Subjt:  EDEKEGEYVLSKISHQQPKQC-----SKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNA--VADNGLPQSFAKEEEL----TPEASRVSYP

Query:  NISLENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDS------------SAFISNSQ--LNQMSF--PLSGSNACHEIANG-----NVPC----
          S++ +   P+       G+++     L+D     +  D ++            S FI NSQ  +  +S    L GS    E  N        PC    
Subjt:  NISLENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDS------------SAFISNSQ--LNQMSF--PLSGSNACHEIANG-----NVPC----

Query:  -----------------GIADLENLELDTPPDFHLADLHFGSQESIFDW
                          + D  N+ELDTPP+F L+ L FGSQ+S   W
Subjt:  -----------------GIADLENLELDTPPDFHLADLHFGSQESIFDW

Q84JC0 Ribosomal RNA-processing protein 8 (e value: 2.3e-97, % identity: 62.9)
Query:  DDGRSSKKRKRAKRRGADKSKDFANNIADAT---KPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLF
        ++ ++S+ RKR ++R    SK+            K N  D             +    S+FLD +R RLSGG FRMLNEKLYTC+G+EAL+YFKED  +F
Subjt:  DDGRSSKKRKRAKRRGADKSKDFANNIADAT---KPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLF

Query:  DVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAE
        D+YHTGYQ+QMS+WPELPVN II WL  +  S +VADFGCGDAR+AK+VKNKVFSFDLVSK+PSVIACDM+NT L+SSSVDVA+FCLSLMG NY SY+ E
Subjt:  DVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAE

Query:  AQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR
        A R L+P G LLIAEVKSRFDP+NGGADPK F+KAVC+LGF S LKDFSNKMFIL +FKKK++ +S  K I WP+LK CLYKR
Subjt:  AQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR

Q9M0F8 NAC domain-containing protein 75 (e value: 3.9e-57, % identity: 55.67)
Query:  CDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYA
        C +K+ CP+C   ++      +W GLPAGVKFDPTD+E+++HL AK    + K H LIDEFIPT+E + GICYTHPE LPGV++DG   HFFH+ S AY 
Subjt:  CDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHLAAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYA

Query:  TGQRKRRKIES------NDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQ-QPKQCS
        TG RKRRKI++        SSS    RWHKTGKT+ V+ NG  KG KKI  LY +     KPEK++WV+HQYHLGT E+EKEGE V+SKI +Q QP+QC+
Subjt:  TGQRKRRKIES------NDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGGSKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQ-QPKQCS

Query:  KSN
         S+
Subjt:  KSN

Arabidopsis top hits (e value, %identity, alignment)
AT3G27560.1 Protein kinase superfamily protein (e value: 9.4e-107; %identity: 72.01)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELLLGGTLRKYL+S+RP+ LD+  A GFALDIA AMECLHSHGIIHRDLKPENLIL+ DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKNLRP+AE+LP +L  I+TSCW EDPN RPNF++IIQMLLR
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPT--MHPPENAVLPPESPGTSSLMAA----TRRGTGEVPNGEMEEKSSSFFSCFS
        YL+T+  P+ + PP   +   EN VL PESPGT SLM+       R T    +   ++   SFFSC S
Subjt:  YLSTIPQPEYVTPPT--MHPPENAVLPPESPGTSSLMAA----TRRGTGEVPNGEMEEKSSSFFSCFS

AT5G40530.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein (e value: 1.6e-98; %identity: 62.9)
Query:  DDGRSSKKRKRAKRRGADKSKDFANNIADAT---KPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLF
        ++ ++S+ RKR ++R    SK+            K N  D             +    S+FLD +R RLSGG FRMLNEKLYTC+G+EAL+YFKED  +F
Subjt:  DDGRSSKKRKRAKRRGADKSKDFANNIADAT---KPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLF

Query:  DVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAE
        D+YHTGYQ+QMS+WPELPVN II WL  +  S +VADFGCGDAR+AK+VKNKVFSFDLVSK+PSVIACDM+NT L+SSSVDVA+FCLSLMG NY SY+ E
Subjt:  DVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAE

Query:  AQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR
        A R L+P G LLIAEVKSRFDP+NGGADPK F+KAVC+LGF S LKDFSNKMFIL +FKKK++ +S  K I WP+LK CLYKR
Subjt:  AQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKR

AT5G40530.2 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein (e value: 2.4e-86; %identity: 58.45)
Query:  DDGRSSKKRKRAKRRGADKSKDFANNIADAT---KPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLF
        ++ ++S+ RKR ++R    SK+            K N  D             +    S+FLD +R RLSGG FRMLNEKLYTC+G+EAL+YFKED  +F
Subjt:  DDGRSSKKRKRAKRRGADKSKDFANNIADAT---KPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLF

Query:  DVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAE
        D+YHTGYQ+QMS+WPELPVN II WL  +  S +VADFGCGDAR+AK+VKNKVFSFDLVSK+PSVIACDM+NT L+SSSVDVA+FCLSLMG NY SY+ E
Subjt:  DVYHTGYQEQMSHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAE

Query:  AQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALK-----------------------DFSNKMFILLYFKKK
        A R L+P G LLIAEVKSRFDP+NGGADPK F+KAVC+LGF S LK                       DFSNKMFIL +FKKK
Subjt:  AQRALKPRGWLLIAEVKSRFDPSNGGADPKKFIKAVCELGFVSALK-----------------------DFSNKMFILLYFKKK

AT5G40540.1 Protein kinase superfamily protein (e value: 1.1e-104; %identity: 72.18)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEPIMVIVTELLLGGTLRKYL+S+RP  LD+  A G+ALDIA AMECLHSHG+IHRDLKPE+LILT D+KTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR+GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+A++LP++LA I+TSCW EDPN RPNF++IIQMLLR
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTP--PTMHPPENAVLPPESPGTSSLMAATRRGTGEVP----NGEMEEKSSSFFSC
         LSTI   E V P    +   EN VLPPESPGT SLM  T R   ++P    + + E + S FF C
Subjt:  YLSTIPQPEYVTP--PTMHPPENAVLPPESPGTSSLMAATRRGTGEVP----NGEMEEKSSSFFSC

AT5G50180.1 Protein kinase superfamily protein (e value: 3.2e-107; %identity: 71.32)
Query:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT
        F GACKEP+MVIVTELL GGTLRKYLL++RP CL+   A GFALDIA  MECLHSHGIIHRDLKPENL+LT DHKTVKL DFGLAREES+TEMMTAETGT
Subjt:  FAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEAAGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGT

Query:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR
        YRWMAPELYSTVTLR GEKKHYN+KVD YSF IV WE++ NKLPFEGMSNLQAAYAAAFKN+RP+AE+LPEEL  I+TSCW EDPNARPNF+ II++LL 
Subjt:  YRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMSNLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLR

Query:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIF
        YLS +  P    P  +   +N +LPP+SPGTSSLMA      GE P  + E+K    F CF+  +
Subjt:  YLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFSCFSAIF


Sequences
CDS sequence
ATGGCGGACGACGGCCGCTCAAGCAAAAAGCGCAAGAGGGCGAAGCGTCGTGGAGCAGACAAAAGCAAAGATTTCGCCAATAATATAGCGGATGCTACTAAACCCAATAG
CGAAGACATCGTTGTTTCCGGTTCTTCCGACGGCAGAAAATGTCGGAGGGTTTCTGGTTCGTCTAGTTTTCTCGATAAGATGCGAGCTCGTTTGTCAGGAGGGCATTTCA
GGATGCTTAATGAGAAGTTGTATACCTGCACAGGGGAAGAGGCTCTTAATTATTTCAAAGAAGACCAGGTTCTGTTTGATGTTTATCATACAGGATACCAAGAGCAAATG
TCGCATTGGCCTGAGCTGCCCGTGAATTTAATCATAAAATGGCTGAAAGAGCACGATCCCTCATTTATTGTGGCAGATTTTGGCTGTGGGGATGCACGCCTAGCAAAGAA
TGTGAAGAATAAAGTTTTCTCATTTGATCTGGTTTCCAAAGATCCTTCTGTTATTGCTTGTGATATGGCCAATACACCTCTTGACTCTTCATCTGTAGATGTTGCCATCT
TTTGCCTTTCACTTATGGGGGTCAATTATGAAAGTTACTTGGCAGAAGCACAAAGAGCACTTAAACCACGTGGATGGCTTTTAATAGCAGAAGTCAAAAGCAGATTTGAC
CCAAGCAATGGAGGAGCCGACCCCAAAAAGTTCATAAAGGCTGTTTGTGAGCTAGGTTTCGTCTCAGCTCTGAAGGACTTCTCAAATAAGATGTTTATCTTGTTATACTT
CAAGAAAAAGGATGAGAAAAGTTCTAAGGGGAAAGATATCGATTGGCCCCAGCTAAAACCATGTTTATACAAACGTCACCCAGTTCCTTCCCCGGCTTTTCTTTTTGCTG
GTGCTTGCAAGGAGCCTATTATGGTGATAGTGACTGAGCTTCTTTTAGGCGGTACATTACGAAAGTACTTGTTGAGTATACGGCCAAGGTGCTTGGATTTGCACGAGGCA
GCTGGCTTTGCACTTGATATTGCATGTGCAATGGAATGCCTGCATTCCCATGGAATCATTCACCGCGATCTCAAACCAGAGAATTTGATCTTGACCGGAGACCACAAAAC
TGTTAAGCTTACAGACTTTGGTCTGGCTAGAGAAGAGTCGGTAACAGAGATGATGACGGCTGAAACCGGGACTTACAGATGGATGGCTCCAGAGCTTTACAGTACAGTTA
CTTTAAGAAATGGAGAGAAGAAGCATTACAATAACAAGGTGGATGTTTACAGCTTTGGAATTGTATTTTGGGAGATTGTTCAAAATAAGTTGCCTTTCGAAGGCATGTCG
AATCTACAAGCAGCATATGCAGCTGCTTTTAAGAACTTACGACCCAATGCCGAGAACCTCCCGGAGGAATTGGCCCCGATTATAACTTCGTGCTGGACGGAGGATCCGAA
CGCTCGGCCTAACTTCAGCCAAATCATACAGATGCTCTTGAGATATCTATCCACCATTCCACAACCAGAGTATGTTACACCACCAACAATGCACCCGCCTGAGAATGCAG
TGTTGCCACCAGAGTCTCCTGGAACGAGTTCTTTGATGGCAGCCACAAGACGTGGCACCGGGGAAGTCCCGAACGGCGAGATGGAAGAGAAATCGAGCAGTTTCTTCTCC
TGTTTCTCTGCTATATTTTTTCGTTTGTTGCTTGAGAGCTTGGTGGTTGCATTTGAGAGCTCGATCATTTGGGTTACTGGCAGAGCTGAACATGCCCCACCAACCCAACA
AGATCCTTTGAGGTCTTGGATCCTTGATAGCCGAACAATTGCAAGAAAAGTCAGAAACATCAGTCAGTCCTCTTCGCAAGTAATTAAAGATTGTGATGCGAAACGAGAAT
GTCCAAACTGTCACTTTATTATTGATAATACAGATATTTGTCCTGAATGGCCTGGTTTGCCTGCTGGTGTTAAGTTTGATCCAACTGATGAAGAAATAATGGATCACTTA
GCGGCAAAGTGCACCGTTGGAAACTTAAAGCCACATGCATTAATTGATGAATTCATCCCCACACTAGAAACAGATCAAGGAATCTGCTACACTCATCCAGAAAATCTCCC
CGGTGTTAGTAAAGATGGGAATGATGTCCATTTCTTTCACAAAACAAGCAATGCATATGCCACAGGTCAAAGGAAGCGCCGCAAGATTGAAAGTAACGATAGTTCAAGCA
TGGAACATTTCCGCTGGCACAAGACAGGTAAGACCAAAGCTGTGATAGAAAATGGTGTTCATAAGGGATGGAAGAAGATATTCTCTCTGTACAGAAGTTCGAAGGGAGGC
TCGAAGCCCGAAAAGTCTAGCTGGGTGATACATCAATATCATTTGGGAACTGAAGAAGATGAGAAGGAAGGCGAATACGTTTTGTCAAAGATTTCTCATCAGCAACCAAA
GCAATGTAGTAAGAGCAATGATAATATGCTAATTGATGATTTTGATGCTCTGCTCCATCAAACTAGCCCTAGAACTCCCAAGTCAAATCCTCCGATTCCGCCTCGGTCAG
GAAAATCATTTGTCTCCAATGCTGTTGCTGACAATGGTCTGCCTCAATCATTTGCAAAGGAAGAAGAGTTGACTCCGGAAGCATCTCGTGTTTCTTATCCTAATATTTCA
TTAGAGAACGACGTGGGATGCCCGGCATGGTTGGCTGGAGAATCTCAGGGTATAGATGATGTAGAATTACATTACTTGGAAGACAACTTATTATGCAATGAAGTCCTGGA
TGTGGATTCTAGTGCTTTTATAAGTAACAGCCAACTAAATCAAATGTCCTTCCCTCTCTCCGGTAGCAATGCTTGCCACGAGATTGCGAACGGCAACGTTCCGTGTGGAA
TTGCAGACCTTGAGAACCTTGAATTGGATACTCCACCTGACTTTCATCTTGCTGACCTGCATTTTGGTTCTCAAGAGAGCATTTTCGACTGGCTCGACAGGATATGA
mRNA sequence
ATGGCGGACGACGGCCGCTCAAGCAAAAAGCGCAAGAGGGCGAAGCGTCGTGGAGCAGACAAAAGCAAAGATTTCGCCAATAATATAGCGGATGCTACTAAACCCAATAG
CGAAGACATCGTTGTTTCCGGTTCTTCCGACGGCAGAAAATGTCGGAGGGTTTCTGGTTCGTCTAGTTTTCTCGATAAGATGCGAGCTCGTTTGTCAGGAGGGCATTTCA
GGATGCTTAATGAGAAGTTGTATACCTGCACAGGGGAAGAGGCTCTTAATTATTTCAAAGAAGACCAGGTTCTGTTTGATGTTTATCATACAGGATACCAAGAGCAAATG
TCGCATTGGCCTGAGCTGCCCGTGAATTTAATCATAAAATGGCTGAAAGAGCACGATCCCTCATTTATTGTGGCAGATTTTGGCTGTGGGGATGCACGCCTAGCAAAGAA
TGTGAAGAATAAAGTTTTCTCATTTGATCTGGTTTCCAAAGATCCTTCTGTTATTGCTTGTGATATGGCCAATACACCTCTTGACTCTTCATCTGTAGATGTTGCCATCT
TTTGCCTTTCACTTATGGGGGTCAATTATGAAAGTTACTTGGCAGAAGCACAAAGAGCACTTAAACCACGTGGATGGCTTTTAATAGCAGAAGTCAAAAGCAGATTTGAC
CCAAGCAATGGAGGAGCCGACCCCAAAAAGTTCATAAAGGCTGTTTGTGAGCTAGGTTTCGTCTCAGCTCTGAAGGACTTCTCAAATAAGATGTTTATCTTGTTATACTT
CAAGAAAAAGGATGAGAAAAGTTCTAAGGGGAAAGATATCGATTGGCCCCAGCTAAAACCATGTTTATACAAACGTCACCCAGTTCCTTCCCCGGCTTTTCTTTTTGCTG
GTGCTTGCAAGGAGCCTATTATGGTGATAGTGACTGAGCTTCTTTTAGGCGGTACATTACGAAAGTACTTGTTGAGTATACGGCCAAGGTGCTTGGATTTGCACGAGGCA
GCTGGCTTTGCACTTGATATTGCATGTGCAATGGAATGCCTGCATTCCCATGGAATCATTCACCGCGATCTCAAACCAGAGAATTTGATCTTGACCGGAGACCACAAAAC
TGTTAAGCTTACAGACTTTGGTCTGGCTAGAGAAGAGTCGGTAACAGAGATGATGACGGCTGAAACCGGGACTTACAGATGGATGGCTCCAGAGCTTTACAGTACAGTTA
CTTTAAGAAATGGAGAGAAGAAGCATTACAATAACAAGGTGGATGTTTACAGCTTTGGAATTGTATTTTGGGAGATTGTTCAAAATAAGTTGCCTTTCGAAGGCATGTCG
AATCTACAAGCAGCATATGCAGCTGCTTTTAAGAACTTACGACCCAATGCCGAGAACCTCCCGGAGGAATTGGCCCCGATTATAACTTCGTGCTGGACGGAGGATCCGAA
CGCTCGGCCTAACTTCAGCCAAATCATACAGATGCTCTTGAGATATCTATCCACCATTCCACAACCAGAGTATGTTACACCACCAACAATGCACCCGCCTGAGAATGCAG
TGTTGCCACCAGAGTCTCCTGGAACGAGTTCTTTGATGGCAGCCACAAGACGTGGCACCGGGGAAGTCCCGAACGGCGAGATGGAAGAGAAATCGAGCAGTTTCTTCTCC
TGTTTCTCTGCTATATTTTTTCGTTTGTTGCTTGAGAGCTTGGTGGTTGCATTTGAGAGCTCGATCATTTGGGTTACTGGCAGAGCTGAACATGCCCCACCAACCCAACA
AGATCCTTTGAGGTCTTGGATCCTTGATAGCCGAACAATTGCAAGAAAAGTCAGAAACATCAGTCAGTCCTCTTCGCAAGTAATTAAAGATTGTGATGCGAAACGAGAAT
GTCCAAACTGTCACTTTATTATTGATAATACAGATATTTGTCCTGAATGGCCTGGTTTGCCTGCTGGTGTTAAGTTTGATCCAACTGATGAAGAAATAATGGATCACTTA
GCGGCAAAGTGCACCGTTGGAAACTTAAAGCCACATGCATTAATTGATGAATTCATCCCCACACTAGAAACAGATCAAGGAATCTGCTACACTCATCCAGAAAATCTCCC
CGGTGTTAGTAAAGATGGGAATGATGTCCATTTCTTTCACAAAACAAGCAATGCATATGCCACAGGTCAAAGGAAGCGCCGCAAGATTGAAAGTAACGATAGTTCAAGCA
TGGAACATTTCCGCTGGCACAAGACAGGTAAGACCAAAGCTGTGATAGAAAATGGTGTTCATAAGGGATGGAAGAAGATATTCTCTCTGTACAGAAGTTCGAAGGGAGGC
TCGAAGCCCGAAAAGTCTAGCTGGGTGATACATCAATATCATTTGGGAACTGAAGAAGATGAGAAGGAAGGCGAATACGTTTTGTCAAAGATTTCTCATCAGCAACCAAA
GCAATGTAGTAAGAGCAATGATAATATGCTAATTGATGATTTTGATGCTCTGCTCCATCAAACTAGCCCTAGAACTCCCAAGTCAAATCCTCCGATTCCGCCTCGGTCAG
GAAAATCATTTGTCTCCAATGCTGTTGCTGACAATGGTCTGCCTCAATCATTTGCAAAGGAAGAAGAGTTGACTCCGGAAGCATCTCGTGTTTCTTATCCTAATATTTCA
TTAGAGAACGACGTGGGATGCCCGGCATGGTTGGCTGGAGAATCTCAGGGTATAGATGATGTAGAATTACATTACTTGGAAGACAACTTATTATGCAATGAAGTCCTGGA
TGTGGATTCTAGTGCTTTTATAAGTAACAGCCAACTAAATCAAATGTCCTTCCCTCTCTCCGGTAGCAATGCTTGCCACGAGATTGCGAACGGCAACGTTCCGTGTGGAA
TTGCAGACCTTGAGAACCTTGAATTGGATACTCCACCTGACTTTCATCTTGCTGACCTGCATTTTGGTTCTCAAGAGAGCATTTTCGACTGGCTCGACAGGATATGA
Protein sequence
MADDGRSSKKRKRAKRRGADKSKDFANNIADATKPNSEDIVVSGSSDGRKCRRVSGSSSFLDKMRARLSGGHFRMLNEKLYTCTGEEALNYFKEDQVLFDVYHTGYQEQM
SHWPELPVNLIIKWLKEHDPSFIVADFGCGDARLAKNVKNKVFSFDLVSKDPSVIACDMANTPLDSSSVDVAIFCLSLMGVNYESYLAEAQRALKPRGWLLIAEVKSRFD
PSNGGADPKKFIKAVCELGFVSALKDFSNKMFILLYFKKKDEKSSKGKDIDWPQLKPCLYKRHPVPSPAFLFAGACKEPIMVIVTELLLGGTLRKYLLSIRPRCLDLHEA
AGFALDIACAMECLHSHGIIHRDLKPENLILTGDHKTVKLTDFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRNGEKKHYNNKVDVYSFGIVFWEIVQNKLPFEGMS
NLQAAYAAAFKNLRPNAENLPEELAPIITSCWTEDPNARPNFSQIIQMLLRYLSTIPQPEYVTPPTMHPPENAVLPPESPGTSSLMAATRRGTGEVPNGEMEEKSSSFFS
CFSAIFFRLLLESLVVAFESSIIWVTGRAEHAPPTQQDPLRSWILDSRTIARKVRNISQSSSQVIKDCDAKRECPNCHFIIDNTDICPEWPGLPAGVKFDPTDEEIMDHL
AAKCTVGNLKPHALIDEFIPTLETDQGICYTHPENLPGVSKDGNDVHFFHKTSNAYATGQRKRRKIESNDSSSMEHFRWHKTGKTKAVIENGVHKGWKKIFSLYRSSKGG
SKPEKSSWVIHQYHLGTEEDEKEGEYVLSKISHQQPKQCSKSNDNMLIDDFDALLHQTSPRTPKSNPPIPPRSGKSFVSNAVADNGLPQSFAKEEELTPEASRVSYPNIS
LENDVGCPAWLAGESQGIDDVELHYLEDNLLCNEVLDVDSSAFISNSQLNQMSFPLSGSNACHEIANGNVPCGIADLENLELDTPPDFHLADLHFGSQESIFDWLDRI