CuGenDBv2

Tan0003819 (gene) of Snake gourd v1 genome

Gene ID: Tan0003819
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Plant protein of unknown function (DUF247)
Genome location: LG10:63044042..63045813
RNA-Seq Expression: Tan0003819
Synteny: Tan0003819
Gene Ontology terms: NA
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
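The genome location above follows a SEQID:START..END pattern. As a minimal sketch, it can be split into its parts and the span length computed. The function name parse_location is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG10:63044042..63045813")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1772 bases on LG10.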


Homology
GenBank top hits (e value, % identity, alignment)
KAG7025179.1 UPF0481 protein [Cucurbita argyrosperma subsp. argyrosperma]; e value 2.5e-102; identity 53.38%
Query:  QHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTW
        Q+KD    V+E+TT+NL S + N+ D   E+      I +IP  IKKVNP AF PQ +SFGPYHHG +HL PMEK+K  +    +R  GLS++D+V   W
Subjt:  QHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTW

Query:  GMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVIN---QNIDKKV-----------D
         M+EDLQR+YD LDD+W  +P KFLELMI+DGC ++Q LL    + ++    V RDMLLLENQ+PM LL KL+ M+ +   +NI   V           +
Subjt:  GMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVIN---QNIDKKV-----------D

Query:  ILVRGGCKHLLDMFRLELILRRQMEP-LLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISK
        IL+     HLLDM+R EL      EP LLQR  +G G+EIQLA  F KAGIK+K+G     +DFD+ +GVL LPFI MNA+IES LLNAM FEKL GI  
Subjt:  ILVRGGCKHLLDMFRLELILRRQMEP-LLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISK

Query:  EANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS
           SF+ILMGNL+EKDE++SFNQLAK  VL MW     VY+ V ++C RPWRIWWT LKD +F  PWTIIS+L A++GF LL++QT+YG+YGYY P  S
Subjt:  EANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS

KAG7025181.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 2.4e-105; identity 55.27%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILVRGGCKHL
           W M+EDLQR+YD LDD+W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDM  L   V  TLL++ +                     HL
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILVRGGCKHL

Query:  LDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMG
        LDM+R EL+  +  E   LQR  +G G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHIES LLNAMAFEKL GI    +SF+ILM 
Subjt:  LDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMG

Query:  NLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS
        NL+EKDE++SFNQLAK EVL MW   T+VYN V ++C RPWRIW T LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY PH S
Subjt:  NLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS

XP_022925442.1 UPF0481 protein At3g47200-like [Cucurbita moschata]; e value 5.4e-113; identity 55.94%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILV-------
           W M+EDLQR+YD LDD+W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDMLLLENQ+PM LL KL+ M + ++ +KKV  LV       
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILV-------

Query:  --------RGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL
                     HLLDM+R EL+  +  E   LQR  +G G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHI+S L+NAMAFEKL
Subjt:  --------RGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL

Query:  SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYK
         GI    +SF+ILMGNL+EKDE++SFNQLAK EVL MW    +VYN V ++C RPWRIWWT LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY 
Subjt:  SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYK

Query:  PHSS
        P  S
Subjt:  PHSS

XP_022925444.1 UPF0481 protein At3g47200-like [Cucurbita moschata]; e value 1.1e-102; identity 50.93%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        ME  + +++ AVV E+ T+N+ S L NLPD+L E +     I +IP  IKKVNP AF PQ +SFGPYHHG +HL P EK+K  +F+ F++  GLS+ED+V
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVI-------NQNIDKKV----
           W M+EDLQR+YD LDDKW  EP KFLE+MI+DGC I+Q LL    +  +    VLRD+LLLENQ+PM LL KL  M++       N+N++  V    
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVI-------NQNIDKKV----

Query:  -------DILVRGGCKHLLDMFRLELIL-------RRQMEPLLQR-------------RL--MGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVL
                +L+     H+LDM+R EL         R Q+     R             RL  +G G+EIQLA  F KAGIK+K+G     +DFD+ +GVL
Subjt:  -------DILVRGGCKHLLDMFRLELIL-------RRQMEPLLQR-------------RL--MGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVL

Query:  RLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIIS
         LPFI MNA+IES LLNAMAFEKL GI     SF+ILMGNL+EKDE++SFNQLAK  VL +W     VY  V  +C RPW+IWWT LKD NFQ+PWTIIS
Subjt:  RLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIIS

Query:  TLAAIVGFVLLILQTLYGIYGYYKPHSS
        T  A++GF LLI+QT+YG+YGYY P  S
Subjt:  TLAAIVGFVLLILQTLYGIYGYYKPHSS

XP_023535324.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]; e value 2.2e-114; identity 56.68%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILV-------
           W M+EDLQR+YD LDD+W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDMLLLENQ+PM LL KL+ M + ++ +KKV  LV       
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILV-------

Query:  --------RGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL
                     HLLDM+R EL+  +  E   LQR  +G G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHIES LLNAMAFEKL
Subjt:  --------RGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL

Query:  SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYK
         GI    +SF+ILMGNL+EKDE++SFNQLAK EVL MW   T+VYN V ++C RPWRIWWT LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY 
Subjt:  SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYK

Query:  PHSS
        P  S
Subjt:  PHSS
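Each alignment block above carries a middle line between Query and Subjt in which a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. BLAST-style reports conventionally give percent identity as identical positions divided by alignment length. The sketch below applies that to one short fragment only; the function name percent_identity is hypothetical, and the %identity values listed for each hit cover the whole alignment, not a single fragment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline letter repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Final fragment of the XP_023535324.1 alignment: query 'PHSS', midline 'P  S'.
print(percent_identity("PHSS", "P  S"))
```

To score a full hit, the query and midline fragments of all its blocks would first be concatenated in order.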

TrEMBL top hits (e value, % identity, alignment)
A0A6J1EBQ1 UPF0481 protein At3g47200-like; e value 2.6e-113; identity 55.94%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILV-------
           W M+EDLQR+YD LDD+W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDMLLLENQ+PM LL KL+ M + ++ +KKV  LV       
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILV-------

Query:  --------RGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL
                     HLLDM+R EL+  +  E   LQR  +G G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHI+S L+NAMAFEKL
Subjt:  --------RGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL

Query:  SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYK
         GI    +SF+ILMGNL+EKDE++SFNQLAK EVL MW    +VYN V ++C RPWRIWWT LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY 
Subjt:  SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYK

Query:  PHSS
        P  S
Subjt:  PHSS

A0A6J1EC69 UPF0481 protein At3g47200-like; e value 5.4e-103; identity 50.93%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        ME  + +++ AVV E+ T+N+ S L NLPD+L E +     I +IP  IKKVNP AF PQ +SFGPYHHG +HL P EK+K  +F+ F++  GLS+ED+V
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVI-------NQNIDKKV----
           W M+EDLQR+YD LDDKW  EP KFLE+MI+DGC I+Q LL    +  +    VLRD+LLLENQ+PM LL KL  M++       N+N++  V    
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVI-------NQNIDKKV----

Query:  -------DILVRGGCKHLLDMFRLELIL-------RRQMEPLLQR-------------RL--MGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVL
                +L+     H+LDM+R EL         R Q+     R             RL  +G G+EIQLA  F KAGIK+K+G     +DFD+ +GVL
Subjt:  -------DILVRGGCKHLLDMFRLELIL-------RRQMEPLLQR-------------RL--MGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVL

Query:  RLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIIS
         LPFI MNA+IES LLNAMAFEKL GI     SF+ILMGNL+EKDE++SFNQLAK  VL +W     VY  V  +C RPW+IWWT LKD NFQ+PWTIIS
Subjt:  RLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIIS

Query:  TLAAIVGFVLLILQTLYGIYGYYKPHSS
        T  A++GF LLI+QT+YG+YGYY P  S
Subjt:  TLAAIVGFVLLILQTLYGIYGYYKPHSS

A0A6J1ECA1 UPF0481 protein At3g47200-like; e value 2.2e-96; identity 53.06%
Query:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV
        MEM    D+  +V  +TT+NL S LEN    +   + I   I +IP+ I KVNP AF PQ +SFGPYHHG +HL PMEK K   F+ F+R  GL  EDIV
Subjt:  MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIV

Query:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKV--------DIL
           W M+EDLQ +YD L D+W  +P KFLE+MIVDG  +L  LL G     +    ++RDMLLLENQ+PM LL KL+ M +  + D+KV        + L
Subjt:  KRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQNIDKKV--------DIL

Query:  VRGGCKHLLDMFRLEL-ILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIKQGP----LDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEA
        +     HLLDM+R EL +     E  LQ   +G  +EIQLAT F KAGIK+++G     L F+E +GVL L FI MNA+IES LLNAM FEKLSGI    
Subjt:  VRGGCKHLLDMFRLEL-ILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIKQGP----LDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEA

Query:  NSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYY
         SF+ILMGNL+EKDE++SFNQLAK  VL MW +  +VY  V K+C RPWRIWWT LK+ +FQ+PW IIS L+AI+GFVLLI+QT+ G+YGYY
Subjt:  NSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYY

A0A6J1EF72 UPF0481 protein At3g47200-like; e value 1.1e-95; identity 47.42%
Query:  MEMEQHKDEGAVVIELT---------TKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRF
        ME+   +++  VV+EL+         T+NL  +LE+LPD   + ++    I +IP  IKKVNP AF PQ +SFGPYHHG  HL PMEK K  + Q F+R 
Subjt:  MEMEQHKDEGAVVIELT---------TKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRF

Query:  VGLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQN-------
         GLS ED+V   W M+EDLQ +YD LDDKW  +P KFLELM++DGC ++  L +   +        LRDML+LENQ+P+ LL KL+ M+  +N       
Subjt:  VGLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVINQN-------

Query:  -------ID-----------------KKVDILVRG--------GCK---------HLLDMFRLELILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIK
               ID                   +  LV G        G K         H+L M+R E++   +   L Q  L G G+EIQLA  F KAGIK+K
Subjt:  -------ID-----------------KKVDILVRG--------GCK---------HLLDMFRLELILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIK

Query:  QG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIW
        +G     + FDEK+G L LPFI MNA++ES LLN MAFEKLSGI+    SF+ILMGNL EKDE++SFNQLAK  VLE+W     VY+ V K+  RPW+IW
Subjt:  QG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIW

Query:  WTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS
        WT LKD NFQ+PWTIIST +A++GF LLI+QT+YG+YGYY P  S
Subjt:  WTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS

A0A6J1EI35 UPF0481 protein At3g47200-like; e value 5.1e-93; identity 52.02%
Query:  VIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFV-GLSVEDIVKRTWGMVEDLQ
        ++E   +NL S L NL     E + I   I +IP  I  VNP AF P+ +SFGPYHHG +HL PMEK K  +   F++   GL+ E IV   W M+ DLQ
Subjt:  VIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFV-GLSVEDIVKRTWGMVEDLQ

Query:  RAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSI----NQIVLRDMLLLENQVPMTLLQKLH---------FMVINQNIDKKV---DILVRGGCK-
         +YD LDDKW KEP KFLELMI+DGC I+   L+   +    N  V RDMLLLENQ+PM LL KL+         F++   N+   V   D +  G  + 
Subjt:  RAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSI----NQIVLRDMLLLENQVPMTLLQKLH---------FMVINQNIDKKV---DILVRGGCK-

Query:  ---HLLDMFRLEL---ILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEAN
           HLLDM+R EL   +L R  +  LQ   +G  +EI+LAT F KAGIK+++G     + FDE +G+L LPFI MNA+IES LLNAMAFEKLSGI     
Subjt:  ---HLLDMFRLEL---ILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEAN

Query:  SFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS
        SF++LMGNL++K E++SFNQLAK EVL  + +   VY  V ++C RPWRIWWT LKD NFQNPWTIISTL+A +GFVLLILQT+YG+YGYY P  S
Subjt:  SFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS

SwissProt top hits (e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645; e value 2.0e-06; identity 26.55%
Query:  EQHK-DEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQ-RFVGLSVEDIVK
        EQH+ DE   VI +  K+L + LE       +LE ++  I  +P  +   +P ++ P R+S GPYH     L+ ME+ K    +  + ++      D+V+
Subjt:  EQHK-DEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQ-RFVGLSVEDIVK

Query:  RTWGMVEDLQRAYDDLDDKWIK-EPQKFLELMIVDGCCILQHL------LDGDSINQI----VLRDMLLLENQVPMTLLQK-LHFMV-INQNIDKKVDIL
        +   M   ++  Y     K+I    +  L +M VD   +++ L           IN++    +LRD++++ENQ+P+ +L+K L F +   ++ D  +  +
Subjt:  RTWGMVEDLQRAYDDLDDKWIK-EPQKFLELMIVDGCCILQHL------LDGDSINQI----VLRDMLLLENQVPMTLLQK-LHFMV-INQNIDKKVDIL

Query:  VRGGCKHLLDM---FRLELILRRQME
        + G CK L  +   F  + IL+ Q +
Subjt:  VRGGCKHLLDM---FRLELILRRQME

Q9SD53 UPF0481 protein At3g47200; e value 8.6e-21; identity 26.73%
Query:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSVED--IVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCC
        I ++P     +NPKA+ P+ +S GPYH+G  HL  +++ K R  Q F        VE+  +VK    + + ++++Y +     +K     + +M++DGC 
Subjt:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSVED--IVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCC

Query:  ILQHL--------LDGDSINQI------VLRDMLLLENQVPMTLLQKLH---------------FMVINQNIDKKVDILVRG---GCKHLLDMFRLELIL
        IL           L  D I  I      +  D+LLLENQVP  +LQ L+               F      IDK+     +      KHLLD+ R E  L
Subjt:  ILQHL--------LDGDSINQI------VLRDMLLLENQVPMTLLQKLH---------------FMVINQNIDKKVDILVRG---GCKHLLDMFRLELIL

Query:  RRQMEPLLQRRLMGPGNEIQL---------------------ATLFRKAGIKIK------QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SG
            E     +   P  ++QL                     A   R  GIK +         L+   K+  L++P +  +  I S  LN +AFE+  + 
Subjt:  RRQMEPLLQRRLMGPGNEIQL---------------------ATLFRKAGIKIK------QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SG

Query:  ISKEANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE---------DTFVYNKVRKYCNRPWRIW----WTRLKDTNFQNPWTIISTLA
         S E  ++I+ MG L+  +E  +F +  K          +EV E +K          DT   N V K  N   + W    W   + T+F++PWT +S+ A
Subjt:  ISKEANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE---------DTFVYNKVRKYCNRPWRIW----WTRLKDTNFQNPWTIISTLA

Query:  AIVGFVLLILQTLYGIYGY
         +   +L +LQ+   I  Y
Subjt:  AIVGFVLLILQTLYGIYGY

Arabidopsis top hits (e value, % identity, alignment)
AT3G47250.1 Plant protein of unknown function (DUF247); e value 1.7e-24; identity 26.92%
Query:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI
        I +IP+ + +VNPKA+ P+ +S GPYH+G  HL  +++ KFR  + F  R     + E+++    G ++   RA     ++   E  + + +MI+DGC I
Subjt:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI

Query:  LQHLL----------DGDSINQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDILVR---GGCKHLLDMFRLELI
        L  LL          + D I  I      +  D+LLLENQVP  +L+               ++ F   N +IDK      +    G KHLLD+ R   I
Subjt:  LQHLL----------DGDSINQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDILVR---GGCKHLLDMFRLELI

Query:  --LRRQMEPLLQ-----RRLMGPGNEIQ----------LATLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE
          +R   E         +   G   E+            A   R  GIK +     +  LD   K+  L++P + ++  I S  LN +AFE+  +  + +
Subjt:  --LRRQMEPLLQ-----RRLMGPGNEIQ----------LATLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE

Query:  ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------FVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVG
          S+++ MG L+   E  +F    K          +EV + +K        DT       V+  V +Y ++ +   W   + T+F++PWT +S+ A +  
Subjt:  ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------FVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVG

Query:  FVLLILQTLYGIYGYY
         +L + Q  Y I  YY
Subjt:  FVLLILQTLYGIYGYY

AT3G47250.2 Plant protein of unknown function (DUF247); e value 1.7e-24; identity 26.92%
Query:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI
        I +IP+ + +VNPKA+ P+ +S GPYH+G  HL  +++ KFR  + F  R     + E+++    G ++   RA     ++   E  + + +MI+DGC I
Subjt:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI

Query:  LQHLL----------DGDSINQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDILVR---GGCKHLLDMFRLELI
        L  LL          + D I  I      +  D+LLLENQVP  +L+               ++ F   N +IDK      +    G KHLLD+ R   I
Subjt:  LQHLL----------DGDSINQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDILVR---GGCKHLLDMFRLELI

Query:  --LRRQMEPLLQ-----RRLMGPGNEIQ----------LATLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE
          +R   E         +   G   E+            A   R  GIK +     +  LD   K+  L++P + ++  I S  LN +AFE+  +  + +
Subjt:  --LRRQMEPLLQ-----RRLMGPGNEIQ----------LATLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE

Query:  ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------FVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVG
          S+++ MG L+   E  +F    K          +EV + +K        DT       V+  V +Y ++ +   W   + T+F++PWT +S+ A +  
Subjt:  ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------FVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVG

Query:  FVLLILQTLYGIYGYY
         +L + Q  Y I  YY
Subjt:  FVLLILQTLYGIYGYY

AT3G47250.3 Plant protein of unknown function (DUF247); e value 1.7e-24; identity 26.92%
Query:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI
        I +IP+ + +VNPKA+ P+ +S GPYH+G  HL  +++ KFR  + F  R     + E+++    G ++   RA     ++   E  + + +MI+DGC I
Subjt:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI

Query:  LQHLL----------DGDSINQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDILVR---GGCKHLLDMFRLELI
        L  LL          + D I  I      +  D+LLLENQVP  +L+               ++ F   N +IDK      +    G KHLLD+ R   I
Subjt:  LQHLL----------DGDSINQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDILVR---GGCKHLLDMFRLELI

Query:  --LRRQMEPLLQ-----RRLMGPGNEIQ----------LATLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE
          +R   E         +   G   E+            A   R  GIK +     +  LD   K+  L++P + ++  I S  LN +AFE+  +  + +
Subjt:  --LRRQMEPLLQ-----RRLMGPGNEIQ----------LATLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE

Query:  ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------FVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVG
          S+++ MG L+   E  +F    K          +EV + +K        DT       V+  V +Y ++ +   W   + T+F++PWT +S+ A +  
Subjt:  ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------FVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVG

Query:  FVLLILQTLYGIYGYY
         +L + Q  Y I  YY
Subjt:  FVLLILQTLYGIYGYY

AT3G50120.1 Plant protein of unknown function (DUF247); e value 3.1e-26; identity 24.94%
Query:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQ
        I ++P  +++ + K++ PQ +S GPYHHG   L  M++ K+R+     +     ++  +     + E  +  Y   +        +F+E++++DGC +L+
Subjt:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQ

Query:  HLLDG--DSINQI-----------------VLRDMLLLENQVPMTLLQKLHFMVI---NQN----------------------------------IDKKV
         L  G  +   ++                 + RDM++LENQ+P+ +L +L  + +   NQ                                    DK  
Subjt:  HLLDG--DSINQI-----------------VLRDMLLLENQVPMTLLQKLHFMVI---NQN----------------------------------IDKKV

Query:  DILVRGGCKHLLDMFRLELILRR-QMEPLLQRRLMGPGNE---------IQLATLFRKAGIKIKQGPL----DFDEKQGVLRLPFINMNAHIESALLNAM
        D     G  H LD+FR  L+    + EP L R+                I   T  ++AGIK ++       D   K G L +P + ++   +S  LN +
Subjt:  DILVRGGCKHLLDMFRLELILRR-QMEPLLQRRLMGPGNE---------IQLATLFRKAGIKIKQGPL----DFDEKQGVLRLPFINMNAHIESALLNAM

Query:  AFEKLS-GISKEANSFIILMGNLIEKDE---------------------IESFNQLAKSEVLEMWKEDTFVYN---KVRKYCNRPWRIWWTRLKDTNFQN
        AFE+     S +  S+II M NLI+  E                      + FN+L +  V +   ED+++     +V +Y +  W  W   LK   F N
Subjt:  AFEKLS-GISKEANSFIILMGNLIEKDE---------------------IESFNQLAKSEVLEMWKEDTFVYN---KVRKYCNRPWRIWWTRLKDTNFQN

Query:  PWTIISTLAAIVGFVLLILQTLYGIYGYYKPHS
        PW I+S  AA++  VL   Q+ Y +Y YYKP S
Subjt:  PWTIISTLAAIVGFVLLILQTLYGIYGYYKPHS

AT4G31980.1 unknown protein; e value 2.0e-25; identity 26.03%
Query:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVK--RTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI
        I K+PN ++++NP A+ P+ +SFGP H G   L  ME  K+R   +F      S+ED+V+  RTW      Q A     +       +F+E+++VDG  +
Subjt:  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVK--RTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCI

Query:  LQHLL---------DGDSI--NQI----VLRDMLLLENQVPMTLLQKLHFMVIN---QNIDKKVDILVRGGCKHLLDMFRLELILRRQMEPLLQRRLMGP
        ++ LL         + D I  N +    V RDM+L+ENQ+P  +++++  +++N   Q     + +  R     L  +   + I   +    L R    P
Subjt:  LQHLL---------DGDSI--NQI----VLRDMLLLENQVPMTLLQKLHFMVIN---QNIDKKVDILVRGGCKHLLDMFRLELILRRQMEPLLQRRLMGP

Query:  GNEIQL------------ATLFRKAGIKIKQGP-----LDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEK--------
           I+L            AT    AG++ K        LD     GVL++P I ++   ES   N + FE+    +K    +I+L+G  I+         
Subjt:  GNEIQL------------ATLFRKAGIKIKQGP-----LDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEK--------

Query:  -------------DEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGI
                     D    FN ++K  + +     + +   ++ YCN PW  W   L+   F NPW + S  AA++  +L  +Q++  I
Subjt:  -------------DEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGI
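The e values in the hit tables above span many orders of magnitude, so they are easier to compare once parsed as numbers. The sketch below ranks the distinct Arabidopsis hits by ascending e value (a smaller e value means a more significant match). The variable names are illustrative, and the values are copied from the table above.

```python
# E values as listed for the Arabidopsis hits; AT3G47250.2 and .3 repeat the
# AT3G47250.1 value and are left out here for brevity.
hits = {
    "AT3G47250.1": "1.7e-24",
    "AT3G50120.1": "3.1e-26",
    "AT4G31980.1": "2.0e-25",
}

# float() parses scientific notation directly; sort from most to least significant.
ranked = sorted(hits, key=lambda name: float(hits[name]))
best = ranked[0]

for name in ranked:
    print(name, hits[name])
```

By this ordering AT3G50120.1 is the most significant Arabidopsis hit, even though its percent identity (24.94) is the lowest of the three.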


Sequences
CDS sequence
ATGGAAATGGAGCAGCATAAGGATGAGGGGGCCGTAGTTATTGAGCTCACGACTAAAAACCTGACATCGCGATTGGAAAATCTTCCCGACAACCTCAGTGAACTGGAAGT
TATCAGTAAACCAATCCTTAAAATACCAAACGACATAAAAAAGGTTAATCCGAAAGCATTCTTGCCGCAACGGATGTCGTTCGGGCCATATCACCATGGAGTCGTTCATT
TGTATCCGATGGAGAAAATGAAATTTCGATCGTTTCAAACATTTCAACGTTTTGTTGGACTGTCTGTTGAAGACATCGTGAAGCGCACGTGGGGCATGGTGGAAGATTTA
CAAAGAGCCTACGATGATCTTGATGATAAATGGATAAAAGAACCACAAAAATTCTTGGAGCTCATGATTGTGGATGGTTGTTGCATATTGCAACACCTACTCGATGGGGA
TTCAATCAATCAAATTGTGTTGCGGGATATGCTGCTGCTTGAGAATCAGGTGCCCATGACGCTTCTTCAGAAGCTGCATTTCATGGTAATAAACCAAAACATAGACAAGA
AGGTAGATATATTAGTGAGAGGAGGTTGCAAACATCTTTTAGATATGTTCAGGCTAGAATTGATTCTTAGGAGACAAATGGAACCATTACTTCAAAGAAGGCTCATGGGA
CCGGGAAACGAAATTCAGCTAGCAACACTCTTCCGTAAAGCCGGGATCAAAATCAAGCAAGGCCCACTTGATTTTGATGAAAAACAAGGTGTGTTGAGGCTCCCATTCAT
CAATATGAATGCTCACATTGAATCAGCCTTGTTAAATGCAATGGCATTCGAGAAACTTTCAGGGATTTCCAAAGAAGCAAACTCTTTCATTATTCTGATGGGTAATCTGA
TAGAGAAAGATGAGATAGAGTCGTTCAATCAGTTGGCTAAATCTGAGGTTTTGGAAATGTGGAAGGAGGACACTTTTGTATACAATAAAGTGAGAAAGTATTGTAATAGG
CCATGGAGAATATGGTGGACAAGGCTCAAAGATACAAACTTTCAAAATCCTTGGACCATTATCTCCACTCTTGCCGCTATCGTAGGCTTTGTGTTACTAATTCTCCAAAC
CTTATACGGAATCTATGGATACTACAAACCACATTCATCTTGA
mRNA sequence
ATGGAAATGGAGCAGCATAAGGATGAGGGGGCCGTAGTTATTGAGCTCACGACTAAAAACCTGACATCGCGATTGGAAAATCTTCCCGACAACCTCAGTGAACTGGAAGT
TATCAGTAAACCAATCCTTAAAATACCAAACGACATAAAAAAGGTTAATCCGAAAGCATTCTTGCCGCAACGGATGTCGTTCGGGCCATATCACCATGGAGTCGTTCATT
TGTATCCGATGGAGAAAATGAAATTTCGATCGTTTCAAACATTTCAACGTTTTGTTGGACTGTCTGTTGAAGACATCGTGAAGCGCACGTGGGGCATGGTGGAAGATTTA
CAAAGAGCCTACGATGATCTTGATGATAAATGGATAAAAGAACCACAAAAATTCTTGGAGCTCATGATTGTGGATGGTTGTTGCATATTGCAACACCTACTCGATGGGGA
TTCAATCAATCAAATTGTGTTGCGGGATATGCTGCTGCTTGAGAATCAGGTGCCCATGACGCTTCTTCAGAAGCTGCATTTCATGGTAATAAACCAAAACATAGACAAGA
AGGTAGATATATTAGTGAGAGGAGGTTGCAAACATCTTTTAGATATGTTCAGGCTAGAATTGATTCTTAGGAGACAAATGGAACCATTACTTCAAAGAAGGCTCATGGGA
CCGGGAAACGAAATTCAGCTAGCAACACTCTTCCGTAAAGCCGGGATCAAAATCAAGCAAGGCCCACTTGATTTTGATGAAAAACAAGGTGTGTTGAGGCTCCCATTCAT
CAATATGAATGCTCACATTGAATCAGCCTTGTTAAATGCAATGGCATTCGAGAAACTTTCAGGGATTTCCAAAGAAGCAAACTCTTTCATTATTCTGATGGGTAATCTGA
TAGAGAAAGATGAGATAGAGTCGTTCAATCAGTTGGCTAAATCTGAGGTTTTGGAAATGTGGAAGGAGGACACTTTTGTATACAATAAAGTGAGAAAGTATTGTAATAGG
CCATGGAGAATATGGTGGACAAGGCTCAAAGATACAAACTTTCAAAATCCTTGGACCATTATCTCCACTCTTGCCGCTATCGTAGGCTTTGTGTTACTAATTCTCCAAAC
CTTATACGGAATCTATGGATACTACAAACCACATTCATCTTGATCCAACCGCCACTCATATGCTTTCATTTCAATGCTTTTTTTTTTTCAATTTTGTGTGAGGAAAGTTG
CACTTATGAAAATCTAATTGCAAATTTAGCACAAGC
Protein sequence
MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDL
QRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQIVLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILVRGGCKHLLDMFRLELILRRQMEPLLQRRLMG
PGNEIQLATLFRKAGIKIKQGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNR
PWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS
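
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch, assuming the standard genetic code, the first 30 bases of the CDS can be translated and compared with the start of the protein. The names CODON_TABLE and translate are illustrative and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
cds_start = "ATGGAAATGGAGCAGCATAAGGATGAGGGG"
print(translate(cds_start))
```

The result can be checked against the first ten residues of the protein sequence (MEMEQHKDEG); passing the full CDS to translate would allow the same comparison over the whole protein.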