; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G14890 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G14890
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDUF4283 domain-containing protein
Genome locationClcChr02:27411664..27415323
RNA-Seq ExpressionClc02G14890
SyntenyClc02G14890
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034063.1 hypothetical protein E6C27_scaffold65G00490 [Cucumis melo var. makuwa]1.1e-20562.1Show/hide
Query:  SRFFFSPGHQPT-------VSSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSI
        ++ F+SP HQPT        S N+ +    KQPIS SP+F S HSTTRS VC FS SQT  I +++ HSLIAWVVGK IR   LAR LHRHLRLT+   +
Subjt:  SRFFFSPGHQPT-------VSSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSI

Query:  HELDLGYFVLKFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC
         EL LGYFVLKF ETDF ALE NPW IPNLCIY FPW PNF+P EA DS+ID WIRL ELPIEYY E+IL+ I +T+GEALVKIDPITKDRKKCKYARIC
Subjt:  HELDLGYFVLKFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC

Query:  VRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNFSVS-----------------------------SRF-----------QDLNDFP
        VRI++YEPLPS+IR+G+I QEIEYEG D+LCPRC CVVHLKHDC N S S                             SRF           Q+L    
Subjt:  VRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNFSVS-----------------------------SRF-----------QDLNDFP

Query:  TLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVG
           MG SEK+ATRT+SS LLPQS GL +E ++ QKEK G      PNLPK        +S+  EE+SSST+  P+ E  NLNLS     LP     +   
Subjt:  TLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVG

Query:  TSCFTTLQVHNNKPQPSSS--VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVI
        TSC   L+VHNN+PQPSSS    SISTQ     SKT+PTFCSSGI RS+L K+IT+   SQGFGINRRPILY + ESI SFE+GLS++P+SAPKQ QF I
Subjt:  TSCFTTLQVHNNKPQPSSS--VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVI

Query:  SFVSTPRSGTKAISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVF
        SFVSTPRSGTKAISA DSKK+LGWNFRGMDN NLIEGL YMVQKYE SIVVIFGT+ITDDVVEEVVD+L F GSY KKFD+YHGGVWLFMF +DVQTEVF
Subjt:  SFVSTPRSGTKAISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVF

Query:  EVNSYNTQQVA----------EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA
        EVNSY+TQQV+          EPM R SY D   TS Q WG  SFC STYS  NALA
Subjt:  EVNSYNTQQVA----------EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]3.3e-11443.42Show/hide
Query:  GHQPTVSSNYVTGNNFKQPI-SLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFS
        G     +  Y++    K+P+ + S + ESH STT + VCN SPSQT RITQ + HSLIAWV G++IR  +LA RL RHL LT  + + EL LGYFVLKFS
Subjt:  GHQPTVSSNYVTGNNFKQPI-SLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFS

Query:  ETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTI
        ETD+ ALE  PWSIPNLCIY F W P+F+P EA +SS+D WIRLHEL IEYY EEIL+ IA TIG  LVK DP+TK+R+KCK+ARIC+RI+L +PLPS I
Subjt:  ETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTI

Query:  RLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNF---SVSSRFQDLNDFPTLLMGASEKSATRTTSSSLLPQSPGLSS----------------------
        +LG+I+Q+IEYEG+DLLCP CR V  LK +C N    S SS    L D PT            ++SSS  P  P +SS                      
Subjt:  RLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNF---SVSSRFQDLNDFPTLLMGASEKSATRTTSSSLLPQSPGLSS----------------------

Query:  -----------------EAIKNQKEKYGIGLPNLPKVSALQEEASSSTVKTPMLEHNNLNLSLPSLV-------GTSCFTTLQVHNNKPQPSSSVGSIST
                         +A    KE  G  + +LPK   L ++ S+ T K P LE     L  P++V        TS  T +  HNN+P           
Subjt:  -----------------EAIKNQKEKYGIGLPNLPKVSALQEEASSSTVKTPMLEHNNLNLSLPSLV-------GTSCFTTLQVHNNKPQPSSSVGSIST

Query:  QQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMD-ESITSFEIGLSD-SPNSAPKQIQFVISFVSTPR------SGTKAISASD--
             + K    F S+ I RSM  KE+ + P S+   ++  PI++ ++ + I SF++ LS    NS P +  + +  + T R       G+K +S S+  
Subjt:  QQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMD-ESITSFEIGLSD-SPNSAPKQIQFVISFVSTPR------SGTKAISASD--

Query:  SKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA------
        SKK+L W F G DNANL++ LK ++Q +E SIV+IFGTKI+    E VV EL F GSYC+K D Y+GGVWL +  QDVQ    EV+SY+ QQV+      
Subjt:  SKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA------

Query:  ----EPMVRYSYEDTGTSSQIWGPASFCTST
             P       DT TSS  WG   F TST
Subjt:  ----EPMVRYSYEDTGTSSQIWGPASFCTST

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-11243.11Show/hide
Query:  GHQPTVSSNYVTGNNFKQPI-SLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFS
        G     +  Y++    K+P+ + S + ESH STT + VCN SPSQT RITQ + HSLIAWV G++IR  +LA RL RHL LT  + + EL LGYFVLKFS
Subjt:  GHQPTVSSNYVTGNNFKQPI-SLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFS

Query:  ETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTI
        ETD+ ALE  PWSIPNLCIY F W P+F+P EA +SS+D WIRL EL IEYY EEIL+ IA TIG  LVK DP+TK+R+KCK+ARIC+RI+L +PLPS I
Subjt:  ETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTI

Query:  RLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNF---SVSSRFQDLNDFPTLLMGASEKSATRTTSSSLLPQSPGLSS----------------------
        +LG+I+Q+IEYEG+DLLCP CR V  LK +C N    S SS    L D PT            ++SSS  P  P +SS                      
Subjt:  RLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNF---SVSSRFQDLNDFPTLLMGASEKSATRTTSSSLLPQSPGLSS----------------------

Query:  -----------------EAIKNQKEKYGIGLPNLPKVSALQEEASSSTVKTPMLEHNNLNLSLPSLV-------GTSCFTTLQVHNNKPQPSSSVGSIST
                         +A    KE  G  + +LPK   L ++ S+ T K P LE     L  P++V        TS  T +  HNN+P           
Subjt:  -----------------EAIKNQKEKYGIGLPNLPKVSALQEEASSSTVKTPMLEHNNLNLSLPSLV-------GTSCFTTLQVHNNKPQPSSSVGSIST

Query:  QQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMD-ESITSFEIGLSD-SPNSAPKQIQFVISFVSTPR------SGTKAISASD--
             + K    F S+ I RS   KE+ + P S+   ++  PI++ ++ + I SF++ LS    NS P +  + +  + T R       G+K +S S+  
Subjt:  QQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMD-ESITSFEIGLSD-SPNSAPKQIQFVISFVSTPR------SGTKAISASD--

Query:  SKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA------
        SKK+L W F G DNANL++ LK ++Q +E SIV+IFGTKI+    E VV EL F GSYC+K D Y+GGVWL +  QDVQ    EV+SY+ QQV+      
Subjt:  SKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA------

Query:  ----EPMVRYSYEDTGTSSQIWGPASFCTST
             P       DT TSS  WG   F TST
Subjt:  ----EPMVRYSYEDTGTSSQIWGPASFCTST

KAG7031864.1 hypothetical protein SDJN02_05905, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-10141.77Show/hide
Query:  RSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEAT
        ++ VC  S SQT RITQ + HS IAW+ GK++R   +A  L RHL LT  + + EL LGYFVLKF ETDF AL+  PWS+PNLCI+  PW P+F+P E  
Subjt:  RSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEAT

Query:  DSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNF
         SS+D W+RLHEL IEYY +E+LQ IA  IG  LVKIDP+TK+R KCK+ARICVR++L +PLPS IRLG+IRQEIEYEG +LLCP C  V  L+H+C N 
Subjt:  DSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNF

Query:  SV-------------------------------------SSRFQDL----NDFPTLLMGASEKSATRTTSSSLLPQSP-GLSSEAIKNQKEKYGIGLPNL
         +                                      SRFQ L    N++PTL  G S K+ T   +SS    SP  +  +AI  +KEK G+ +  L
Subjt:  SV-------------------------------------SSRFQDL----NDFPTLLMGASEKSATRTTSSSLLPQSP-GLSSEAIKNQKEKYGIGLPNL

Query:  PKVSALQEEASSSTVKTPMLEHNNLNLSLPSLVGTSCFTTLQVHNNKPQPSSSVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRR
        PK      E+S  T+K  +      N                V+   P P+ S+  +        S+ +  F S+ I+RS  TKEIT+ P S+   ++  
Subjt:  PKVSALQEEASSSTVKTPMLEHNNLNLSLPSLVGTSCFTTLQVHNNKPQPSSSVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRR

Query:  PILYMMDESITSFEIGLSDS-PNSAPKQIQFVISFVSTPRSGTK----AISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVE
        P +Y +D  I + +I LS++   S   QIQ+ I FV T R G K    + S S  KK+L W F   DN  LI  LK +++ +E SIV+IFGTKI+   V+
Subjt:  PILYMMDESITSFEIGLSDS-PNSAPKQIQFVISFVSTPRSGTK----AISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVE

Query:  EVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVRYSYEDTGTSSQIWGPASFCTSTYS
        +VV EL F  SY +K D Y GGVWL +  QDV+T   +VNS + QQ+   +    Y D+ T++  + P+   T   S
Subjt:  EVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVRYSYEDTGTSSQIWGPASFCTSTYS

KGN50454.1 hypothetical protein Csa_000484 [Cucumis sativus]1.8e-20061.7Show/hide
Query:  SSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSETDFQAL
        S N+ + N   QPIS SPEF S HSTTRS VC FS SQT  I +++ HSLIAWVVGK IR  +LAR L+RHLRLT    + EL LGYFVLKF ETDF A+
Subjt:  SSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSETDFQAL

Query:  EANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQ
        E NPW IPNLCIY FPW PNF+P EA DS+ID WIRL ELPIEYY E+IL+ I +T+GE LVKIDPITKDRKKCKYARICVRI++YEPLPS+IR+G+I Q
Subjt:  EANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQ

Query:  EIEYEGIDLLCPRCRCVVHLKHDCFNFSVS----------------------------------------SRFQDLNDFPTLLMGASEKSATRTTSSSLL
        EIEYEG DLLCPRC CVVHLKHDC N S S                                        S  Q+L    T  MG SEK+ATR +SSSLL
Subjt:  EIEYEGIDLLCPRCRCVVHLKHDCFNFSVS----------------------------------------SRFQDLNDFPTLLMGASEKSATRTTSSSLL

Query:  PQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVGTSCFTTLQVHNNKPQPSSS-
        PQ  GL +E ++ QKEK G      PNLPK        +S+  EE+SSST+  P+LEH NLNLS     LP     +   T C T L+V+NN+PQPSSS 
Subjt:  PQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVGTSCFTTLQVHNNKPQPSSS-

Query:  -VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISASDSKK
           S+STQ     SKT+PTFCSSGI RS+L K IT+T  SQGFGINRRPI Y + ESI SFE+GLS++P+SAPKQ QF ISFVSTPRSGTK ISA DSKK
Subjt:  -VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISASDSKK

Query:  LLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA---------
        +LGWNFRGMDN NLIEGL YMVQKYE SIVVIFGT+ITD+VVEEVVD+L F GSY KKFD+YHGGVWLFMF +DVQTEVFEVNSY+T+QV+         
Subjt:  LLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA---------

Query:  -EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA
         EPM R+SY D   TS Q WG  SF  +TYS  NALA
Subjt:  -EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA

TrEMBL top hitse value%identityAlignment
A0A0A0KNJ5 DUF4283 domain-containing protein8.6e-20161.7Show/hide
Query:  SSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSETDFQAL
        S N+ + N   QPIS SPEF S HSTTRS VC FS SQT  I +++ HSLIAWVVGK IR  +LAR L+RHLRLT    + EL LGYFVLKF ETDF A+
Subjt:  SSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSETDFQAL

Query:  EANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQ
        E NPW IPNLCIY FPW PNF+P EA DS+ID WIRL ELPIEYY E+IL+ I +T+GE LVKIDPITKDRKKCKYARICVRI++YEPLPS+IR+G+I Q
Subjt:  EANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQ

Query:  EIEYEGIDLLCPRCRCVVHLKHDCFNFSVS----------------------------------------SRFQDLNDFPTLLMGASEKSATRTTSSSLL
        EIEYEG DLLCPRC CVVHLKHDC N S S                                        S  Q+L    T  MG SEK+ATR +SSSLL
Subjt:  EIEYEGIDLLCPRCRCVVHLKHDCFNFSVS----------------------------------------SRFQDLNDFPTLLMGASEKSATRTTSSSLL

Query:  PQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVGTSCFTTLQVHNNKPQPSSS-
        PQ  GL +E ++ QKEK G      PNLPK        +S+  EE+SSST+  P+LEH NLNLS     LP     +   T C T L+V+NN+PQPSSS 
Subjt:  PQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVGTSCFTTLQVHNNKPQPSSS-

Query:  -VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISASDSKK
           S+STQ     SKT+PTFCSSGI RS+L K IT+T  SQGFGINRRPI Y + ESI SFE+GLS++P+SAPKQ QF ISFVSTPRSGTK ISA DSKK
Subjt:  -VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISASDSKK

Query:  LLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA---------
        +LGWNFRGMDN NLIEGL YMVQKYE SIVVIFGT+ITD+VVEEVVD+L F GSY KKFD+YHGGVWLFMF +DVQTEVFEVNSY+T+QV+         
Subjt:  LLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVA---------

Query:  -EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA
         EPM R+SY D   TS Q WG  SF  +TYS  NALA
Subjt:  -EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA

A0A5A7SSJ3 DUF4283 domain-containing protein1.7e-10040.38Show/hide
Query:  GHQPTVSSNYVTGNNFK--QPISLSPEFESHHSTTRSLV--CNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVL
        G     + NY++    K   PIS S +FES  STT + V  CN +PS+T RITQ + HSLIA VVGK+ R  +LA RL  HLRLT  + + EL LGYFVL
Subjt:  GHQPTVSSNYVTGNNFK--QPISLSPEFESHHSTTRSLV--CNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVL

Query:  KFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLP
        KFSETD+ ALE  PWSIPNLCI+ FPW P+F+P EA +SS++ WIRL EL IEYY  EIL+ IA+ IG  LVKIDP+T+DR KCK+AR C+ ++L +PLP
Subjt:  KFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLP

Query:  STIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCF---NFSVSSRFQDLNDFP--TLLMGASEKSATRTTSSSLLPQSPGLSS-----------------
        S I LG+IRQ IEYEG + LC +C  V  L+HDC    N S S  F    D P  ++     E  +T ++   L+P+S  +S+                 
Subjt:  STIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCF---NFSVSSRFQDLNDFP--TLLMGASEKSATRTTSSSLLPQSPGLSS-----------------

Query:  -----------------------------EAIKNQKEKYGIG---LPNLPKVSALQEEASSSTVKTPMLEHNNLNLSLPSLVGTSCFTTLQVHNNKPQPS
                                     +AI  +KEK  I    LP+LPK     +++S+ T+K P L+    ++    L       +  + ++  QP 
Subjt:  -----------------------------EAIKNQKEKYGIG---LPNLPKVSALQEEASSSTVKTPMLEHNNLNLSLPSLVGTSCFTTLQVHNNKPQPS

Query:  SSVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMD-ESITSFEIGLSD-SPNSAPKQIQFVISFVSTPRSGTK------
        S   SI   Q    S+    F S  I      +EI N+P S+    +  P +Y +D + ITS  I LS+    S   Q Q+ I  V T + G K      
Subjt:  SSVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMD-ESITSFEIGLSD-SPNSAPKQIQFVISFVSTPRSGTK------

Query:  --AISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQ
          + S   +KK+L W F  MDNA L+  LK ++Q +E SIV+IFG KIT     +V+ EL F GSY  + D Y+GGVWL +  QDVQT   +VNSY+ QQ
Subjt:  --AISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQ

Query:  VAEPMVRY----------SYEDTGTSSQIWGPASFCTST
        V+  +  +          S  DT TSS  WG   F TST
Subjt:  VAEPMVRY----------SYEDTGTSSQIWGPASFCTST

A0A5A7SUD3 DUF4283 domain-containing protein5.2e-20662.1Show/hide
Query:  SRFFFSPGHQPT-------VSSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSI
        ++ F+SP HQPT        S N+ +    KQPIS SP+F S HSTTRS VC FS SQT  I +++ HSLIAWVVGK IR   LAR LHRHLRLT+   +
Subjt:  SRFFFSPGHQPT-------VSSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSI

Query:  HELDLGYFVLKFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC
         EL LGYFVLKF ETDF ALE NPW IPNLCIY FPW PNF+P EA DS+ID WIRL ELPIEYY E+IL+ I +T+GEALVKIDPITKDRKKCKYARIC
Subjt:  HELDLGYFVLKFSETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC

Query:  VRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNFSVS-----------------------------SRF-----------QDLNDFP
        VRI++YEPLPS+IR+G+I QEIEYEG D+LCPRC CVVHLKHDC N S S                             SRF           Q+L    
Subjt:  VRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDCFNFSVS-----------------------------SRF-----------QDLNDFP

Query:  TLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVG
           MG SEK+ATRT+SS LLPQS GL +E ++ QKEK G      PNLPK        +S+  EE+SSST+  P+ E  NLNLS     LP     +   
Subjt:  TLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIG---LPNLPK--------VSALQEEASSSTVKTPMLEHNNLNLS-----LP-----SLVG

Query:  TSCFTTLQVHNNKPQPSSS--VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVI
        TSC   L+VHNN+PQPSSS    SISTQ     SKT+PTFCSSGI RS+L K+IT+   SQGFGINRRPILY + ESI SFE+GLS++P+SAPKQ QF I
Subjt:  TSCFTTLQVHNNKPQPSSS--VGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVI

Query:  SFVSTPRSGTKAISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVF
        SFVSTPRSGTKAISA DSKK+LGWNFRGMDN NLIEGL YMVQKYE SIVVIFGT+ITDDVVEEVVD+L F GSY KKFD+YHGGVWLFMF +DVQTEVF
Subjt:  SFVSTPRSGTKAISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVF

Query:  EVNSYNTQQVA----------EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA
        EVNSY+TQQV+          EPM R SY D   TS Q WG  SFC STYS  NALA
Subjt:  EVNSYNTQQVA----------EPMVRYSYED-TGTSSQIWGPASFCTSTYSTANALA

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X21.0e-10043.9Show/hide
Query:  STTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSET--DFQALEANPWSIPNLCIYTFPWIPNFR
        ST  + VCN +PSQT RI Q +  SLI WVVGK I   +LA RL R+L L   L + EL LG+FVLKFS     ++ALE  PWSIP+LCIY FPWIPNF+
Subjt:  STTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSET--DFQALEANPWSIPNLCIYTFPWIPNFR

Query:  PYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKH
        P EA+   +D WIRL EL IEYY +E+L+ IAETIG  LVKIDP+T  R+KC YARIC+R++L  PL  + + G+  Q+I YEG+DLLC  C CV  LKH
Subjt:  PYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKH

Query:  DCF-NFSVSSRFQDLNDF--PTLLMGASEKSATRTTSSSLLPQSPGLSS----------------EAIKNQKEKYGIGL---PNLPKVSALQE-EASSST
        DC  N S SS F   +    P    G+S  S     SSS L Q+P  SS                 A  ++ +   + L   P+LP   + +E + S S 
Subjt:  DCF-NFSVSSRFQDLNDF--PTLLMGASEKSATRTTSSSLLPQSPGLSS----------------EAIKNQKEKYGIGL---PNLPKVSALQE-EASSST

Query:  VKTPMLEHNNLNLSLP-----------SLVGTSCFTTLQVHNNKPQPSS-SVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPI
           P+L+  NL  S+P               TS  TTL V NN+PQPSS ++ SI+  Q  S  +    F S+ I++S + K I NTP S+   ++  P 
Subjt:  VKTPMLEHNNLNLSLP-----------SLVGTSCFTTLQVHNNKPQPSS-SVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPI

Query:  LYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISAS-DSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDEL
        +Y +D +ITS  I L +   +  +  Q   +    P S   ++SAS  SKK+L WNFR  DNA L+  LK ++Q ++ SIV+IFGTKI+    + VV EL
Subjt:  LYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISAS-DSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDEL

Query:  GFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVRYS
         F GSYC+K D Y GG WL +  QDVQ    EV+SY+ QQV+  ++ +S
Subjt:  GFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVRYS

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X12.3e-10043.68Show/hide
Query:  STTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSET--DFQALEANPWSIPNLCIYTFPWIPNFR
        ST  + VCN +PSQT RI Q +  SLI WVVGK I   +LA RL R+L L   L + EL LG+FVLKFS     ++ALE  PWSIP+LCIY FPWIPNF+
Subjt:  STTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFSET--DFQALEANPWSIPNLCIYTFPWIPNFR

Query:  PYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKH
        P EA+   +D WIRL EL IEYY +E+L+ IAETIG  LVKIDP+T  R+KC YARIC+R++L  PL  + + G+  Q+I YEG+DLLC  C CV  LKH
Subjt:  PYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKH

Query:  DCF-NFSVSSRFQDLNDF--PTLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIGLPNLPKVSA------------LQEE-------------
        DC  N S SS F   +    P    G+S  S     SSS L  +P LSS +  N   K  + +P+ P  ++            L EE             
Subjt:  DCF-NFSVSSRFQDLNDF--PTLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIGLPNLPKVSA------------LQEE-------------

Query:  ASSSTVKTPMLEHNNLNLSLP-----------SLVGTSCFTTLQVHNNKPQPSS-SVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGI
         S S    P+L+  NL  S+P               TS  TTL V NN+PQPSS ++ SI+  Q  S  +    F S+ I++S + K I NTP S+   +
Subjt:  ASSSTVKTPMLEHNNLNLSLP-----------SLVGTSCFTTLQVHNNKPQPSS-SVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGI

Query:  NRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISAS-DSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEE
        +  P +Y +D +ITS  I L +   +  +  Q   +    P S   ++SAS  SKK+L WNFR  DNA L+  LK ++Q ++ SIV+IFGTKI+    + 
Subjt:  NRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTPRSGTKAISAS-DSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEE

Query:  VVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVRYS
        VV EL F GSYC+K D Y GG WL +  QDVQ    EV+SY+ QQV+  ++ +S
Subjt:  VVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVRYS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding7.4e-1928.25Show/hide
Query:  LIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKF--SETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGE
        +I  V+G  I    L R+L    + +  +++ +L   +F+++F   E    AL   PW +    +    W   F P      +   W+RL  +P  YY  
Subjt:  LIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKF--SETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGE

Query:  EILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDC
         +L  IA  +G  L K+D  T +  K ++AR+C+ ++L +PL  T+ +   R  + YEG+  +C  C    HL H C
Subjt:  EILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIEYEGIDLLCPRCRCVVHLKHDC

AT2G13450.1 unknown protein2.1e-0524.14Show/hide
Query:  SETD-FQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPS
        SE D    L   PW   N  +    W  N   +  T  SI+ W+++  +P+ Y  EE    IA  +GE ++ +D       +  Y R+ +R  + + L  
Subjt:  SETD-FQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPS

Query:  TIRL----GQIRQ-EIEYEGIDLLCPRCRCVVHLKHDCFNFSVSS
         +R+    G+      +YE +  +C  C  + H ++ C    + S
Subjt:  TIRL----GQIRQ-EIEYEGIDLLCPRCRCVVHLKHDCFNFSVSS

AT2G41590.1 unknown protein1.6e-0524.88Show/hide
Query:  KNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKF-SETDFQALE-ANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIA
        +N+ S  +A  L R   LT+ +    LD  Y    F +E D   ++   PW   N  +    W     P     ++ID W+++  +P+ Y  EE +  IA
Subjt:  KNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKF-SETDFQALE-ANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIA

Query:  ETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRL----GQIRQ-EIEYEGIDLLCPRCRCVVHLKHDC-FNFSVSSRFQDLNDFPTLLMGASE
        + +GE L+ +D       +  Y R+ VR  + + L    R+    G+      +YE +  +C  C    H +  C +     S  ++   F   +  +S 
Subjt:  ETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRL----GQIRQ-EIEYEGIDLLCPRCRCVVHLKHDC-FNFSVSSRFQDLNDFPTLLMGASE

Query:  KSATRTTSSSL-LPQSP
         S ++ T SS  +PQ+P
Subjt:  KSATRTTSSSL-LPQSP

AT3G47920.1 unknown protein6.1e-0526.17Show/hide
Query:  LDLGYFVLKF-SETDFQALEANP-WSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC
        LD  Y    F +E D  +++    W   N  +    W P   P     ++ID W+++  +P+ Y  EE    IA  IGE ++ +D       +  Y R+ 
Subjt:  LDLGYFVLKF-SETDFQALEANP-WSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC

Query:  VRIHLYEPLPSTIRLGQIRQE-----IEYEGIDLLCPRCRCVVHLKHDC
        VRI + + L    R+     E      +YE +  +C  C  V H ++ C
Subjt:  VRIHLYEPLPSTIRLGQIRQE-----IEYEGIDLLCPRCRCVVHLKHDC

AT5G36228.1 nucleic acid binding;zinc ion binding1.3e-0730.2Show/hide
Query:  LDLGYFVLKF-SETD-FQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC
        LD   F ++F SE D    L   PW      I    W  +F P E   + ID W+ +  +P+ Y  E  ++IIA T+GE +V +D   +   +  + R+ 
Subjt:  LDLGYFVLKF-SETD-FQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARIC

Query:  VRIHLYEPLPSTIRLGQIRQE-----IEYEGIDLLCPRCRCVVHLKHDC
        VR+   EPL    R+    +E      EYE +  +C  C  V H    C
Subjt:  VRIHLYEPLPSTIRLGQIRQE-----IEYEGIDLLCPRCRCVVHLKHDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCCAATCCAGATTTTTCTTCTCCCCCGGCCACCAGCCCACCGTGTCAAGCAACTATGTCACCGGAAACAACTTCAAACAACCAATTTCTCTCTCACCGGAATT
CGAATCGCATCATTCAACCACCCGCTCCCTCGTCTGTAACTTCTCTCCCTCTCAAACTCATCGTATTACTCAAGACTACACTCATTCCCTCATAGCATGGGTCGTCGGGA
AAAACATCCGTTCACCGGAACTCGCTCGTCGCCTTCACCGTCATCTTCGTCTCACCGATCATCTGAGTATCCACGAGCTAGATCTCGGGTATTTCGTGCTTAAATTCTCT
GAGACTGATTTTCAAGCCTTAGAAGCCAATCCATGGTCAATCCCTAATCTCTGCATCTACACCTTCCCATGGATTCCCAATTTCAGACCCTACGAGGCGACGGATTCTTC
TATAGATGCCTGGATTCGCCTCCATGAACTCCCCATCGAGTATTACGGCGAAGAAATTCTGCAAATTATTGCAGAGACCATCGGCGAAGCTCTCGTCAAAATTGACCCCA
TTACCAAAGATCGCAAGAAGTGTAAGTATGCTCGGATCTGTGTCAGAATTCATTTGTATGAGCCACTTCCATCTACGATCAGACTCGGTCAAATTCGACAGGAAATTGAG
TATGAGGGTATTGACCTGTTGTGCCCTCGCTGTAGATGCGTTGTTCATCTAAAGCATGATTGTTTCAATTTTTCGGTTTCTTCTAGATTTCAGGATTTGAATGATTTCCC
AACCCTTTTAATGGGTGCCTCTGAAAAATCAGCTACAAGAACAACCTCAAGCTCTCTTCTACCTCAGTCACCTGGATTGTCATCTGAAGCAATTAAAAACCAGAAGGAGA
AATATGGAATTGGACTGCCCAACTTGCCTAAAGTGTCTGCTCTACAAGAAGAAGCATCTTCATCAACAGTTAAAACTCCTATGTTAGAACACAACAATTTGAATCTGTCT
CTCCCTTCACTTGTCGGAACCAGCTGCTTCACCACACTCCAAGTCCACAACAACAAACCACAACCATCATCATCGGTTGGGAGCATTTCGACCCAACAATGGCCGTCTAT
TTCAAAGACTGTCCCCACGTTCTGTTCCAGTGGGATCGAACGCTCGATGCTTACGAAAGAGATAACCAACACACCATTATCTCAAGGATTCGGTATCAATCGTCGTCCAA
TTCTCTACATGATGGATGAATCTATCACGAGCTTTGAAATTGGTCTTTCAGACAGTCCAAACTCTGCACCAAAACAAATCCAGTTTGTTATCAGCTTTGTGTCAACTCCA
AGAAGTGGAACGAAGGCGATATCGGCATCGGATTCTAAGAAACTATTGGGCTGGAATTTTCGTGGGATGGACAATGCCAATCTAATAGAAGGATTAAAATACATGGTTCA
AAAGTATGAGACATCCATTGTAGTGATCTTTGGCACCAAGATCACTGATGATGTTGTGGAGGAGGTCGTGGATGAGCTCGGTTTCCACGGTTCGTATTGTAAGAAGTTTG
ATGACTATCATGGGGGTGTTTGGTTGTTTATGTTCGGACAAGATGTGCAAACTGAAGTCTTTGAAGTTAACTCATACAACACACAACAGGTTGCTGAACCAATGGTTAGA
TATTCATATGAAGATACCGGAACATCATCGCAAATATGGGGACCTGCATCGTTCTGTACTTCCACATATTCGACGGCTAATGCATTGGCATGCTGA
mRNA sequenceShow/hide mRNA sequence
ATCATATCTATCATGAACCATCATCCTTCAAAATTTCTTCGTTGTTGTTGAAAATCTAAGAAATTTCTTTTCACTTGAATTGAAGAACACCAAAATGAAACAATATAATC
GCTATGAATGTTTAAATTCCAATCCAAAATTCTTTCAATTCTTTCAACAATCTCATCGTCCCAACAAATCCCATGGCAACCCAATCCAGATTTTTCTTCTCCCCCGGCCA
CCAGCCCACCGTGTCAAGCAACTATGTCACCGGAAACAACTTCAAACAACCAATTTCTCTCTCACCGGAATTCGAATCGCATCATTCAACCACCCGCTCCCTCGTCTGTA
ACTTCTCTCCCTCTCAAACTCATCGTATTACTCAAGACTACACTCATTCCCTCATAGCATGGGTCGTCGGGAAAAACATCCGTTCACCGGAACTCGCTCGTCGCCTTCAC
CGTCATCTTCGTCTCACCGATCATCTGAGTATCCACGAGCTAGATCTCGGGTATTTCGTGCTTAAATTCTCTGAGACTGATTTTCAAGCCTTAGAAGCCAATCCATGGTC
AATCCCTAATCTCTGCATCTACACCTTCCCATGGATTCCCAATTTCAGACCCTACGAGGCGACGGATTCTTCTATAGATGCCTGGATTCGCCTCCATGAACTCCCCATCG
AGTATTACGGCGAAGAAATTCTGCAAATTATTGCAGAGACCATCGGCGAAGCTCTCGTCAAAATTGACCCCATTACCAAAGATCGCAAGAAGTGTAAGTATGCTCGGATC
TGTGTCAGAATTCATTTGTATGAGCCACTTCCATCTACGATCAGACTCGGTCAAATTCGACAGGAAATTGAGTATGAGGGTATTGACCTGTTGTGCCCTCGCTGTAGATG
CGTTGTTCATCTAAAGCATGATTGTTTCAATTTTTCGGTTTCTTCTAGATTTCAGGATTTGAATGATTTCCCAACCCTTTTAATGGGTGCCTCTGAAAAATCAGCTACAA
GAACAACCTCAAGCTCTCTTCTACCTCAGTCACCTGGATTGTCATCTGAAGCAATTAAAAACCAGAAGGAGAAATATGGAATTGGACTGCCCAACTTGCCTAAAGTGTCT
GCTCTACAAGAAGAAGCATCTTCATCAACAGTTAAAACTCCTATGTTAGAACACAACAATTTGAATCTGTCTCTCCCTTCACTTGTCGGAACCAGCTGCTTCACCACACT
CCAAGTCCACAACAACAAACCACAACCATCATCATCGGTTGGGAGCATTTCGACCCAACAATGGCCGTCTATTTCAAAGACTGTCCCCACGTTCTGTTCCAGTGGGATCG
AACGCTCGATGCTTACGAAAGAGATAACCAACACACCATTATCTCAAGGATTCGGTATCAATCGTCGTCCAATTCTCTACATGATGGATGAATCTATCACGAGCTTTGAA
ATTGGTCTTTCAGACAGTCCAAACTCTGCACCAAAACAAATCCAGTTTGTTATCAGCTTTGTGTCAACTCCAAGAAGTGGAACGAAGGCGATATCGGCATCGGATTCTAA
GAAACTATTGGGCTGGAATTTTCGTGGGATGGACAATGCCAATCTAATAGAAGGATTAAAATACATGGTTCAAAAGTATGAGACATCCATTGTAGTGATCTTTGGCACCA
AGATCACTGATGATGTTGTGGAGGAGGTCGTGGATGAGCTCGGTTTCCACGGTTCGTATTGTAAGAAGTTTGATGACTATCATGGGGGTGTTTGGTTGTTTATGTTCGGA
CAAGATGTGCAAACTGAAGTCTTTGAAGTTAACTCATACAACACACAACAGGTTGCTGAACCAATGGTTAGATATTCATATGAAGATACCGGAACATCATCGCAAATATG
GGGACCTGCATCGTTCTGTACTTCCACATATTCGACGGCTAATGCATTGGCATGCTGATGAGATTAAGGATCAACTTTCAATGAAGATCACTTTTCATAGAAAGCAAACT
AGACGTCCGTCTCAACTTTTTTGTAACGTGTTTGTTTCGGGGACCAAATTACTGTTTTTGTTATGTAACTTTATATGTCAAACTTTTTCTCGTGTTTATTGCAACTTTAC
TTGGGTGTTATATAAAACACTTGTAATAAGACTTTTATGATAATATTGAATGAATTTTCATTAGTGTTTGGTATAA
Protein sequenceShow/hide protein sequence
MATQSRFFFSPGHQPTVSSNYVTGNNFKQPISLSPEFESHHSTTRSLVCNFSPSQTHRITQDYTHSLIAWVVGKNIRSPELARRLHRHLRLTDHLSIHELDLGYFVLKFS
ETDFQALEANPWSIPNLCIYTFPWIPNFRPYEATDSSIDAWIRLHELPIEYYGEEILQIIAETIGEALVKIDPITKDRKKCKYARICVRIHLYEPLPSTIRLGQIRQEIE
YEGIDLLCPRCRCVVHLKHDCFNFSVSSRFQDLNDFPTLLMGASEKSATRTTSSSLLPQSPGLSSEAIKNQKEKYGIGLPNLPKVSALQEEASSSTVKTPMLEHNNLNLS
LPSLVGTSCFTTLQVHNNKPQPSSSVGSISTQQWPSISKTVPTFCSSGIERSMLTKEITNTPLSQGFGINRRPILYMMDESITSFEIGLSDSPNSAPKQIQFVISFVSTP
RSGTKAISASDSKKLLGWNFRGMDNANLIEGLKYMVQKYETSIVVIFGTKITDDVVEEVVDELGFHGSYCKKFDDYHGGVWLFMFGQDVQTEVFEVNSYNTQQVAEPMVR
YSYEDTGTSSQIWGPASFCTSTYSTANALAC