CuGenDBv2

Tan0016146 (gene) of Snake gourd v1 genome

Gene ID: Tan0016146
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UPF0496 protein 3-like
Genome location: LG11:8966849..8968031
RNA-Seq Expression: Tan0016146
Synteny: Tan0016146
Gene Ontology terms: NA
InterPro domains: IPR007749 - Protein of unknown function DUF677
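The genome location in the record above uses a `SEQID:START..END` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly), the string can be parsed and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


# Location string copied from this gene record.
seqid, start, end = parse_location("LG11:8966849..8968031")
length = span_length(start, end)
print(seqid, start, end, length)
```

Under that assumption the span covers the whole gene model on linkage group LG11, so it can be longer than the CDS listed further down the page.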


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602186.1 UPF0496 protein 3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.9e-137; % identity: 75.7)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL
        MW KFRAS+     +  + K  I+N  QKSFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSSS+ D   RK      LQ DQL+   SI+ESP L+
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL

Query:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA
        MLPEL+GLLID+FNVSAEASNLC+R+LANLKL RSNSR++Q+ +DSIEKCSSP+++ETI SDLLA RAP SDLDKRDFA IHD+YAAVS  LN TRKKVA
Subjt:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA

Query:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA
        RKIRSIK+IDRFTCGLVAIT+R LTALVMA   GP     RLKS RRKL++HQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKA
Subjt:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA

Query:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        MVRLFVERKEDKFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC VTI+R+RASVINQM
Subjt:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
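The unlabeled middle line of each alignment block summarises how the query and subject residues compare. As a rough sketch, assuming the usual BLASTP convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identities and positives for one block can be counted as below. `midline_stats` is a hypothetical helper. The percentages reported on this page cover each full alignment and may be computed differently, so a single block is not expected to reproduce them.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    ASSUMPTION: letters in the midline are identities, '+' marks a
    conservative substitution, and spaces are mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
    }


# Final block of the KAG6602186.1 alignment, copied from this page.
query = "MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM"
midline = "MVRLFVERKEDKFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC VTI+R+RASVINQM"

stats = midline_stats(query, midline)
percent_identity = 100.0 * stats["identities"] / stats["length"]
print(stats, round(percent_identity, 2))
```

Summing `identities` and `length` over every block of a hit before dividing would give a whole-alignment figure under the same assumption.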

KAG7032870.1 UPF0496 protein 3, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.1e-136; % identity: 75.42)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL
        MW KFRAS+     +  + K  I+N  QKSFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSSS+ D   RK      LQ DQL+   SI+ESP L+
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL

Query:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA
        MLPEL+GLLID+FNVSAEASNLC+R+L NLKL RSNSR++Q+ +DSIEKCSSP+++ETI SDLLA RAP SDLDKRDFA IHD+YAAVS  LN TRKKVA
Subjt:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA

Query:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA
        RKIRSIK+IDRFTCGLVAIT+R LTALVMA   GP     RLKS RRKL++HQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKA
Subjt:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA

Query:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        MVRLFVERKEDKFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC VTI+R+RASVINQM
Subjt:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

XP_022959620.1 UPF0496 protein 3-like [Cucurbita moschata] (e value: 5.1e-137; % identity: 75.42)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL
        MW KFRAS+     +  + K  I+N  QKSFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSSS+ D   RK      LQ DQL+   SI+ESP L+
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL

Query:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA
        MLPEL+GLLID+FNVSAEASNLC+R+LANLKL RSNSR++Q+ +DSIEKCSSP+++ETI SDLLA RAP SDLDKRDFA IHD+YAAVS  LN TRKKVA
Subjt:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA

Query:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA
        RKIRSIK+IDRFTCGLVAIT+R LTALVMA   GP     RLKS RRKL++HQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKA
Subjt:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA

Query:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        MVRLFVERKE+KFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC+VTI+R+RASVINQM
Subjt:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

XP_022990396.1 UPF0496 protein 3-like [Cucurbita maxima] (e value: 2.8e-135; % identity: 74.44)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK----LQPDQLDVILSIIESPFLLML
        MW KFRAS+     +  + K  I+N  QKSFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSS+    R+      LQ DQL+   SI+ESP L+ML
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK----LQPDQLDVILSIIESPFLLML

Query:  PELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARK
        PEL+GLLID+FNVSAEASNLC+R+LANLKL RSNSR++Q+ +DSIEKCSSP+++ETI SDLLA RAP SDLDKRDFA IHD+YAAVS  LN TRKKVARK
Subjt:  PELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARK

Query:  IRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV
        IRSIK+I++ TCGLVAIT+R LTALVMA   GP     RLKS RRKL++HQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKAMV
Subjt:  IRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV

Query:  RLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        RLFVERKEDKFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC+VTI+R+RASVINQM
Subjt:  RLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

XP_023542867.1 UPF0496 protein 3-like [Cucurbita pepo subsp. pepo] (e value: 2.5e-136; % identity: 74.51)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRR-----KLQPDQLDVILSIIESPFLLM
        MW KFRAS+     +  + K  I+N  Q+SFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSSS+    R+      LQ DQL+   SI+ESP L+M
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRR-----KLQPDQLDVILSIIESPFLLM

Query:  LPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVAR
        LPEL+GLLID+FNVSAEASNLC+R+LANLKL RSNSR++Q+ +DSIEKC SP+++ETI SDLLA RAP SDLDKRDFA IHD YAAVS  LN TRKKVAR
Subjt:  LPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVAR

Query:  KIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAM
        KIRSIK+IDR+TCGLVAIT+R LTALV+A   GP     RLKS RRKL+QHQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKAM
Subjt:  KIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAM

Query:  VRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        VRLFVERKEDKFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC+VTI+R+RASVINQM
Subjt:  VRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B3L4 UPF0496 protein 3-like (e value: 4.6e-115; % identity: 63.69)
Query:  MWAKFRASKIRNN----------------EKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTD-------RRRRKLQPDQLD
        MW KF  SKI N+                E ++I+   ++SFNVNEEYLC LRT+SF EFF K +  VHESPPST+SSS+             LQP QL+
Subjt:  MWAKFRASKIRNN----------------EKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTD-------RRRRKLQPDQLD

Query:  VILSIIESPFLLMLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAV
         + SI+ES FLLMLPELKGL +DYFN+SA+AS+LC R+LAN KLTRS SR IQ+S+DSIEKC S E +E+IAS+LLALR+PFSDL+KRDFALIHD+Y  +
Subjt:  VILSIIESPFLLMLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAV

Query:  SFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLV
        S RLN TRKKVARKIRS+K++D  TCGL AIT RTLT LV A   GP  FG       RKL++H+MLRNGGLEKVGE+LEAAAKGSYIL RE +TTSRLV
Subjt:  SFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLV

Query:  IRLDDAVDHGKAMVRLFVER-KEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVIN
        +RL DAVD+GKAMVRLF  R KEDKF V VAMDE+KK+N +IRK+VEDVEEHL LC+V I+R++AS+IN
Subjt:  IRLDDAVDHGKAMVRLFVER-KEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVIN

A0A5D3DGL5 UPF0496 protein 3-like (e value: 4.6e-115; % identity: 63.69)
Query:  MWAKFRASKIRNN----------------EKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTD-------RRRRKLQPDQLD
        MW KF  SKI N+                E ++I+   ++SFNVNEEYLC LRT+SF EFF K +  VHESPPST+SSS+             LQP QL+
Subjt:  MWAKFRASKIRNN----------------EKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTD-------RRRRKLQPDQLD

Query:  VILSIIESPFLLMLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAV
         + SI+ES FLLMLPELKGL +DYFN+SA+AS+LC R+LAN KLTRS SR IQ+S+DSIEKC S E +E+IAS+LLALR+PFSDL+KRDFALIHD+Y  +
Subjt:  VILSIIESPFLLMLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAV

Query:  SFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLV
        S RLN TRKKVARKIRS+K++D  TCGL AIT RTLT LV A   GP  FG       RKL++H+MLRNGGLEKVGE+LEAAAKGSYIL RE +TTSRLV
Subjt:  SFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLV

Query:  IRLDDAVDHGKAMVRLFVER-KEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVIN
        +RL DAVD+GKAMVRLF  R KEDKF V VAMDE+KK+N +IRK+VEDVEEHL LC+V I+R++AS+IN
Subjt:  IRLDDAVDHGKAMVRLFVER-KEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVIN

A0A6J1BWZ5 UPF0496 protein 3-like (e value: 1.9e-121; % identity: 65.62)
Query:  MWAKFRASKIRN-----------------------------NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHE-SPPSTSSSSTDRRRRK
        MW K R  KIRN                             + KQ I+N  QKSFNVN+EY+CALRTKS++EFF KAQ  + E SPPSTSSS+  RRRRK
Subjt:  MWAKFRASKIRN-----------------------------NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHE-SPPSTSSSSTDRRRRK

Query:  ------LQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDK
              L+P Q + I SI+ESPFLLMLP+LK LLIDYFNVSAEAS  C R+L +++LTRSNSR IQ+S+DSIE CSSPE +ETIAS+ LALR PFSD DK
Subjt:  ------LQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDK

Query:  RDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSY
         DFALIHD+Y AVS RLN TRKKVARKI+SI++I+  +CGLVAITARTLT L          F    +SFRRKL+++QM+RNGGL +VGEQLEAAAKGSY
Subjt:  RDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSY

Query:  ILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQMQ
        ILNREFDTTSRLV RLDDA+DHGKAM RLFVERKEDKFAV VAMDELKKSNL +R QVE+VEEHLYLC+VTI+RAR  VINQ+Q
Subjt:  ILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQMQ

A0A6J1H6T3 UPF0496 protein 3-like (e value: 2.5e-137; % identity: 75.42)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL
        MW KFRAS+     +  + K  I+N  QKSFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSSS+ D   RK      LQ DQL+   SI+ESP L+
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK------LQPDQLDVILSIIESPFLL

Query:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA
        MLPEL+GLLID+FNVSAEASNLC+R+LANLKL RSNSR++Q+ +DSIEKCSSP+++ETI SDLLA RAP SDLDKRDFA IHD+YAAVS  LN TRKKVA
Subjt:  MLPELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVA

Query:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA
        RKIRSIK+IDRFTCGLVAIT+R LTALVMA   GP     RLKS RRKL++HQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKA
Subjt:  RKIRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKA

Query:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        MVRLFVERKE+KFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC+VTI+R+RASVINQM
Subjt:  MVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

A0A6J1JMU4 UPF0496 protein 3-like (e value: 1.4e-135; % identity: 74.44)
Query:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK----LQPDQLDVILSIIESPFLLML
        MW KFRAS+     +  + K  I+N  QKSFNVNEEYLCALR++SF+EFF KAQL V ESPPSTSS+    R+      LQ DQL+   SI+ESP L+ML
Subjt:  MWAKFRASK-----IRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRK----LQPDQLDVILSIIESPFLLML

Query:  PELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARK
        PEL+GLLID+FNVSAEASNLC+R+LANLKL RSNSR++Q+ +DSIEKCSSP+++ETI SDLLA RAP SDLDKRDFA IHD+YAAVS  LN TRKKVARK
Subjt:  PELKGLLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARK

Query:  IRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV
        IRSIK+I++ TCGLVAIT+R LTALVMA   GP     RLKS RRKL++HQML NGGLE+VGEQLEAAAKGSYILNREFDTTSRLV+RL DAVDHGKAMV
Subjt:  IRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV

Query:  RLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        RLFVERKEDKFAVAVAMDE+K+SN+SIRKQVEDVEEHLYLC+VTI+R+RASVINQM
Subjt:  RLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

SwissProt top hits (e value, % identity, alignment)
A2XCJ1 UPF0496 protein 3 (e value: 6.7e-23; % identity: 29.83)
Query:  SFNVNEEYLCALRTKSFIEFFTK----------AQLTVHESPPSTSSSSTDRRRR-----KLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEAS
        SF+  EEY  A RT+S+ +F+ +          A +  H      ++S      R      L+PDQ  V  ++       + P+++GLL  Y+  +A AS
Subjt:  SFNVNEEYLCALRTKSFIEFFTK----------AQLTVHESPPSTSSSSTDRRRR-----KLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEAS

Query:  NLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSF--RLNYTRKKVARKIRSIKMIDRFTCGLVA
         LC+ +L +++  R   R ++    ++ K +S   +  +A    AL  PF+ L      L   +  +      L+  RKK   +IRS+  + R    +  
Subjt:  NLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSF--RLNYTRKKVARKIRSIKMIDRFTCGLVA

Query:  ITARTLTALV-----------MATGP--GPAHFGLRL---KSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV
        +TA  + A+V            A  P   PA  G R    ++ RR L+               QLEAAAKG+YILNR+ +T SRLV R+ D  +H  A++
Subjt:  ITARTLTALV-----------MATGP--GPAHFGLRL---KSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV

Query:  RLFVERKEDKFA------VAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        RL VE +    A      V   + +L K+  S R+Q++++EEHL+LC +TI++AR  V+N M
Subjt:  RLFVERKEDKFA------VAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

A2YH25 Putative UPF0496 protein 2 (e value: 7.2e-17; % identity: 26.04)
Query:  LLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIK
        LLI+YF+V+ EA   C+ +LA +   R +   +++ +  ++     +  + +A   + L  P S     +F  +H   + ++ RL   ++++ R  R+++
Subjt:  LLIDYFNVSAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIK

Query:  MIDRFTCGLV---AITARTLTALVMAT------GPGPAHFGLRLKSFRRKLIQH--QMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVD
         I R T       A  A  + A+V+A       G   A FG       R   +   + + +    + G  L+AAA+G+YI+ R+ DT SR+V R  D ++
Subjt:  MIDRFTCGLV---AITARTLTALVMAT------GPGPAHFGLRLKSFRRKLIQH--QMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVD

Query:  HGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQMQNG
        HG+ + R+ +    ++  +     E ++    +R Q+ ++EEH+ LC++TI+R R  V ++M  G
Subjt:  HGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQMQNG

Q10RR9 UPF0496 protein 3 (e value: 7.4e-22; % identity: 29.56)
Query:  SFNVNEEYLCALRTKSFIEFFTK----------AQLTVHESPPSTSSSSTDRRRR-----KLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEAS
        SF+  EEY  A RT+S+ +F+ +          A +  H      ++S      R      L+PDQ  V  ++       + P+++GLL  Y+  +A AS
Subjt:  SFNVNEEYLCALRTKSFIEFFTK----------AQLTVHESPPSTSSSSTDRRRR-----KLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEAS

Query:  NLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSF--RLNYTRKKVARKIRSIKMIDRFTCGLVA
         LC+ +L +++  R   R ++    ++ K +S   +  +A    AL  PF+ L      L   +  +      L+  RKK   +IRS+  + R    +  
Subjt:  NLCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSF--RLNYTRKKVARKIRSIKMIDRFTCGLVA

Query:  ITARTLTALV-----------MATGP--GPAHFGLRL---KSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV
        +TA  + A+V            A  P   PA  G R    ++ RR L+               QLEAAAKG+YILNR+ +T SRLV R+ D  +H  A+ 
Subjt:  ITARTLTALV-----------MATGP--GPAHFGLRL---KSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMV

Query:  RLFVERKEDKFA------VAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        RL VE +    A      V   + +L K+  S R+Q++++EEHL+LC +T ++AR  V+N M
Subjt:  RLFVERKEDKFA------VAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

Q6DYE5 UPF0496 protein At1g20180 (e value: 2.0e-22; % identity: 28.26)
Query:  NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQ--------LTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNV
        ++K+  ++ + K  +VNEEY  A RT S++E  TKA+          +  S PS SSSS            LD     +++  L+    L  L++ +F++
Subjt:  NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQ--------LTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNV

Query:  SAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEK-CS-------SPEKLETI----ASDLLALRAPFSDL-DKRDFALIHDEYAAVSFRLNYTRKKVARK
        S+EA ++C  +L  L+  + N   I++ +   ++ C+       SPE L  +     S   AL+ P   + ++  F ++HD  + +  +L   ++++ RK
Subjt:  SAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEK-CS-------SPEKLETI----ASDLLALRAPFSDL-DKRDFALIHDEYAAVSFRLNYTRKKVARK

Query:  IRSIKMIDRFTCGLVAIT--ARTLTALVMA------TGPGPAHFGL----RLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIR
        IR  K   +     + IT  A  +T L++A          PA  GL     L+  + K   H+  ++  LEK+G Q++ AAKG +IL  + DT SRL  R
Subjt:  IRSIKMIDRFTCGLVAIT--ARTLTALVMA------TGPGPAHFGL----RLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIR

Query:  LDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        L D ++H K +  +  + ++ +  +  A+ E          Q++++EEHLYLC  TI+R+R  V+ Q+
Subjt:  LDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

Q9SMU4 UPF0496 protein At3g49070 (e value: 7.0e-20; % identity: 30.47)
Query:  NVNEEYLCALRTKSFIEFFT--------KAQLTVHESPPSTSSSSTDRR--------RRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASN
        +V EEY  A RT+S+  F+T        K+ ++   S P   SSST  R           L PD L+ I  I++     +    + LL DYF  +A A  
Subjt:  NVNEEYLCALRTKSFIEFFT--------KAQLTVHESPPSTSSSSTDRR--------RRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASN

Query:  LCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITA
        LC ++L N+   RS    ++    S E  +S   ++   +++     PF     R   LI      +  RL   R K   K++ I  +   + GL+ + A
Subjt:  LCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITA

Query:  RTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAM-D
         T T +V       A F L   +      +   LRN  L K   +L+ AAKG+YIL+R+ DT SRLV R++D V+H +AM   +V R   +   +  +  
Subjt:  RTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAM-D

Query:  ELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        ELK+   S  ++++++EEH+YLC +TI+RAR  ++ ++
Subjt:  ELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

Arabidopsis top hits (e value, % identity, alignment)
AT1G20180.1 Protein of unknown function (DUF677) (e value: 1.4e-23; % identity: 28.26)
Query:  NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQ--------LTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNV
        ++K+  ++ + K  +VNEEY  A RT S++E  TKA+          +  S PS SSSS            LD     +++  L+    L  L++ +F++
Subjt:  NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQ--------LTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNV

Query:  SAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEK-CS-------SPEKLETI----ASDLLALRAPFSDL-DKRDFALIHDEYAAVSFRLNYTRKKVARK
        S+EA ++C  +L  L+  + N   I++ +   ++ C+       SPE L  +     S   AL+ P   + ++  F ++HD  + +  +L   ++++ RK
Subjt:  SAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEK-CS-------SPEKLETI----ASDLLALRAPFSDL-DKRDFALIHDEYAAVSFRLNYTRKKVARK

Query:  IRSIKMIDRFTCGLVAIT--ARTLTALVMA------TGPGPAHFGL----RLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIR
        IR  K   +     + IT  A  +T L++A          PA  GL     L+  + K   H+  ++  LEK+G Q++ AAKG +IL  + DT SRL  R
Subjt:  IRSIKMIDRFTCGLVAIT--ARTLTALVMA------TGPGPAHFGL----RLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIR

Query:  LDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        L D ++H K +  +  + ++ +  +  A+ E          Q++++EEHLYLC  TI+R+R  V+ Q+
Subjt:  LDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

AT1G20180.2 Protein of unknown function (DUF677) (e value: 8.4e-21; % identity: 26.67)
Query:  NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQ--------LTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNV
        ++K+  ++ + K  +VNEEY  A RT S++E  TKA+          +  S PS SSSS            LD     +++  L+    L  L++ +F++
Subjt:  NEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQ--------LTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNV

Query:  SAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEK-CS-------SPEKLETI----ASDLLALRAPFSDL-DKRDFALIHDEYAAVSFRLNYTRKKVARK
        S+EA ++C  +L  L+  + N   I++ +   ++ C+       SPE L  +     S   AL+ P   + ++  F ++HD  + +  +L   ++++ RK
Subjt:  SAEASNLCNRVLANLKLTRSNSRYIQQSIDSIEK-CS-------SPEKLETI----ASDLLALRAPFSDL-DKRDFALIHDEYAAVSFRLNYTRKKVARK

Query:  IRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGL----RLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHG
        I                        ++     PA  GL     L+  + K   H+  ++  LEK+G Q++ AAKG +IL  + DT SRL  RL D ++H 
Subjt:  IRSIKMIDRFTCGLVAITARTLTALVMATGPGPAHFGL----RLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHG

Query:  KAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        K +  +  + ++ +  +  A+ E          Q++++EEHLYLC  TI+R+R  V+ Q+
Subjt:  KAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

AT3G19330.1 Protein of unknown function (DUF677) (e value: 1.1e-15; % identity: 25.21)
Query:  SQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPE---------LKGLLIDYFNVSAEASNLC
        S  +FN++ E   A +T S+ +  ++  + V           T    R +QPD ++++LS +  P    + E         L  L+  YF  S +A+ LC
Subjt:  SQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPE---------LKGLLIDYFNVSAEASNLC

Query:  NRVLANLKLTRSNSRYIQQSIDSIEKCSS----PEKLETIASDLL----ALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRS-IKMIDRFTC
          +  N+   R +       + +I    S     E L  +A D+         PFS  +   F    D     S +L +   +  RK RS +++I   T 
Subjt:  NRVLANLKLTRSNSRYIQQSIDSIEKCSS----PEKLETIASDLL----ALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRS-IKMIDRFTC

Query:  G-----LVAITARTLTALVMATG--------PGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAM
        G     + A+ A   +A+V+A+          GP        SF+RK + +             QL AA+KG+++LN++ DT  RLV RL   +++ K +
Subjt:  G-----LVAITARTLTALVMATG--------PGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAM

Query:  VRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        +RL +ER  D  ++   +  L+KS+L +  Q++D+E+H+ L    +++AR+ ++ ++
Subjt:  VRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

AT3G19330.3 Protein of unknown function (DUF677) (e value: 1.0e-13; % identity: 24.71)
Query:  SQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPE---------LKGLLIDYFNVSAEASNLC
        S  +FN++ E   A +T S+ +  ++  + V           T    R +QPD ++++LS +  P    + E         L  L+  YF  S +A+ LC
Subjt:  SQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPE---------LKGLLIDYFNVSAEASNLC

Query:  NRVLANLKLTRSNSRYIQQSIDSIEKCSS----PEKLETIASDLL----ALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRS-IKMIDRFTC
          +  N+   R +       + +I    S     E L  +A D+         PFS  +   F    D     S +L +   +  RK RS +++I   T 
Subjt:  NRVLANLKLTRSNSRYIQQSIDSIEKCSS----PEKLETIASDLL----ALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRS-IKMIDRFTC

Query:  GLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFA
        G +               P   H      SF+RK + +             QL AA+KG+++LN++ DT  RLV RL   +++ K ++RL +ER  D  +
Subjt:  GLVAITARTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFA

Query:  VAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        +   +  L+KS+L +  Q++D+E+H+ L    +++AR+ ++ ++
Subjt:  VAVAMDELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM

AT3G49070.1 Protein of unknown function (DUF677) (e value: 5.0e-21; % identity: 30.47)
Query:  NVNEEYLCALRTKSFIEFFT--------KAQLTVHESPPSTSSSSTDRR--------RRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASN
        +V EEY  A RT+S+  F+T        K+ ++   S P   SSST  R           L PD L+ I  I++     +    + LL DYF  +A A  
Subjt:  NVNEEYLCALRTKSFIEFFT--------KAQLTVHESPPSTSSSSTDRR--------RRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASN

Query:  LCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITA
        LC ++L N+   RS    ++    S E  +S   ++   +++     PF     R   LI      +  RL   R K   K++ I  +   + GL+ + A
Subjt:  LCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITA

Query:  RTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAM-D
         T T +V       A F L   +      +   LRN  L K   +L+ AAKG+YIL+R+ DT SRLV R++D V+H +AM   +V R   +   +  +  
Subjt:  RTLTALVMATGPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAM-D

Query:  ELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM
        ELK+   S  ++++++EEH+YLC +TI+RAR  ++ ++
Subjt:  ELKKSNLSIRKQVEDVEEHLYLCVVTIDRARASVINQM


Sequences
CDS sequence
ATGTGGGCAAAATTTAGGGCTTCAAAGATCAGAAACAATGAGAAGCAGCTAATTAAAAACAATTCTCAAAAAAGTTTCAATGTAAACGAGGAATACCTTTGTGCACTGAG
GACCAAATCCTTCATTGAATTCTTCACAAAAGCTCAATTGACTGTCCATGAATCGCCGCCATCTACATCCTCCTCCTCCACTGATCGCCGCCGCCGCAAATTGCAGCCTG
ATCAATTAGATGTCATTCTTTCAATTATCGAATCGCCATTTCTTTTGATGTTGCCGGAGCTTAAAGGCCTCTTAATCGACTACTTCAATGTCAGTGCCGAGGCTTCAAAT
CTTTGCAATCGTGTCCTTGCAAACCTCAAATTAACCCGATCTAACTCCCGCTACATTCAACAATCGATCGATTCGATCGAGAAATGCTCTTCTCCCGAAAAACTCGAAAC
AATCGCCTCCGATCTTCTCGCGCTTAGGGCACCATTTTCCGATCTCGACAAACGCGATTTCGCGCTGATCCACGACGAATACGCAGCGGTTTCGTTTCGCCTGAACTACA
CGAGGAAGAAGGTAGCAAGAAAAATCAGATCGATCAAAATGATCGATAGATTTACATGTGGATTGGTCGCCATTACAGCTCGCACTCTGACTGCCTTAGTAATGGCGACC
GGTCCAGGTCCAGCACATTTCGGCCTCCGACTGAAATCCTTCCGCAGAAAGCTTATCCAGCATCAGATGCTCAGAAATGGCGGCCTTGAAAAAGTAGGCGAACAACTGGA
GGCGGCGGCCAAAGGAAGTTACATACTGAACAGAGAGTTCGACACGACGAGCCGACTCGTCATCCGACTCGACGACGCCGTGGATCACGGGAAGGCGATGGTGCGGTTGT
TTGTGGAAAGGAAGGAGGATAAATTTGCAGTTGCGGTGGCCATGGATGAGCTGAAGAAGAGCAATCTGAGCATCAGAAAGCAAGTTGAAGACGTTGAAGAGCATTTATAT
TTGTGCGTCGTGACCATTGACAGAGCCAGAGCTTCTGTGATTAACCAAATGCAAAATGGCAACGGGACGGGGATTGATTACGGGATTTGCGATGGGACAGATTTCTAG
mRNA sequence
ATGTGGGCAAAATTTAGGGCTTCAAAGATCAGAAACAATGAGAAGCAGCTAATTAAAAACAATTCTCAAAAAAGTTTCAATGTAAACGAGGAATACCTTTGTGCACTGAG
GACCAAATCCTTCATTGAATTCTTCACAAAAGCTCAATTGACTGTCCATGAATCGCCGCCATCTACATCCTCCTCCTCCACTGATCGCCGCCGCCGCAAATTGCAGCCTG
ATCAATTAGATGTCATTCTTTCAATTATCGAATCGCCATTTCTTTTGATGTTGCCGGAGCTTAAAGGCCTCTTAATCGACTACTTCAATGTCAGTGCCGAGGCTTCAAAT
CTTTGCAATCGTGTCCTTGCAAACCTCAAATTAACCCGATCTAACTCCCGCTACATTCAACAATCGATCGATTCGATCGAGAAATGCTCTTCTCCCGAAAAACTCGAAAC
AATCGCCTCCGATCTTCTCGCGCTTAGGGCACCATTTTCCGATCTCGACAAACGCGATTTCGCGCTGATCCACGACGAATACGCAGCGGTTTCGTTTCGCCTGAACTACA
CGAGGAAGAAGGTAGCAAGAAAAATCAGATCGATCAAAATGATCGATAGATTTACATGTGGATTGGTCGCCATTACAGCTCGCACTCTGACTGCCTTAGTAATGGCGACC
GGTCCAGGTCCAGCACATTTCGGCCTCCGACTGAAATCCTTCCGCAGAAAGCTTATCCAGCATCAGATGCTCAGAAATGGCGGCCTTGAAAAAGTAGGCGAACAACTGGA
GGCGGCGGCCAAAGGAAGTTACATACTGAACAGAGAGTTCGACACGACGAGCCGACTCGTCATCCGACTCGACGACGCCGTGGATCACGGGAAGGCGATGGTGCGGTTGT
TTGTGGAAAGGAAGGAGGATAAATTTGCAGTTGCGGTGGCCATGGATGAGCTGAAGAAGAGCAATCTGAGCATCAGAAAGCAAGTTGAAGACGTTGAAGAGCATTTATAT
TTGTGCGTCGTGACCATTGACAGAGCCAGAGCTTCTGTGATTAACCAAATGCAAAATGGCAACGGGACGGGGATTGATTACGGGATTTGCGATGGGACAGATTTCTAG
Protein sequence
MWAKFRASKIRNNEKQLIKNNSQKSFNVNEEYLCALRTKSFIEFFTKAQLTVHESPPSTSSSSTDRRRRKLQPDQLDVILSIIESPFLLMLPELKGLLIDYFNVSAEASN
LCNRVLANLKLTRSNSRYIQQSIDSIEKCSSPEKLETIASDLLALRAPFSDLDKRDFALIHDEYAAVSFRLNYTRKKVARKIRSIKMIDRFTCGLVAITARTLTALVMAT
GPGPAHFGLRLKSFRRKLIQHQMLRNGGLEKVGEQLEAAAKGSYILNREFDTTSRLVIRLDDAVDHGKAMVRLFVERKEDKFAVAVAMDELKKSNLSIRKQVEDVEEHLY
LCVVTIDRARASVINQMQNGNGTGIDYGICDGTDF
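
The protein sequence above can be checked against the CDS by conceptual translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly. `translate` is a hypothetical helper, and only the first 42 and last 9 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing anything
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 14 codons and last 3 codons of the CDS, copied from this page.
cds_start = "ATGTGGGCAAAATTTAGGGCTTCAAAGATCAGAAACAATGAG"
cds_end = "GATTTCTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence, after joining its lines, gives a whole-sequence check under the same assumption.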