; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009023 (gene) of Snake gourd v1 genome

Gene IDTan0009023
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4283 domain-containing protein
Genome locationLG04:3418557..3420760
RNA-Seq ExpressionTan0009023
SyntenyTan0009023
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa]9.4e-23867.53Show/hide
Query:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK
        QSDH PTG+ DDE AARN  SRKK K+ PP++ SSD +S     TT+ T+  CNL+PS+T RITQQF HSLIA + GKD RP QLA RL  HL LT+DVK
Subjt:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK

Query:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI
        VFELGLGYFVLKFSETDYLALEDLPWSIPNLCI+AFPWTPDFKPSE INSSV+VWIRL ELSIEYYD E+L++IA+ IGG LVKIDPVT+DR KCKFAR 
Subjt:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI

Query:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRF
        CI VNLCDPLPSMI+LGRI+Q IEYEGF+ LC KC RVGDL+HDC +L+ PSGS G + H D PH   TR  KE GS+SSSKQ LIP ESS  SAW S  
Subjt:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRF

Query:  RVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPK-ESSTITIKAPKLKLVAPSVVEDKFKAAKTSNPT
        R +E +P LDLK  + P+L  SE  KAGTS+RI+SP+V VK +A+ K+K KC I  SVQPL +LPK +SSTITIKAP+LK V PSVVED+ K AKT N T
Subjt:  RVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPK-ESSTITIKAPKLKLVAPSVVEDKFKAAKTSNPT

Query:  LIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDI
        +I DHN+QP    PSP  SIP   LQP PASEATL+F+S AI   TR++E+ N+PS+E      PTV+TID +K  S  ++LS++QT  +S QNQY +++
Subjt:  LIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDI

Query:  VPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQD
        VPT +GGD GGVG EV SGSE C+KKML WKFH MDNAKL RALKDLIQLHEPSIVLIFG KI+G DA +V++EL+FCGSY  +PD YNGGVWLLLS+QD
Subjt:  VPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQD

Query:  VQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        VQ +VNSYS  QVSASV F S+TN+  FSP N DTE SSGPWG  FFY STNWMTS+AY
Subjt:  VQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]1.3e-26673.49Show/hide
Query:  MPFQSNCSQSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHL
        M   SNCSQS+ QPTG+ DDE AAR   SRKKAK P MASSSDL+SH    TT  T+CNLSPSQTARITQQFDHSLIAW+FG+DIRPRQLAGRL RHLHL
Subjt:  MPFQSNCSQSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHL

Query:  TEDVKVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKC
        T+DV+VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAF WTPDFKPSE INSSVDVWIRL ELSIEYYDEE+L+QIA TIGGVLVK DPVTK+RRKC
Subjt:  TEDVKVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKC

Query:  KFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRP-HPHGTRPIKEIGSSSSSKQALIPSESSPAS
        KFARICIR+NLCDPLPSMIKLGRIQQ+IEYEG DLLC  CR V DLK +CLN   PSGSSGLD+  DRP H H TRP+ E+GSSSSSKQ LIPS SSPAS
Subjt:  KFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRP-HPHGTRPIKEIGSSSSSKQALIPSESSPAS

Query:  AWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAK
        A GSRF+VLEND +LD            ECEKA  SIRI+SP+V VK +A AK K  CG    V+ L  LPK+ ST T KAP+L+LVAP+VVE +FK AK
Subjt:  AWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAK

Query:  TSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQ
        TSNPTLI DHNN                  QPC   +ATL+FISA I+RS +EKEMP+ PS+EIIVDGCP VHTI+T+K  SFK+ LS LQTN I  +N 
Subjt:  TSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQ

Query:  YAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLL
        Y MD +PTAR  D+ G GS+ VSGSESCSKKMLCWKFHG DNA L +ALKDLIQLHEPSIVLIFGTKISGA+A+ VVRELSFCGSYCRKPD YNGGVWLL
Subjt:  YAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLL

Query:  LSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        LSRQDVQIEV+SYS  QVSASVYFGS TN  AFSP NVDTE SSGPWG  FFY STNWM+SVAY
Subjt:  LSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-26773.64Show/hide
Query:  MPFQSNCSQSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHL
        M   SNCSQS+ QPTG+ DDE AAR   SRKKAK P MASSSDL+SH    TT  T+CNLSPSQTARITQQFDHSLIAW+FG+DIRPRQLAGRL RHLHL
Subjt:  MPFQSNCSQSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHL

Query:  TEDVKVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKC
        T+DV+VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAF WTPDFKPSE INSSVDVWIRLRELSIEYYDEE+L+QIA TIGGVLVK DPVTK+RRKC
Subjt:  TEDVKVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKC

Query:  KFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRP-HPHGTRPIKEIGSSSSSKQALIPSESSPAS
        KFARICIR+NLCDPLPSMIKLGRIQQ+IEYEG DLLC  CR V DLK +CLN   PSGSSGLD+  DRP H H TRP+ E+GSSSSSKQ LIPS SSPAS
Subjt:  KFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRP-HPHGTRPIKEIGSSSSSKQALIPSESSPAS

Query:  AWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAK
          GSRF+VLEND +LD            ECEKA  SIRI+SP+V VK +A AK K  CG    V+ L  LPK+ ST T KAP+L+LVAP+VVE +FK AK
Subjt:  AWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAK

Query:  TSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQ
        TSNPTLI DHNN                  QPC   +ATL+FISA I+RST+EKEMP+ PS+EIIVDGCP VHTI+T+K  SFK+ LS LQTN I  +N 
Subjt:  TSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQ

Query:  YAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLL
        Y MD +PTAR  D+ G GS+ VSGSESCSKKMLCWKFHG DNA L +ALKDLIQLHEPSIVLIFGTKISGA+A+ VVRELSFCGSYCRKPD YNGGVWLL
Subjt:  YAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLL

Query:  LSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        LSRQDVQIEV+SYS  QVSASVYFGS TN  AFSP NVDTE SSGPWG  FFY STNWM+SVAY
Subjt:  LSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

KAG7031864.1 hypothetical protein SDJN02_05905, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-20762.2Show/hide
Query:  QSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFE
        QSD+Q TG+ DDE AA    +RKKA                      T+C LS SQTARITQQFDHS IAWIFGKD+RP ++A  L RHL LT  VKVFE
Subjt:  QSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFE

Query:  LGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIR
        LGLGYFVLKF ETD+LAL+DLPWS+PNLCI+  PWTPDFKPSETI SSVDVW+RL ELSIEYYD+EVLQ+IA  IGG LVKIDPVTK+R KCKFARIC+R
Subjt:  LGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIR

Query:  VNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRFRVL
        VNLCDPLPSMI+LG+I+QEIEYEGF+LLC  C RV  L+H+CLNL  PSG SG + HR +PH HG R         S KQ LIPSESS  S  GSRF+V 
Subjt:  VNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRFRVL

Query:  ENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAKTS-NPTLIP
             LDL  N+ P+L   E  KAGTSIR +S  V VK +A+AK+K KCG+  SVQP   LPKESS +TIK             D+ KAAKTS NPT+  
Subjt:  ENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAKTS-NPTLIP

Query:  DHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPT
              QPPSP+P  S+P   L PCPASEA L F SAAIQRSTR KE+ + PS+EI VD CPTV+TID  K  +  + LS+ +T  +S Q QYA++ VPT
Subjt:  DHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPT

Query:  ARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQI
         R GD GGV     SGSESC KK+LCWKFH  DN KL R+LKDLI+LHEPSIVLIFGTKISGAD D+VV+EL FC SY RKPD Y+GGVWLLLS QDV+ 
Subjt:  ARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQI

Query:  EVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        +VNS S  Q+ AS+YF SKTN  AF+P  V T+ SSGPWG PFF+  TNWMTS+AY
Subjt:  EVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

KGN50455.1 hypothetical protein Csa_000357 [Cucumis sativus]2.3e-22864.82Show/hide
Query:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK
        QS H PTG+ DDE AARN  SRKK K+ PP+  SSD   H    TT+ T+  CNL+PS+T RITQQF HSLIA + GKD RP QLA RL  HL LT+DVK
Subjt:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK

Query:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI
        VF+LGLGYFVLKFSETDYLALEDLPWSIPNLCI+AFPWTPDFKPSE INSSV+VWIRL ELSIEYYD  +L++IA+ IG  LVKIDPVT+DR KCKFAR 
Subjt:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI

Query:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCL----------NLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSES
        CI VNLCDPLPSMI+LGR++Q IEYEGF+ LC KC RVGDL+HDC           +L+ PSGS G + H D PH   TR  KEIGS+S+SKQ LIP ES
Subjt:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCL----------NLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSES

Query:  SPASAWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKF
        SP SAW S  R +E +P LDLKL D P+L   E  KAG+ +RI+SP V VK + + K+K KC I  SVQ L NLPK+ STITIKAP+LK V PSVVED+ 
Subjt:  SPASAWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKF

Query:  KAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLIS
        K  KT N T+I DHN+QP    PSP  SIP   LQP PASEATL+F+S AI   TR++E+ N+PS+ I     PTV+TID +K  S  + LS++QT    
Subjt:  KAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLIS

Query:  KQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGG
              +++VPT +GGD+GGVGSEV SGSE C+KK+L WKFH MDNAKL RALKDLIQLHEPSIVLIFG KISG D D+V+REL+FCGSY  KPD YNGG
Subjt:  KQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGG

Query:  VWLLLSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        VWLLLS+QDVQ +VNS+S+ QVSASV F S+TN+  FSP N DT+ SSGPWG  FFY STNWMTS+AY
Subjt:  VWLLLSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein1.1e-22864.82Show/hide
Query:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK
        QS H PTG+ DDE AARN  SRKK K+ PP+  SSD   H    TT+ T+  CNL+PS+T RITQQF HSLIA + GKD RP QLA RL  HL LT+DVK
Subjt:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK

Query:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI
        VF+LGLGYFVLKFSETDYLALEDLPWSIPNLCI+AFPWTPDFKPSE INSSV+VWIRL ELSIEYYD  +L++IA+ IG  LVKIDPVT+DR KCKFAR 
Subjt:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI

Query:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCL----------NLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSES
        CI VNLCDPLPSMI+LGR++Q IEYEGF+ LC KC RVGDL+HDC           +L+ PSGS G + H D PH   TR  KEIGS+S+SKQ LIP ES
Subjt:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCL----------NLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSES

Query:  SPASAWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKF
        SP SAW S  R +E +P LDLKL D P+L   E  KAG+ +RI+SP V VK + + K+K KC I  SVQ L NLPK+ STITIKAP+LK V PSVVED+ 
Subjt:  SPASAWGSRFRVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKF

Query:  KAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLIS
        K  KT N T+I DHN+QP    PSP  SIP   LQP PASEATL+F+S AI   TR++E+ N+PS+ I     PTV+TID +K  S  + LS++QT    
Subjt:  KAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLIS

Query:  KQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGG
              +++VPT +GGD+GGVGSEV SGSE C+KK+L WKFH MDNAKL RALKDLIQLHEPSIVLIFG KISG D D+V+REL+FCGSY  KPD YNGG
Subjt:  KQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGG

Query:  VWLLLSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        VWLLLS+QDVQ +VNS+S+ QVSASV F S+TN+  FSP N DT+ SSGPWG  FFY STNWMTS+AY
Subjt:  VWLLLSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

A0A5A7SSJ3 DUF4283 domain-containing protein4.5e-23867.53Show/hide
Query:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK
        QSDH PTG+ DDE AARN  SRKK K+ PP++ SSD +S     TT+ T+  CNL+PS+T RITQQF HSLIA + GKD RP QLA RL  HL LT+DVK
Subjt:  QSDHQPTGSDDDETAARNNPSRKKAKI-PPMASSSDLKSHHPNPTTVPTI--CNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVK

Query:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI
        VFELGLGYFVLKFSETDYLALEDLPWSIPNLCI+AFPWTPDFKPSE INSSV+VWIRL ELSIEYYD E+L++IA+ IGG LVKIDPVT+DR KCKFAR 
Subjt:  VFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARI

Query:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRF
        CI VNLCDPLPSMI+LGRI+Q IEYEGF+ LC KC RVGDL+HDC +L+ PSGS G + H D PH   TR  KE GS+SSSKQ LIP ESS  SAW S  
Subjt:  CIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRF

Query:  RVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPK-ESSTITIKAPKLKLVAPSVVEDKFKAAKTSNPT
        R +E +P LDLK  + P+L  SE  KAGTS+RI+SP+V VK +A+ K+K KC I  SVQPL +LPK +SSTITIKAP+LK V PSVVED+ K AKT N T
Subjt:  RVLENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPK-ESSTITIKAPKLKLVAPSVVEDKFKAAKTSNPT

Query:  LIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDI
        +I DHN+QP    PSP  SIP   LQP PASEATL+F+S AI   TR++E+ N+PS+E      PTV+TID +K  S  ++LS++QT  +S QNQY +++
Subjt:  LIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDI

Query:  VPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQD
        VPT +GGD GGVG EV SGSE C+KKML WKFH MDNAKL RALKDLIQLHEPSIVLIFG KI+G DA +V++EL+FCGSY  +PD YNGGVWLLLS+QD
Subjt:  VPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQD

Query:  VQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY
        VQ +VNSYS  QVSASV F S+TN+  FSP N DTE SSGPWG  FFY STNWMTS+AY
Subjt:  VQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTSVAY

A0A5A7SUD3 DUF4283 domain-containing protein1.0e-14147.21Show/hide
Query:  SDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFEL
        S HQPTG+  DE A+RN+ SRKK+K  P++ S D  S H   TT  T+C  S SQT  I ++F HSLIAW+ GK+IRP +LA  LHRHL LTE   VFEL
Subjt:  SDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFEL

Query:  GLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRV
        GLGYFVLKF ETD+LALED PW IPNLCIYAFPWTP+FKPSE ++S++D WIRL+EL IEYY E++L+ I +T+G  LVKIDP+TKDR+KCK+ARIC+R+
Subjt:  GLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRV

Query:  NLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRFRV--
        N+ +PLPS I++G+I QEIEYEGFD+LC +C  V  LKHDCLN      SSG  S  +  HP          + S+SKQ L+PSESS   AWGSRF V  
Subjt:  NLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRFRV--

Query:  -LENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVP----VKTEAVAKQKGKCGISASVQPLANLPK--------------ESSTITIKAPKLK----
             P+ +LK    PS+  SE     TS   +SP +P    +  E + KQK KCG S  + P  NLPK              ESS+ TI  P  +    
Subjt:  -LENDPILDLKLNDRPSLAISECEKAGTSIRINSPNVP----VKTEAVAKQKGKCGISASVQPLANLPK--------------ESSTITIKAPKLK----

Query:  ----LVAPSVVEDKFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQP-CPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKT
            ++AP   E+ F  A+TS    +  HNNQPQ PS SP+ +  S+  QP  P+S+    F S+ I RS  +K++ +  S+   ++  P ++TI  E  
Subjt:  ----LVAPSVVEDKFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQP-CPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKT

Query:  GSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVREL
         SF+V LS+   +   KQNQ+++  V T R       G++ +S  +  SKKML W F GMDN  L   L  ++Q +EPSIV+IFGT+I+    ++VV +L
Subjt:  GSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVREL

Query:  SFCGSYCRKPDDYNGGVWLLLSRQDVQ---IEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSS-GPWGPPFFYNST
        +F GSY +K D+Y+GGVWL + R+DVQ    EVNSYS  QVSAS YF  + N         D  ++S   WG   F +ST
Subjt:  SFCGSYCRKPDDYNGGVWLLLSRQDVQ---IEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSS-GPWGPPFFYNST

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X26.1e-15054.33Show/hide
Query:  TICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFKPSETI
        T+CNL+PSQTARI QQFD SLI W+ GK I PRQLA RL R+LHL  D+ VFELGLG+FVLKFS     Y ALE+ PWSIP+LCIY FPW P+FKPSE  
Subjt:  TICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFKPSETI

Query:  NSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNL
           VDVWIRL ELSIEYYD+EVL++IAETIGG LVKIDPVT  R KC +ARICIR+NL  PL    + G+  Q+I YEG DLLC+ C  V DLKHDC  L
Subjt:  NSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNL

Query:  SKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSS----------------------KQALIPSESSPASAWGSRFRVLENDPILDLKLNDRPSLAISECEK
        S  S SSG D     PH H  RP++  GSS SS                      K  LIPS+ +PASA GSRF+VLE      L LN+ PSL +SE +K
Subjt:  SKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSS----------------------KQALIPSESSPASAWGSRFRVLENDPILDLKLNDRPSLAISECEK

Query:  AGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVED-KFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSL-QL
                SP++ +                        P    T  I++  L   AP V+ED +F+  KTS+PT +   NN+PQP S    ++I S+  L
Subjt:  AGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVED-KFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSL-QL

Query:  QPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSES-CS
        QP  A EA L+F S AIQ+ST +K + NTPS  I VD  PT++TID   T      L    T   S QN++A+ IVPT          SE VS S S CS
Subjt:  QPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSES-CS

Query:  KKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQIEVNSYSANQVSASVYFGSKTN
        KKMLCW F   DNAKL RALKDLIQLH+PSIVLIFGTKISGADAD VVREL+F GSYCRKPD Y GG WLLLS+QDVQIEV+SYS  QVSASV   SK N
Subjt:  KKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQIEVNSYSANQVSASVYFGSKTN

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X11.0e-14954.15Show/hide
Query:  TICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFKPSETI
        T+CNL+PSQTARI QQFD SLI W+ GK I PRQLA RL R+LHL  D+ VFELGLG+FVLKFS     Y ALE+ PWSIP+LCIY FPW P+FKPSE  
Subjt:  TICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFKPSETI

Query:  NSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNL
           VDVWIRL ELSIEYYD+EVL++IAETIGG LVKIDPVT  R KC +ARICIR+NL  PL    + G+  Q+I YEG DLLC+ C  V DLKHDC  L
Subjt:  NSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNL

Query:  SKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSS------------------------KQALIPSESSPASAWGSRFRVLENDPILDLKLNDRPSLAISEC
        S  S SSG D     PH H  RP++  GSS SS                        K  LIPS+ +PASA GSRF+VLE      L LN+ PSL +SE 
Subjt:  SKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSS------------------------KQALIPSESSPASAWGSRFRVLENDPILDLKLNDRPSLAISEC

Query:  EKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVED-KFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSL-
        +K        SP++ +                        P    T  I++  L   AP V+ED +F+  KTS+PT +   NN+PQP S    ++I S+ 
Subjt:  EKAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVED-KFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSL-

Query:  QLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSES-
         LQP  A EA L+F S AIQ+ST +K + NTPS  I VD  PT++TID   T      L    T   S QN++A+ IVPT          SE VS S S 
Subjt:  QLQPCPASEATLEFISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSES-

Query:  CSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQIEVNSYSANQVSASVYFGSK
        CSKKMLCW F   DNAKL RALKDLIQLH+PSIVLIFGTKISGADAD VVREL+F GSYCRKPD Y GG WLLLS+QDVQIEV+SYS  QVSASV   SK
Subjt:  CSKKMLCWKFHGMDNAKLARALKDLIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQIEVNSYSANQVSASVYFGSK

Query:  TN
         N
Subjt:  TN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding7.9e-1730.1Show/hide
Query:  LIAWIFGKDIRPRQLAGRLHRHLHLTEDV-KVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYD
        +I  + G  I P  +  R  R L     V  V +L   +F+++F  E +Y+ AL   PW +    +    W+  F P      +  VW+RL  +   YY 
Subjt:  LIAWIFGKDIRPRQLAGRLHRHLHLTEDV-KVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYD

Query:  EEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDR
          +L +IA  +G  L K+D  T +  K +FAR+CI VNL  PL   + +   +  + YEG   +C  C   G L H C        S+G ++  DR
Subjt:  EEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDR

AT2G41590.1 unknown protein3.1e-0523.73Show/hide
Query:  PWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQE--
        PW   N  + A  W  +  P+    +++D+W+++R + + Y  EE + +IA+ +G VL+ +D       +  + R+ +R  + D L    ++     E  
Subjt:  PWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIKLGRIQQE--

Query:  ---IEYEGFDLLCLKCRR
            +YE    +C  C R
Subjt:  ---IEYEGFDLLCLKCRR

AT5G36228.1 nucleic acid binding;zinc ion binding1.1e-0729.17Show/hide
Query:  FVLKF-SETDYL-ALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNL
        F ++F SE D L  L   PW      I    W  DF P+E   + +DVW+ +R + + Y  E  ++ IA T+G V V +D   +   +  F R+ +R++ 
Subjt:  FVLKF-SETDYL-ALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNL

Query:  CDPLPSMIKLGRIQQE-----IEYEGFDLLCLKCRRVGDLKHDC
         +PL    ++    +E      EYE    +C  C RV      C
Subjt:  CDPLPSMIKLGRIQQE-----IEYEGFDLLCLKCRRVGDLKHDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTCAATCCAACTGTTCACAATCCGACCACCAACCCACCGGCTCCGACGACGACGAAACCGCCGCCAGAAACAACCCGAGCAGAAAGAAGGCCAAGATACCACC
AATGGCTTCCTCATCCGACTTGAAATCTCATCATCCAAATCCAACTACCGTCCCCACCATCTGCAACCTCTCTCCCTCTCAAACAGCTCGGATCACTCAACAGTTCGATC
ACTCCCTCATTGCTTGGATCTTCGGAAAAGACATCCGTCCACGGCAACTCGCCGGTCGCCTTCACCGTCATCTTCATCTCACCGAAGATGTGAAGGTCTTCGAGCTTGGT
CTCGGCTACTTCGTGCTCAAATTCTCTGAGACTGACTATTTAGCCCTAGAAGACCTTCCTTGGTCAATCCCTAACCTCTGCATCTACGCCTTTCCATGGACTCCTGATTT
CAAACCCTCCGAGACCATCAATTCCTCTGTTGATGTTTGGATCCGCCTCCGTGAGCTCTCCATCGAGTATTACGACGAGGAAGTTCTGCAACAAATTGCGGAGACCATCG
GCGGCGTTCTTGTGAAGATCGATCCAGTAACGAAAGATCGCAGGAAATGTAAGTTCGCTCGCATCTGTATTAGAGTAAATTTGTGCGATCCCCTTCCATCGATGATCAAA
CTTGGTAGAATTCAACAGGAAATTGAGTATGAGGGTTTTGATTTGTTGTGCCTGAAGTGTAGACGTGTTGGTGATCTGAAACATGATTGTTTGAATTTGAGTAAACCTTC
TGGTTCTTCTGGATTAGATTCCCATAGAGATAGACCCCACCCCCATGGAACTCGTCCTATTAAGGAAATTGGCTCGAGTTCAAGCTCGAAGCAGGCGTTGATTCCTTCTG
AATCTTCACCAGCATCAGCTTGGGGATCCAGATTTCGAGTTCTTGAAAATGACCCGATCCTTGATTTGAAATTGAATGATCGCCCAAGCCTTGCCATTAGTGAATGTGAA
AAAGCAGGAACAAGTATCAGAATAAATTCTCCAAATGTTCCTGTGAAGACTGAAGCAGTCGCAAAACAAAAGGGGAAATGTGGAATCTCTGCCTCTGTCCAACCATTGGC
CAACTTGCCAAAAGAATCTTCAACAATTACCATTAAAGCTCCTAAGTTAAAACTTGTAGCCCCTTCTGTTGTTGAAGATAAGTTCAAGGCTGCAAAAACCAGCAACCCCA
CCTTGATTCCAGACCATAATAACCAACCACAACCACCGTCACCCTCACCCATCGTAAGCATTCCATCCCTACAACTACAACCATGTCCTGCTTCAGAGGCCACCCTTGAG
TTCATTTCGGCTGCGATCCAACGCTCGACAAGAGAGAAAGAAATGCCCAACACACCATCTAGGGAGATCATTGTTGATGGTTGCCCCACTGTTCACACGATCGACACAGA
GAAGACCGGAAGCTTTAAAGTGACTTTGTCACAACTGCAAACAAACTTGATATCAAAACAAAATCAATATGCAATGGACATTGTTCCAACTGCGAGAGGTGGAGACGATG
GTGGAGTTGGTTCGGAGGTGGTATCAGGATCCGAATCTTGTTCTAAGAAGATGTTATGCTGGAAGTTTCATGGGATGGACAATGCCAAGCTAGCGCGAGCATTGAAAGAT
CTGATTCAGCTACACGAGCCATCCATTGTGCTCATCTTTGGCACCAAGATCAGTGGTGCTGATGCAGATCAGGTTGTGCGGGAGCTTTCTTTCTGCGGTTCGTACTGCAG
AAAGCCTGATGACTACAATGGTGGTGTTTGGTTGTTATTGTCCCGGCAAGATGTGCAGATTGAAGTCAACTCGTACAGCGCAAATCAGGTCTCTGCATCGGTATATTTTG
GTTCCAAAACCAATATAACAGCGTTTAGTCCTTTGAATGTCGATACCGAAAAATCATCAGGACCATGGGGACCACCTTTCTTCTATAATTCGACGAACTGGATGACTTCA
GTGGCATACTGA
mRNA sequenceShow/hide mRNA sequence
CGAAAGGTCAAAAAATGGAAGAACTGTGTGTTCCTTTTCCTACAAGTTTCAATCTCCTTCATATTCCAACGTCAATCATCAGTGACTGCAAAATCCAATCACCTAACAAT
GCCATTTCAATCCAACTGTTCACAATCCGACCACCAACCCACCGGCTCCGACGACGACGAAACCGCCGCCAGAAACAACCCGAGCAGAAAGAAGGCCAAGATACCACCAA
TGGCTTCCTCATCCGACTTGAAATCTCATCATCCAAATCCAACTACCGTCCCCACCATCTGCAACCTCTCTCCCTCTCAAACAGCTCGGATCACTCAACAGTTCGATCAC
TCCCTCATTGCTTGGATCTTCGGAAAAGACATCCGTCCACGGCAACTCGCCGGTCGCCTTCACCGTCATCTTCATCTCACCGAAGATGTGAAGGTCTTCGAGCTTGGTCT
CGGCTACTTCGTGCTCAAATTCTCTGAGACTGACTATTTAGCCCTAGAAGACCTTCCTTGGTCAATCCCTAACCTCTGCATCTACGCCTTTCCATGGACTCCTGATTTCA
AACCCTCCGAGACCATCAATTCCTCTGTTGATGTTTGGATCCGCCTCCGTGAGCTCTCCATCGAGTATTACGACGAGGAAGTTCTGCAACAAATTGCGGAGACCATCGGC
GGCGTTCTTGTGAAGATCGATCCAGTAACGAAAGATCGCAGGAAATGTAAGTTCGCTCGCATCTGTATTAGAGTAAATTTGTGCGATCCCCTTCCATCGATGATCAAACT
TGGTAGAATTCAACAGGAAATTGAGTATGAGGGTTTTGATTTGTTGTGCCTGAAGTGTAGACGTGTTGGTGATCTGAAACATGATTGTTTGAATTTGAGTAAACCTTCTG
GTTCTTCTGGATTAGATTCCCATAGAGATAGACCCCACCCCCATGGAACTCGTCCTATTAAGGAAATTGGCTCGAGTTCAAGCTCGAAGCAGGCGTTGATTCCTTCTGAA
TCTTCACCAGCATCAGCTTGGGGATCCAGATTTCGAGTTCTTGAAAATGACCCGATCCTTGATTTGAAATTGAATGATCGCCCAAGCCTTGCCATTAGTGAATGTGAAAA
AGCAGGAACAAGTATCAGAATAAATTCTCCAAATGTTCCTGTGAAGACTGAAGCAGTCGCAAAACAAAAGGGGAAATGTGGAATCTCTGCCTCTGTCCAACCATTGGCCA
ACTTGCCAAAAGAATCTTCAACAATTACCATTAAAGCTCCTAAGTTAAAACTTGTAGCCCCTTCTGTTGTTGAAGATAAGTTCAAGGCTGCAAAAACCAGCAACCCCACC
TTGATTCCAGACCATAATAACCAACCACAACCACCGTCACCCTCACCCATCGTAAGCATTCCATCCCTACAACTACAACCATGTCCTGCTTCAGAGGCCACCCTTGAGTT
CATTTCGGCTGCGATCCAACGCTCGACAAGAGAGAAAGAAATGCCCAACACACCATCTAGGGAGATCATTGTTGATGGTTGCCCCACTGTTCACACGATCGACACAGAGA
AGACCGGAAGCTTTAAAGTGACTTTGTCACAACTGCAAACAAACTTGATATCAAAACAAAATCAATATGCAATGGACATTGTTCCAACTGCGAGAGGTGGAGACGATGGT
GGAGTTGGTTCGGAGGTGGTATCAGGATCCGAATCTTGTTCTAAGAAGATGTTATGCTGGAAGTTTCATGGGATGGACAATGCCAAGCTAGCGCGAGCATTGAAAGATCT
GATTCAGCTACACGAGCCATCCATTGTGCTCATCTTTGGCACCAAGATCAGTGGTGCTGATGCAGATCAGGTTGTGCGGGAGCTTTCTTTCTGCGGTTCGTACTGCAGAA
AGCCTGATGACTACAATGGTGGTGTTTGGTTGTTATTGTCCCGGCAAGATGTGCAGATTGAAGTCAACTCGTACAGCGCAAATCAGGTCTCTGCATCGGTATATTTTGGT
TCCAAAACCAATATAACAGCGTTTAGTCCTTTGAATGTCGATACCGAAAAATCATCAGGACCATGGGGACCACCTTTCTTCTATAATTCGACGAACTGGATGACTTCAGT
GGCATACTGATGAAGGTAACTGGGAAAGTAAAAAAAGTATTAGAATAATTCCAAGAATCGTATTGCTTATTGATTGCTTTTGTTTCTTGATTTGCTGAGAATGGATTGAA
AAAC
Protein sequenceShow/hide protein sequence
MPFQSNCSQSDHQPTGSDDDETAARNNPSRKKAKIPPMASSSDLKSHHPNPTTVPTICNLSPSQTARITQQFDHSLIAWIFGKDIRPRQLAGRLHRHLHLTEDVKVFELG
LGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKPSETINSSVDVWIRLRELSIEYYDEEVLQQIAETIGGVLVKIDPVTKDRRKCKFARICIRVNLCDPLPSMIK
LGRIQQEIEYEGFDLLCLKCRRVGDLKHDCLNLSKPSGSSGLDSHRDRPHPHGTRPIKEIGSSSSSKQALIPSESSPASAWGSRFRVLENDPILDLKLNDRPSLAISECE
KAGTSIRINSPNVPVKTEAVAKQKGKCGISASVQPLANLPKESSTITIKAPKLKLVAPSVVEDKFKAAKTSNPTLIPDHNNQPQPPSPSPIVSIPSLQLQPCPASEATLE
FISAAIQRSTREKEMPNTPSREIIVDGCPTVHTIDTEKTGSFKVTLSQLQTNLISKQNQYAMDIVPTARGGDDGGVGSEVVSGSESCSKKMLCWKFHGMDNAKLARALKD
LIQLHEPSIVLIFGTKISGADADQVVRELSFCGSYCRKPDDYNGGVWLLLSRQDVQIEVNSYSANQVSASVYFGSKTNITAFSPLNVDTEKSSGPWGPPFFYNSTNWMTS
VAY