; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009944 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009944
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold7:8079094..8080962
RNA-Seq ExpressionSpg009944
SyntenySpg009944
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa]8.8e-23870.59Show/hide
Query:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC
        ++ S D ES  STT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VFELGLGYFVLKFSETDYLALEDLPWSIPNLC
Subjt:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC

Query:  IYAFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC
        I+AFPWTPDFKPS+AINSSV+VWIRL ELSIEYYD E+L+RIA+ IGG LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGRIRQ IEYEGF+ LC
Subjt:  IYAFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC

Query:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGI
         KC RVGDL+HDC +L+NPSGS GF+PH D+PHH+ TR  KE  S S+SKQPLIP ESS  SAW  SRF  +E +P LDLK  + P+LP  E  K GT +
Subjt:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGI

Query:  RISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP---CEK
        RISSP V VK +A  K+KEKC   +SVQPLP+LPK +SSTITIKAPEL+ V PSVVED+ K AKT N TMIADH     SQP SPTASIP LQP    E 
Subjt:  RISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP---CEK

Query:  ILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWK
         LKF SD    LT  +EI N+ SKE +   FPTVYTIDPKKITSL I+LSE Q  TT++SNQNQY I++VPT+KGGD+GGV  E  SGSE C+KKML WK
Subjt:  ILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWK

Query:  CHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPM
         H MDNAKL+R+LKDLI+LH+PSIVLIFG KI+G DA +V+QELAFCGSY  +PDGYNGGVWLLLS+QDV+ +VNSYSPQQVSASV FHSETNV  F+P 
Subjt:  CHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPM

Query:  DLDTKTSSGTWGSTFFYTSTNWMTNTMAY
        + DT+TSSG WGSTFFYTSTNWMT ++AY
Subjt:  DLDTKTSSGTWGSTFFYTSTNWMTNTMAY

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]4.4e-23770.11Show/hide
Query:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
        MASS DLESHRSTT ATVCNLSPSQTARITQQFDHSLIAW+ G+DIRPRQLAGRL RHL LT DVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
Subjt:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY

Query:  AFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK
        AF WTPDFKPS+AINSSVDVWIRLHELSIEYYD+E+L++IA TIGGVLVK DP+TK+R+KCKFARICIR+NLCDPLPSMIKLGRI+Q+IEYEG DLLCP 
Subjt:  AFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK

Query:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRI
        CR V DLK +CLN  NPSGSSG D   D+P HH+RTRP+ E  S S+SKQPLIPS SSPASA GSRFQVLEND +LD            ECEK    IRI
Subjt:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRI

Query:  SSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEKILKFHS
        SSP V VK +AA K KE CG    V+ LP LPK+ ST T KAPELE VAP+VVE +FK AKTSNPT+IADH NQP   P +              L F S
Subjt:  SSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEKILKFHS

Query:  DYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDN
           +     KE+ +  SKEI VD  P V+TI+ KKI S ++ LS  Q  T ++ N+N Y +D +PT +  DE G  S+ VSGSESCSKKMLCWK HG DN
Subjt:  DYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDN

Query:  AKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKT
        A L+++LKDLI+LH+PSIVLIFGTKISGA+A+ VV+EL+FCGSYCRKPDGYNGGVWLLLSRQDV+IEV+SYSPQQVSASVYF S TN P F+P ++DT+T
Subjt:  AKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKT

Query:  SSGTWGSTFFYTSTNWMTN
        SSG WGSTFFYTSTNWM++
Subjt:  SSGTWGSTFFYTSTNWMTN

KAG6601052.1 hypothetical protein SDJN03_06285, partial [Cucurbita argyrosperma subsp. sororia]1.5e-21366.07Show/hide
Query:  RSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKP
        R+   ATVC LS SQTARITQQFDHS IAWI GKD+RP ++A  L RHLCLTG V+VFELGLGYFVLKF ETD+LAL+DLPWS+PNLCI+  PWTPDFKP
Subjt:  RSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKP

Query:  SDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHD
        S+ I SSVDVW+RLHELSIEYYDDEVLQ+IA  IGG LVKIDP+TK+R KCKFARIC+RVNLCDPLPSMI+LG+IRQEIEYEGF+LLCP C RV  L+H+
Subjt:  SDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHD

Query:  CLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRISSPRVPVKVEA
        CLNL  PSG SGF+PHR KPHH+  R         + KQPLIPSESS  S  GSRFQV      LDL  N+ P+L  GE  K GT IR SS  V VK +A
Subjt:  CLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRISSPRVPVKVEA

Query:  AGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTS-NPTMIADHKNQPQSQPSSPTASIPSLQPC---EKILKFHSDYFQGLT
          K+KEKCG  VSVQP   LPKESS +TIK             D+ K AKTS NPT+          QP+SPT S+P L PC   E IL FHS   Q  T
Subjt:  AGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTS-NPTMIADHKNQPQSQPSSPTASIPSLQPC---EKILKFHSDYFQGLT

Query:  TMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVRSL
          KEIT+  SKEI+VD  PTVYTIDP KI +L+IALSE  TRTT++SNQ QYAI+ VPT + GD+GGVD    SGSESC KK+LCWK H  DN KL+RSL
Subjt:  TMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVRSL

Query:  KDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKTSSGTWGS
        KDLIKLH+PSIVLIFGTKISGAD D+VVQEL FC SY RKPDGY+GGVWLLLS QDV  +VNS SPQQ+ AS+YF S+TN   FNP  + TK SSG WGS
Subjt:  KDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKTSSGTWGS

Query:  TFFYTSTNWMTNTMAY
         FF+T TNWMT +MAY
Subjt:  TFFYTSTNWMTNTMAY

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-23669.95Show/hide
Query:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
        MASS DLESHRSTT ATVCNLSPSQTARITQQFDHSLIAW+ G+DIRPRQLAGRL RHL LT DVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
Subjt:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY

Query:  AFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK
        AF WTPDFKPS+AINSSVDVWIRL ELSIEYYD+E+L++IA TIGGVLVK DP+TK+R+KCKFARICIR+NLCDPLPSMIKLGRI+Q+IEYEG DLLCP 
Subjt:  AFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK

Query:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRI
        CR V DLK +CLN  NPSGSSG D   D+P HH+RTRP+ E  S S+SKQPLIPS SSPAS  GSRFQVLEND +LD            ECEK    IRI
Subjt:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRI

Query:  SSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEKILKFHS
        SSP V VK +AA K KE CG    V+ LP LPK+ ST T KAPELE VAP+VVE +FK AKTSNPT+IADH NQP   P +              L F S
Subjt:  SSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEKILKFHS

Query:  DYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDN
           +  T  KE+ +  SKEI VD  P V+TI+ KKI S ++ LS  Q  T ++ N+N Y +D +PT +  DE G  S+ VSGSESCSKKMLCWK HG DN
Subjt:  DYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDN

Query:  AKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKT
        A L+++LKDLI+LH+PSIVLIFGTKISGA+A+ VV+EL+FCGSYCRKPDGYNGGVWLLLSRQDV+IEV+SYSPQQVSASVYF S TN P F+P ++DT+T
Subjt:  AKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKT

Query:  SSGTWGSTFFYTSTNWMTN
        SSG WGSTFFYTSTNWM++
Subjt:  SSGTWGSTFFYTSTNWMTN

KGN50455.1 hypothetical protein Csa_000357 [Cucumis sativus]9.4e-23268.98Show/hide
Query:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYA
        S D  S RSTT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VF+LGLGYFVLKFSETDYLALEDLPWSIPNLCI+A
Subjt:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYA

Query:  FPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC
        FPWTPDFKPS+AINSSV+VWIRL ELSIEYYD  +L+RIA+ IG  LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGR+RQ IEYEGF+ LC KC
Subjt:  FPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC

Query:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC
         RVGDL+HDC           +L+NPSGS GF+PH D+PHH+ TR  KE  S SNSKQPLIP ESSP SAW  SRF  +E +P LDLKL D P+LP  E 
Subjt:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC

Query:  EKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP
         K G+G+RISSPRV VK +   K+KEKC   +SVQ LPNLPK+ STITIKAPEL+ V PSVVEDR K  KT N TMIADH     SQP SPTASIP LQP
Subjt:  EKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP

Query:  ---CEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSK
            E  LKF SD    LT  +EI N+ SK I+   FPTVYTIDPKKITSL IALSE QT            I++VPT+KGGDEGGV SE  SGSE C+K
Subjt:  ---CEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSK

Query:  KMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNV
        K+L WK H MDNAKL+R+LKDLI+LH+PSIVLIFG KISG D D+V++ELAFCGSY  KPDGYNGGVWLLLS+QDV+ +VNS+S QQVSASV FHSETNV
Subjt:  KMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNV

Query:  PVFNPMDLDTKTSSGTWGSTFFYTSTNWMTNTMAY
          F+P + DTKTSSG WGSTFFYTSTNWMT ++AY
Subjt:  PVFNPMDLDTKTSSGTWGSTFFYTSTNWMTNTMAY

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein4.6e-23268.98Show/hide
Query:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYA
        S D  S RSTT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VF+LGLGYFVLKFSETDYLALEDLPWSIPNLCI+A
Subjt:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYA

Query:  FPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC
        FPWTPDFKPS+AINSSV+VWIRL ELSIEYYD  +L+RIA+ IG  LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGR+RQ IEYEGF+ LC KC
Subjt:  FPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC

Query:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC
         RVGDL+HDC           +L+NPSGS GF+PH D+PHH+ TR  KE  S SNSKQPLIP ESSP SAW  SRF  +E +P LDLKL D P+LP  E 
Subjt:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC

Query:  EKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP
         K G+G+RISSPRV VK +   K+KEKC   +SVQ LPNLPK+ STITIKAPEL+ V PSVVEDR K  KT N TMIADH     SQP SPTASIP LQP
Subjt:  EKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP

Query:  ---CEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSK
            E  LKF SD    LT  +EI N+ SK I+   FPTVYTIDPKKITSL IALSE QT            I++VPT+KGGDEGGV SE  SGSE C+K
Subjt:  ---CEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSK

Query:  KMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNV
        K+L WK H MDNAKL+R+LKDLI+LH+PSIVLIFG KISG D D+V++ELAFCGSY  KPDGYNGGVWLLLS+QDV+ +VNS+S QQVSASV FHSETNV
Subjt:  KMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNV

Query:  PVFNPMDLDTKTSSGTWGSTFFYTSTNWMTNTMAY
          F+P + DTKTSSG WGSTFFYTSTNWMT ++AY
Subjt:  PVFNPMDLDTKTSSGTWGSTFFYTSTNWMTNTMAY

A0A5A7SSJ3 DUF4283 domain-containing protein4.3e-23870.59Show/hide
Query:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC
        ++ S D ES  STT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VFELGLGYFVLKFSETDYLALEDLPWSIPNLC
Subjt:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC

Query:  IYAFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC
        I+AFPWTPDFKPS+AINSSV+VWIRL ELSIEYYD E+L+RIA+ IGG LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGRIRQ IEYEGF+ LC
Subjt:  IYAFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC

Query:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGI
         KC RVGDL+HDC +L+NPSGS GF+PH D+PHH+ TR  KE  S S+SKQPLIP ESS  SAW  SRF  +E +P LDLK  + P+LP  E  K GT +
Subjt:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGI

Query:  RISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP---CEK
        RISSP V VK +A  K+KEKC   +SVQPLP+LPK +SSTITIKAPEL+ V PSVVED+ K AKT N TMIADH     SQP SPTASIP LQP    E 
Subjt:  RISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQP---CEK

Query:  ILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWK
         LKF SD    LT  +EI N+ SKE +   FPTVYTIDPKKITSL I+LSE Q  TT++SNQNQY I++VPT+KGGD+GGV  E  SGSE C+KKML WK
Subjt:  ILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWK

Query:  CHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPM
         H MDNAKL+R+LKDLI+LH+PSIVLIFG KI+G DA +V+QELAFCGSY  +PDGYNGGVWLLLS+QDV+ +VNSYSPQQVSASV FHSETNV  F+P 
Subjt:  CHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPM

Query:  DLDTKTSSGTWGSTFFYTSTNWMTNTMAY
        + DT+TSSG WGSTFFYTSTNWMT ++AY
Subjt:  DLDTKTSSGTWGSTFFYTSTNWMTNTMAY

A0A5A7SUD3 DUF4283 domain-containing protein2.6e-14248.33Show/hide
Query:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
        ++ S D  S  STT +TVC  S SQT  I ++F HSLIAW+VGK+IRP +LA  LHRHL LT   +VFELGLGYFVLKF ETD+LALED PW IPNLCIY
Subjt:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY

Query:  AFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK
        AFPWTP+FKPS+A++S++D WIRL EL IEYY +++L+ I +T+G  LVKIDPITKDRKKCK+ARIC+R+N+ +PLPS I++G+I QEIEYEGFD+LCP+
Subjt:  AFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK

Query:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQV---LENDPMLDLKLNDSPSLPMGECEKVGTGI
        C  V  LKHDCLN S    SS F+P  D P +            SNSKQPL+PSESS   AWGSRF+V       P+ +LK    PS  MG  EK  T  
Subjt:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQV---LENDPMLDLKLNDSPSLPMGECEKVGTGI

Query:  RISSPRVP----VKVEAAGKQKEKCGVSVSVQPLPNLPK--------------ESSTITIKAPELEH--------VAPSVVEDRFKTAKTSNPTMIADHK
          SSP +P    +  E   KQKEKCG S  +   PNLPK              ESS+ TI  P  E         +AP   E+ F  A+TS    +  H 
Subjt:  RISSPRVP----VKVEAAGKQKEKCGVSVSVQPLPNLPK--------------ESSTITIKAPELEH--------VAPSVVEDRFKTAKTSNPTMIADHK

Query:  NQPQSQPSSPTASI---PSLQPCEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKG
        NQPQ   S   ASI   PS    + I  F S         K+IT+ SS+   ++  P +YTI P+ I S E+ LSE      +   QNQ++I  V T + 
Subjt:  NQPQSQPSSPTASI---PSLQPCEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKG

Query:  GDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVR---I
        G      ++A+S  +  SKKML W   GMDN  L+  L  +++ ++PSIV+IFGT+I+    ++VV +LAF GSY +K D Y+GGVWL + R+DV+    
Subjt:  GDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVR---I

Query:  EVNSYSPQQVSASVYFHSETNVPVFNPMDLD-TKTSSGTWGSTFFYTSTNWMTNTMAY
        EVNSYS QQVSAS YF  E N P+      D  +TS  TWG   F  ST    N +AY
Subjt:  EVNSYSPQQVSASVYFHSETNVPVFNPMDLD-TKTSSGTWGSTFFYTSTNWMTNTMAY

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X21.4e-15655.92Show/hide
Query:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFK
        ST  ATVCNL+PSQTARI QQFD SLI W+VGK I PRQLA RL R+L L GD++VFELGLG+FVLKFS     Y ALE+ PWSIP+LCIY FPW P+FK
Subjt:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFK

Query:  PSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH
        PS+A    VDVWIRL ELSIEYYD EVL++IAETIGG LVKIDP+T  R+KC +ARICIR+NL  PL    + G+  Q+I YEG DLLC  C  V DLKH
Subjt:  PSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH

Query:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK----------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPM
        DC  LSN S SSGFD     PHH+  RP++                       S S SN K  LIPS+ +PASA GSRFQVLE      L LN+ PSLP+
Subjt:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK----------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPM

Query:  GECEKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-PTASI
         E +K                      + K   S+++ P   L K+++ I     +   +AP V+ED +F+T KTS+PT +A   N+P  QPSS    SI
Subjt:  GECEKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-PTASI

Query:  PSLQPCEKI---LKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGS
          LQP   +   LKF+S   Q  T  K I NT S+ I VD  PT+YTIDP  ITSL I L E  + TT  SNQN++AI IVPT          SEAVS S
Subjt:  PSLQPCEKI---LKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGS

Query:  ES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYF
         S CSKKMLCW     DNAKL+R+LKDLI+LH+PSIVLIFGTKISGADAD VV+ELAF GSYCRKPDGY GG WLLLS+QDV+IEV+SYSPQQVSASV  
Subjt:  ES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASVYF

Query:  HSETNVPV
        HS+ N  V
Subjt:  HSETNVPV

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X12.4e-15655.74Show/hide
Query:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFK
        ST  ATVCNL+PSQTARI QQFD SLI W+VGK I PRQLA RL R+L L GD++VFELGLG+FVLKFS     Y ALE+ PWSIP+LCIY FPW P+FK
Subjt:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYAFPWTPDFK

Query:  PSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH
        PS+A    VDVWIRL ELSIEYYD EVL++IAETIGG LVKIDP+T  R+KC +ARICIR+NL  PL    + G+  Q+I YEG DLLC  C  V DLKH
Subjt:  PSDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH

Query:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK------------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSL
        DC  LSN S SSGFD     PHH+  RP++                         S S SN K  LIPS+ +PASA GSRFQVLE      L LN+ PSL
Subjt:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK------------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSL

Query:  PMGECEKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-PTA
        P+ E +K                      + K   S+++ P   L K+++ I     +   +AP V+ED +F+T KTS+PT +A   N+P  QPSS    
Subjt:  PMGECEKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-PTA

Query:  SIPSLQPCEKI---LKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVS
        SI  LQP   +   LKF+S   Q  T  K I NT S+ I VD  PT+YTIDP  ITSL I L E  + TT  SNQN++AI IVPT          SEAVS
Subjt:  SIPSLQPCEKI---LKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEIALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVS

Query:  GSES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASV
         S S CSKKMLCW     DNAKL+R+LKDLI+LH+PSIVLIFGTKISGADAD VV+ELAF GSYCRKPDGY GG WLLLS+QDV+IEV+SYSPQQVSASV
Subjt:  GSES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNGGVWLLLSRQDVRIEVNSYSPQQVSASV

Query:  YFHSETNVPV
          HS+ N  V
Subjt:  YFHSETNVPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding1.5e-1730.51Show/hide
Query:  LIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIYAFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDD
        +I  ++G  I    L  +L      +G + V +L   +F+++F  E +Y+ AL   PW +    +    W+  F P      +  VW+RL  +   YY  
Subjt:  LIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIYAFPWTPDFKPSDAINSSVDVWIRLHELSIEYYDD

Query:  EVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHDC
         +L  IA  +G  L K+D  T +  K +FAR+CI VNL  PL   + +   R  + YEG   +C  C   G L H C
Subjt:  EVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCATTGGACTTGGAATCTCATCGCTCAACCACCGCCGCCACCGTCTGTAACCTCTCTCCCTCTCAAACTGCTCGGATCACTCAACAGTTCGATCACTCTCT
CATAGCTTGGATCGTCGGAAAAGACATCCGTCCCCGGCAACTCGCCGGCCGCCTTCACCGTCATCTTTGTCTCACCGGAGATGTGGAGGTCTTCGAGCTTGGTCTCGGCT
ACTTCGTGCTCAAATTCTCTGAGACTGACTATTTAGCACTGGAAGACCTTCCCTGGTCCATCCCCAATCTCTGCATCTACGCCTTTCCATGGACTCCCGATTTCAAACCC
TCCGACGCCATCAATTCCTCTGTTGATGTCTGGATCCGCCTCCATGAGCTCTCCATCGAGTATTACGACGACGAAGTTCTGCAGCGAATTGCGGAGACCATTGGCGGCGT
TCTTGTGAAGATCGATCCGATAACGAAAGATCGCAAGAAATGTAAGTTCGCTCGTATCTGTATTAGAGTAAATTTGTGCGATCCCCTTCCATCGATGATCAAACTTGGTA
GAATTCGACAGGAAATTGAGTATGAGGGTTTTGATTTGTTGTGCCCTAAATGTAGACGTGTTGGTGATCTGAAACATGATTGTTTGAATTTGAGTAATCCTTCTGGTTCT
TCTGGATTCGATCCCCATAGAGATAAACCCCACCACAATAGAACTCGTCCTGTTAAGGAATCTGAATCGATTTCAAACTCGAAGCAGCCATTGATTCCTTCTGAATCTTC
ACCAGCATCAGCTTGGGGTTCTAGATTCCAAGTTCTTGAAAATGACCCAATGCTTGATTTGAAATTGAATGACTCGCCAAGCCTTCCAATGGGTGAATGTGAAAAAGTAG
GTACAGGTATAAGAATAAGTTCTCCACGTGTTCCTGTGAAGGTTGAAGCAGCTGGAAAGCAAAAGGAGAAATGTGGAGTCTCTGTCTCTGTTCAACCATTGCCCAACTTG
CCAAAAGAATCTTCAACAATAACCATTAAAGCTCCTGAGTTAGAACATGTAGCTCCTTCTGTTGTTGAGGATCGGTTCAAGACTGCAAAAACAAGCAACCCCACCATGAT
TGCAGACCATAAGAACCAACCACAATCACAACCATCATCACCCACAGCAAGCATTCCATCCCTACAACCATGTGAAAAGATCCTCAAGTTCCACTCGGATTATTTCCAGG
GCTTGACAACAATGAAAGAGATAACCAACACATCTTCTAAAGAGATCCATGTCGATTGTTTCCCCACTGTTTACACCATCGACCCAAAGAAGATCACAAGCCTTGAAATT
GCTTTGTCAGAAGCACAAACAAGAACAACCGCATTATCGAACCAAAATCAATATGCTATAGACATTGTTCCGACTTTGAAAGGTGGAGACGAAGGTGGTGTTGATTCGGA
GGCGGTATCAGGATCAGAATCATGTTCTAAGAAGATGCTGTGCTGGAAGTGTCATGGGATGGACAATGCCAAGCTTGTGCGATCATTGAAAGATCTGATTAAGCTGCACC
AGCCATCCATTGTGCTGATCTTTGGCACCAAGATCAGTGGTGCTGATGCGGATCAGGTCGTGCAAGAGCTCGCTTTCTGCGGTTCATACTGCAGAAAGCCCGATGGCTAC
AATGGTGGTGTTTGGCTGTTATTGTCCCGGCAAGATGTGCGAATTGAAGTCAACTCATACAGCCCACAACAAGTTTCTGCATCAGTATATTTTCATTCTGAAACCAATGT
ACCAGTGTTCAATCCTATGGATTTAGATACCAAAACATCATCGGGAACGTGGGGATCGACTTTCTTCTATACTTCGACGAACTGGATGACCAACACAATGGCATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTCATTGGACTTGGAATCTCATCGCTCAACCACCGCCGCCACCGTCTGTAACCTCTCTCCCTCTCAAACTGCTCGGATCACTCAACAGTTCGATCACTCTCT
CATAGCTTGGATCGTCGGAAAAGACATCCGTCCCCGGCAACTCGCCGGCCGCCTTCACCGTCATCTTTGTCTCACCGGAGATGTGGAGGTCTTCGAGCTTGGTCTCGGCT
ACTTCGTGCTCAAATTCTCTGAGACTGACTATTTAGCACTGGAAGACCTTCCCTGGTCCATCCCCAATCTCTGCATCTACGCCTTTCCATGGACTCCCGATTTCAAACCC
TCCGACGCCATCAATTCCTCTGTTGATGTCTGGATCCGCCTCCATGAGCTCTCCATCGAGTATTACGACGACGAAGTTCTGCAGCGAATTGCGGAGACCATTGGCGGCGT
TCTTGTGAAGATCGATCCGATAACGAAAGATCGCAAGAAATGTAAGTTCGCTCGTATCTGTATTAGAGTAAATTTGTGCGATCCCCTTCCATCGATGATCAAACTTGGTA
GAATTCGACAGGAAATTGAGTATGAGGGTTTTGATTTGTTGTGCCCTAAATGTAGACGTGTTGGTGATCTGAAACATGATTGTTTGAATTTGAGTAATCCTTCTGGTTCT
TCTGGATTCGATCCCCATAGAGATAAACCCCACCACAATAGAACTCGTCCTGTTAAGGAATCTGAATCGATTTCAAACTCGAAGCAGCCATTGATTCCTTCTGAATCTTC
ACCAGCATCAGCTTGGGGTTCTAGATTCCAAGTTCTTGAAAATGACCCAATGCTTGATTTGAAATTGAATGACTCGCCAAGCCTTCCAATGGGTGAATGTGAAAAAGTAG
GTACAGGTATAAGAATAAGTTCTCCACGTGTTCCTGTGAAGGTTGAAGCAGCTGGAAAGCAAAAGGAGAAATGTGGAGTCTCTGTCTCTGTTCAACCATTGCCCAACTTG
CCAAAAGAATCTTCAACAATAACCATTAAAGCTCCTGAGTTAGAACATGTAGCTCCTTCTGTTGTTGAGGATCGGTTCAAGACTGCAAAAACAAGCAACCCCACCATGAT
TGCAGACCATAAGAACCAACCACAATCACAACCATCATCACCCACAGCAAGCATTCCATCCCTACAACCATGTGAAAAGATCCTCAAGTTCCACTCGGATTATTTCCAGG
GCTTGACAACAATGAAAGAGATAACCAACACATCTTCTAAAGAGATCCATGTCGATTGTTTCCCCACTGTTTACACCATCGACCCAAAGAAGATCACAAGCCTTGAAATT
GCTTTGTCAGAAGCACAAACAAGAACAACCGCATTATCGAACCAAAATCAATATGCTATAGACATTGTTCCGACTTTGAAAGGTGGAGACGAAGGTGGTGTTGATTCGGA
GGCGGTATCAGGATCAGAATCATGTTCTAAGAAGATGCTGTGCTGGAAGTGTCATGGGATGGACAATGCCAAGCTTGTGCGATCATTGAAAGATCTGATTAAGCTGCACC
AGCCATCCATTGTGCTGATCTTTGGCACCAAGATCAGTGGTGCTGATGCGGATCAGGTCGTGCAAGAGCTCGCTTTCTGCGGTTCATACTGCAGAAAGCCCGATGGCTAC
AATGGTGGTGTTTGGCTGTTATTGTCCCGGCAAGATGTGCGAATTGAAGTCAACTCATACAGCCCACAACAAGTTTCTGCATCAGTATATTTTCATTCTGAAACCAATGT
ACCAGTGTTCAATCCTATGGATTTAGATACCAAAACATCATCGGGAACGTGGGGATCGACTTTCTTCTATACTTCGACGAACTGGATGACCAACACAATGGCATACTGA
Protein sequenceShow/hide protein sequence
MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYAFPWTPDFKP
SDAINSSVDVWIRLHELSIEYYDDEVLQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHDCLNLSNPSGS
SGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGTGIRISSPRVPVKVEAAGKQKEKCGVSVSVQPLPNL
PKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEKILKFHSDYFQGLTTMKEITNTSSKEIHVDCFPTVYTIDPKKITSLEI
ALSEAQTRTTALSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGY
NGGVWLLLSRQDVRIEVNSYSPQQVSASVYFHSETNVPVFNPMDLDTKTSSGTWGSTFFYTSTNWMTNTMAY