; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040697 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040697
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr13:7425056..7426933
RNA-Seq ExpressionLag0040697
SyntenyLag0040697
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034062.1 uncharacterized protein E6C27_scaffold65G00480 [Cucumis melo var. makuwa]2.8e-23670.21Show/hide
Query:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC
        ++ S D ES  STT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VFELGLGYFVLKFSETDYLALEDLPWSIPNLC
Subjt:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC

Query:  IYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC
        I+ FPWTPDFKPSEAINS V+VWIRL ELSIEYYD EIL+RIA+ IGG LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGRIRQ IEYEGF+ LC
Subjt:  IYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC

Query:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGI
         KC RVGDL+HDC +L+NPSGS GF+PH D+PHH+ TR  KE  S S+SKQPLIP ESS  SAW  SRF  +E +P LDLK  + P+LP  E  K G  +
Subjt:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGI

Query:  RISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQ--PCE
        RISSP V VK +A  K+KEKC     +SVQPLP+LPK +SSTITIKAPEL+ V PSVVED+ K AKT N TMIADH     SQP SPTASIP LQ  P  
Subjt:  RISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQ--PCE

Query:  EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLC
        E  LKF SD    LT  +EI N+P KE N   FPTVYTIDPKKITSL I+LSE Q  TT+ SNQNQY I++VPT+KGGD+GGV  E  SGSE C+KKML 
Subjt:  EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLC

Query:  WKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSN
        WK H MDNAKL+R+LKDLI+LH+PSIVLIFG KI+G DA +V+QELAFCGSY  +PDGYN GVWLLLS+QDVQ +VNSYSP+QV+ASV FHSETNV   +
Subjt:  WKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSN

Query:  PMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY
        P + DT+TSSGPWGSTFFYTSTNWMT +LAY
Subjt:  PMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia]2.8e-23669.77Show/hide
Query:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
        MASS DLESHRSTT ATVCNLSPSQTARITQQFDHSLIAW+ G+DIRPRQLAGRL RHL LT DVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
Subjt:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY

Query:  VFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK
         F WTPDFKPSEAINS VDVWIRLHELSIEYYD+EIL++IA TIGGVLVK DP+TK+R+KCKFARICIR+NLCDPLPSMIKLGRI+Q+IEYEG DLLCP 
Subjt:  VFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK

Query:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRI
        CR V DLK +CLN  NPSGSSG D   D+P HH+RTRP+ E  S S+SKQPLIPS SSPASA GSRFQVLEND +LD            ECEK    IRI
Subjt:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRI

Query:  SSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKILK
        SSP V VK +AA K KE CG      V+ LP LPK+ ST T KAPELE VAP+VVE +FK AKTSNPT+IADH NQP   P +               L 
Subjt:  SSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKILK

Query:  FHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHG
        F S + R     KE+ + P KEI VDG P V+TI+ KKI S ++ LS  QT +    N+N Y +D +PT +  DE G  S+ VSGSESCSKKMLCWK HG
Subjt:  FHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHG

Query:  MDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLD
         DNA L+++LKDLI+LH+PSIVLIFGTKISGA+A+ VV+EL+FCGSYCRKPDGYN GVWLLLSRQDVQIEV+SYSP+QV+ASVYF S TN P  +P ++D
Subjt:  MDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLD

Query:  TKTSSGPWGSTFFYTSTNWMTN
        T+TSSGPWGSTFFYTSTNWM++
Subjt:  TKTSSGPWGSTFFYTSTNWMTN

KAG6601052.1 hypothetical protein SDJN03_06285, partial [Cucurbita argyrosperma subsp. sororia]1.4e-21165.21Show/hide
Query:  RSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYVFPWTPDFKP
        R+   ATVC LS SQTARITQQFDHS IAWI GKD+RP ++A  L RHLCLTG V+VFELGLGYFVLKF ETD+LAL+DLPWS+PNLCI+V PWTPDFKP
Subjt:  RSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYVFPWTPDFKP

Query:  SEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHD
        SE I S VDVW+RLHELSIEYYDDE+LQ+IA  IGG LVKIDP+TK+R KCKFARIC+RVNLCDPLPSMI+LG+IRQEIEYEGF+LLCP C RV  L+H+
Subjt:  SEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHD

Query:  CLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRISSPRVPVKVEA
        CLNL  PSG SGF+PHR KPHH+  R         + KQPLIPSESS  S  GSRFQV      LDL  N+ P+L  GE  K G  IR SS  V VK +A
Subjt:  CLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRISSPRVPVKVEA

Query:  AGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTS-NPTMIADHKNQPQSQPSSPTASIPSLQPC--EEKILKFHSDMFRG
          K+KEKCG    VSVQP   LPKESS +TIK             D+ K AKTS NPT+          QP+SPT S+P L PC   E IL FHS   + 
Subjt:  AGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTS-NPTMIADHKNQPQSQPSSPTASIPSLQPC--EEKILKFHSDMFRG

Query:  LTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVR
         T  KEIT+ P KEINVD  PTVYTIDP KI +L+IALSE  TRTT+ SNQ QYAI+ VPT + GD+GGVD    SGSESC KK+LCWK H  DN KL+R
Subjt:  LTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVR

Query:  SLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLDTKTSSGPW
        SLKDLIKLH+PSIVLIFGTKISGAD D+VVQEL FC SY RKPDGY+ GVWLLLS QDV+ +VNS SP+Q+ AS+YF S+TN    NP  + TK SSGPW
Subjt:  SLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLDTKTSSGPW

Query:  GSTFFYTSTNWMTNTLAY
        GS FF+T TNWMT ++AY
Subjt:  GSTFFYTSTNWMTNTLAY

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-23569.61Show/hide
Query:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
        MASS DLESHRSTT ATVCNLSPSQTARITQQFDHSLIAW+ G+DIRPRQLAGRL RHL LT DVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY
Subjt:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIY

Query:  VFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK
         F WTPDFKPSEAINS VDVWIRL ELSIEYYD+EIL++IA TIGGVLVK DP+TK+R+KCKFARICIR+NLCDPLPSMIKLGRI+Q+IEYEG DLLCP 
Subjt:  VFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPK

Query:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRI
        CR V DLK +CLN  NPSGSSG D   D+P HH+RTRP+ E  S S+SKQPLIPS SSPAS  GSRFQVLEND +LD            ECEK    IRI
Subjt:  CRRVGDLKHDCLNLSNPSGSSGFDPHRDKP-HHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRI

Query:  SSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKILK
        SSP V VK +AA K KE CG      V+ LP LPK+ ST T KAPELE VAP+VVE +FK AKTSNPT+IADH NQP   P +               L 
Subjt:  SSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKILK

Query:  FHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHG
        F S + R  T  KE+ + P KEI VDG P V+TI+ KKI S ++ LS  QT +    N+N Y +D +PT +  DE G  S+ VSGSESCSKKMLCWK HG
Subjt:  FHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHG

Query:  MDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLD
         DNA L+++LKDLI+LH+PSIVLIFGTKISGA+A+ VV+EL+FCGSYCRKPDGYN GVWLLLSRQDVQIEV+SYSP+QV+ASVYF S TN P  +P ++D
Subjt:  MDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLD

Query:  TKTSSGPWGSTFFYTSTNWMTN
        T+TSSGPWGSTFFYTSTNWM++
Subjt:  TKTSSGPWGSTFFYTSTNWMTN

KGN50455.1 hypothetical protein Csa_000357 [Cucumis sativus]3.6e-23168.76Show/hide
Query:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYV
        S D  S RSTT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VF+LGLGYFVLKFSETDYLALEDLPWSIPNLCI+ 
Subjt:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYV

Query:  FPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC
        FPWTPDFKPSEAINS V+VWIRL ELSIEYYD  IL+RIA+ IG  LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGR+RQ IEYEGF+ LC KC
Subjt:  FPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC

Query:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC
         RVGDL+HDC           +L+NPSGS GF+PH D+PHH+ TR  KE  S SNSKQPLIP ESSP SAW  SRF  +E +P LDLKL D P+LP  E 
Subjt:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC

Query:  EKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSL
         K G+G+RISSPRV VK +   K+KEKC     +SVQ LPNLPK+ STITIKAPEL+ V PSVVEDR K  KT N TMIADH     SQP SPTASIP L
Subjt:  EKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSL

Query:  Q--PCEEKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESC
        Q  P  E  LKF SD    LT  +EI N+P K IN   FPTVYTIDPKKITSL IALSE QT            I++VPT+KGGDEGGV SE  SGSE C
Subjt:  Q--PCEEKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESC

Query:  SKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSET
        +KK+L WK H MDNAKL+R+LKDLI+LH+PSIVLIFG KISG D D+V++ELAFCGSY  KPDGYN GVWLLLS+QDVQ +VNS+S +QV+ASV FHSET
Subjt:  SKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSET

Query:  NVPVSNPMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY
        NV   +P + DTKTSSGPWGSTFFYTSTNWMT +LAY
Subjt:  NVPVSNPMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein1.7e-23168.76Show/hide
Query:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYV
        S D  S RSTT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VF+LGLGYFVLKFSETDYLALEDLPWSIPNLCI+ 
Subjt:  SLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYV

Query:  FPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC
        FPWTPDFKPSEAINS V+VWIRL ELSIEYYD  IL+RIA+ IG  LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGR+RQ IEYEGF+ LC KC
Subjt:  FPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKC

Query:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC
         RVGDL+HDC           +L+NPSGS GF+PH D+PHH+ TR  KE  S SNSKQPLIP ESSP SAW  SRF  +E +P LDLKL D P+LP  E 
Subjt:  RRVGDLKHDCL----------NLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGEC

Query:  EKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSL
         K G+G+RISSPRV VK +   K+KEKC     +SVQ LPNLPK+ STITIKAPEL+ V PSVVEDR K  KT N TMIADH     SQP SPTASIP L
Subjt:  EKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSL

Query:  Q--PCEEKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESC
        Q  P  E  LKF SD    LT  +EI N+P K IN   FPTVYTIDPKKITSL IALSE QT            I++VPT+KGGDEGGV SE  SGSE C
Subjt:  Q--PCEEKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESC

Query:  SKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSET
        +KK+L WK H MDNAKL+R+LKDLI+LH+PSIVLIFG KISG D D+V++ELAFCGSY  KPDGYN GVWLLLS+QDVQ +VNS+S +QV+ASV FHSET
Subjt:  SKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSET

Query:  NVPVSNPMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY
        NV   +P + DTKTSSGPWGSTFFYTSTNWMT +LAY
Subjt:  NVPVSNPMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY

A0A5A7SSJ3 DUF4283 domain-containing protein1.4e-23670.21Show/hide
Query:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC
        ++ S D ES  STT ATV  CNL+PS+T RITQQF HSLIA +VGKD RP QLA RL  HL LT DV+VFELGLGYFVLKFSETDYLALEDLPWSIPNLC
Subjt:  MASSLDLESHRSTTAATV--CNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLC

Query:  IYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC
        I+ FPWTPDFKPSEAINS V+VWIRL ELSIEYYD EIL+RIA+ IGG LVKIDP+T+DR KCKFAR CI VNLCDPLPSMI+LGRIRQ IEYEGF+ LC
Subjt:  IYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC

Query:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGI
         KC RVGDL+HDC +L+NPSGS GF+PH D+PHH+ TR  KE  S S+SKQPLIP ESS  SAW  SRF  +E +P LDLK  + P+LP  E  K G  +
Subjt:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAW-GSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGI

Query:  RISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQ--PCE
        RISSP V VK +A  K+KEKC     +SVQPLP+LPK +SSTITIKAPEL+ V PSVVED+ K AKT N TMIADH     SQP SPTASIP LQ  P  
Subjt:  RISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPK-ESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQ--PCE

Query:  EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLC
        E  LKF SD    LT  +EI N+P KE N   FPTVYTIDPKKITSL I+LSE Q  TT+ SNQNQY I++VPT+KGGD+GGV  E  SGSE C+KKML 
Subjt:  EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLC

Query:  WKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSN
        WK H MDNAKL+R+LKDLI+LH+PSIVLIFG KI+G DA +V+QELAFCGSY  +PDGYN GVWLLLS+QDVQ +VNSYSP+QV+ASV FHSETNV   +
Subjt:  WKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSN

Query:  PMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY
        P + DT+TSSGPWGSTFFYTSTNWMT +LAY
Subjt:  PMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY

A0A5A7SY10 DUF4283 domain-containing protein8.3e-14148.93Show/hide
Query:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLC
        M+SS +   H  T      +L+PSQTARI Q F HSLIA + G ++  R LA RL R+L LTGD++VFEL LG+FVLKFS  +DY  ALE+LPWSI +LC
Subjt:  MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLC

Query:  IYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC
        I+V PW P+FKPSEA+   VDVWIRL EL IEYYD EIL++IAE IG  LVKIDP+T+ R+KC FARICIR+ LC+PL   I+ G+  Q+++YEG D LC
Subjt:  IYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLC

Query:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESIS-NSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGI
          C  + +LKH CLN +NPSGSSG DPH+  P   +      S  ++ +SK+PLI S  S  SA GS+ Q  E +P L+LKL D P L MG+        
Subjt:  PKCRRVGDLKHDCLNLSNPSGSSGFDPHRDKPHHNRTRPVKESESIS-NSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGI

Query:  RISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKI
                                V    + LPN P+ESST T + PE   +A  +V D+F+ AK S+PT +    N   S  S+  A I S     ++ 
Subjt:  RISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKI

Query:  LKFHSDMFRGLTTMKEITNTPFKEIN-VDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWK
        +           T K++ NTPF  I  VD +PTVYTIDP   T++ + +  ++  T   SNQN+YAI+ V   +  ++  VDS+A S    C KKMLCW 
Subjt:  LKFHSDMFRGLTTMKEITNTPFKEIN-VDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWK

Query:  CHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPM
          GMD AKL+++ K LI+L +PSIVLIFG+KIS ADA++VV+ELAF GSYCRKPDGYN GVW++LS QDV+IEV+SYSP++V+ASVYF S+ N P    +
Subjt:  CHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPM

Query:  DLDTKTSSG
        D DT+TS G
Subjt:  DLDTKTSSG

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X24.8e-15756.07Show/hide
Query:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYVFPWTPDFK
        ST  ATVCNL+PSQTARI QQFD SLI W+VGK I PRQLA RL R+L L GD++VFELGLG+FVLKFS     Y ALE+ PWSIP+LCIYVFPW P+FK
Subjt:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYVFPWTPDFK

Query:  PSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH
        PSEA   FVDVWIRL ELSIEYYD E+L++IAETIGG LVKIDP+T  R+KC +ARICIR+NL  PL    + G+  Q+I YEG DLLC  C  V DLKH
Subjt:  PSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH

Query:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK----------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPM
        DC  LSN S SSGFD     PHH+  RP++                       S S SN K  LIPS+ +PASA GSRFQVLE      L LN+ PSLP+
Subjt:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK----------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPM

Query:  GECEKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-PTA
         E +K                    + KE    S S+++ P   L K+++ I     +   +AP V+ED +F+T KTS+PT +A   N+P  QPSS    
Subjt:  GECEKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-PTA

Query:  SIPSLQPCE--EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVS
        SI  LQP    E  LKF+S   +  T  K I NTP + I+VD  PT+YTIDP  ITSL I L E  + TT  SNQN++AI IVPT          SEAVS
Subjt:  SIPSLQPCE--EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVS

Query:  GSES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASV
         S S CSKKMLCW     DNAKL+R+LKDLI+LH+PSIVLIFGTKISGADAD VV+ELAF GSYCRKPDGY  G WLLLS+QDVQIEV+SYSP+QV+ASV
Subjt:  GSES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTASV

Query:  YFHSETNVPV
          HS+ N  V
Subjt:  YFHSETNVPV

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X18.2e-15755.88Show/hide
Query:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYVFPWTPDFK
        ST  ATVCNL+PSQTARI QQFD SLI W+VGK I PRQLA RL R+L L GD++VFELGLG+FVLKFS     Y ALE+ PWSIP+LCIYVFPW P+FK
Subjt:  STTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSET--DYLALEDLPWSIPNLCIYVFPWTPDFK

Query:  PSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH
        PSEA   FVDVWIRL ELSIEYYD E+L++IAETIGG LVKIDP+T  R+KC +ARICIR+NL  PL    + G+  Q+I YEG DLLC  C  V DLKH
Subjt:  PSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKH

Query:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK------------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSL
        DC  LSN S SSGFD     PHH+  RP++                         S S SN K  LIPS+ +PASA GSRFQVLE      L LN+ PSL
Subjt:  DCLNLSNPSGSSGFDPHRDKPHHNRTRPVK------------------------ESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSL

Query:  PMGECEKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-P
        P+ E +K                    + KE    S S+++ P   L K+++ I     +   +AP V+ED +F+T KTS+PT +A   N+P  QPSS  
Subjt:  PMGECEKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLPNLPKESSTITIKAPELEHVAPSVVED-RFKTAKTSNPTMIADHKNQPQSQPSS-P

Query:  TASIPSLQPCE--EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEA
          SI  LQP    E  LKF+S   +  T  K I NTP + I+VD  PT+YTIDP  ITSL I L E  + TT  SNQN++AI IVPT          SEA
Subjt:  TASIPSLQPCE--EKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITSLEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEA

Query:  VSGSES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTA
        VS S S CSKKMLCW     DNAKL+R+LKDLI+LH+PSIVLIFGTKISGADAD VV+ELAF GSYCRKPDGY  G WLLLS+QDVQIEV+SYSP+QV+A
Subjt:  VSGSES-CSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKPDGYNDGVWLLLSRQDVQIEVNSYSPKQVTA

Query:  SVYFHSETNVPV
        SV  HS+ N  V
Subjt:  SVYFHSETNVPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding8.8e-1831.07Show/hide
Query:  LIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDD
        +I  ++G  I    L  +L      +G + V +L   +F+++F  E +Y+ AL   PW +    + V  W+  F P         VW+RL  +   YY  
Subjt:  LIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFS-ETDYL-ALEDLPWSIPNLCIYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDD

Query:  EILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHDC
         +L  IA  +G  L K+D  T +  K +FAR+CI VNL  PL   + +   R  + YEG   +C  C   G L H C
Subjt:  EILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHDC

AT5G36228.1 nucleic acid binding;zinc ion binding1.2e-0624.63Show/hide
Query:  FVLKF-SETDYL-ALEDLPWSIPNLCIYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNL
        F ++F SE D L  L   PW      I +  W  DF P+E   +F+DVW+ +  + + Y  +  ++ IA T+G V V +D   +   +  F R+ +R++ 
Subjt:  FVLKF-SETDYL-ALEDLPWSIPNLCIYVFPWTPDFKPSEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNL

Query:  CDPLPSMIKLGRIRQE-----IEYEGFDLLCPKCRRVGDLKHDCLNL-------SNPSGSSGFDPHRDKPHHNRTRPVKESE-SISNSKQPLIP--SESS
         +PL    ++    +E      EYE    +C  C RV      C  +       + P      + + D+   N+    + S+ S+ +S   L P    + 
Subjt:  CDPLPSMIKLGRIRQE-----IEYEGFDLLCPKCRRVGDLKHDCLNL-------SNPSGSSGFDPHRDKPHHNRTRPVKESE-SISNSKQPLIP--SESS

Query:  PASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRISS---PRVPVKVEAAGKQKEKCGVSV
        P   W        ND M+    +  PS  +     V  G   +S   P+  V  E     K K G  V
Subjt:  PASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRISS---PRVPVKVEAAGKQKEKCGVSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCATTGGACTTGGAATCTCATCGCTCAACCACCGCCGCCACCGTCTGTAACCTCTCTCCCTCTCAAACTGCTCGGATCACTCAACAGTTCGATCACTCTCT
CATAGCTTGGATCGTCGGAAAAGACATCCGTCCCCGGCAACTCGCCGGCCGCCTTCACCGTCATCTTTGTCTCACCGGAGATGTGGAGGTCTTCGAGCTTGGTCTCGGCT
ACTTCGTGCTCAAATTCTCTGAGACTGACTATTTAGCACTGGAAGACCTTCCCTGGTCCATCCCCAATCTCTGCATCTACGTCTTTCCATGGACTCCCGATTTCAAACCC
TCCGAGGCCATCAATTCCTTTGTCGATGTCTGGATCCGCCTCCATGAGCTCTCCATCGAGTATTACGACGACGAAATTCTGCAGCGAATTGCGGAGACCATTGGCGGCGT
TCTTGTGAAGATCGATCCGATAACGAAAGATCGCAAGAAATGTAAGTTCGCTCGTATCTGTATTAGAGTAAATTTGTGCGATCCCCTTCCATCGATGATCAAACTTGGTA
GAATTCGACAGGAAATTGAGTATGAGGGTTTTGATTTGTTGTGCCCTAAATGTAGACGTGTTGGTGATCTGAAACATGATTGTTTGAATTTGAGTAATCCTTCTGGTTCT
TCTGGATTCGATCCCCATAGAGATAAACCCCACCACAATAGAACTCGTCCTGTTAAGGAATCTGAATCGATTTCAAACTCGAAGCAGCCATTGATTCCTTCTGAATCTTC
ACCAGCATCAGCTTGGGGTTCTAGATTCCAAGTTCTTGAAAATGACCCAATGCTTGATTTGAAATTGAATGACTCGCCAAGCCTTCCAATGGGTGAATGTGAAAAAGTAG
GTGCAGGTATAAGAATAAGTTCCCCACGTGTTCCTGTGAAGGTTGAAGCAGCTGGAAAGCAAAAGGAGAAATGTGGAGTCTCTGTCTCTGTCTCTGTTCAACCATTGCCC
AACTTGCCAAAAGAATCTTCAACAATAACCATTAAAGCTCCTGAGTTAGAACATGTAGCTCCTTCTGTTGTTGAGGATCGGTTCAAGACTGCAAAAACCAGCAACCCCAC
CATGATTGCAGACCATAAGAACCAACCACAATCACAACCATCATCACCCACAGCAAGCATTCCATCCCTACAACCATGTGAAGAGAAGATCCTCAAGTTCCACTCGGATA
TGTTCCGGGGCTTGACAACAATGAAAGAGATAACCAACACACCTTTTAAAGAGATCAATGTCGATGGTTTCCCCACTGTTTACACCATCGACCCAAAGAAGATCACAAGC
CTTGAAATTGCTTTGTCAGAAGCACAAACAAGAACAACCGCATCATCGAACCAAAATCAATATGCTATAGACATTGTTCCGACTTTGAAAGGTGGAGACGAAGGTGGTGT
TGATTCGGAGGCGGTATCAGGATCAGAATCATGTTCTAAGAAGATGCTGTGCTGGAAGTGTCATGGGATGGACAATGCCAAGCTTGTGCGATCATTGAAAGATCTGATTA
AGCTGCACCAGCCATCCATTGTGTTGATATTTGGCACCAAGATCAGTGGTGCTGATGCGGATCAGGTCGTGCAAGAGCTCGCTTTCTGCGGTTCATACTGCAGAAAACCC
GATGGCTACAATGATGGTGTTTGGCTGTTATTGTCCCGGCAAGATGTGCAAATTGAAGTCAACTCATACAGCCCAAAACAGGTTACTGCATCAGTATATTTTCATTCTGA
AACCAATGTACCAGTGTCCAATCCTATGGATTTAGATACCAAAACATCATCGGGACCGTGGGGATCGACTTTCTTCTATACTTCGACGAACTGGATGACCAACACATTGG
CATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTCATTGGACTTGGAATCTCATCGCTCAACCACCGCCGCCACCGTCTGTAACCTCTCTCCCTCTCAAACTGCTCGGATCACTCAACAGTTCGATCACTCTCT
CATAGCTTGGATCGTCGGAAAAGACATCCGTCCCCGGCAACTCGCCGGCCGCCTTCACCGTCATCTTTGTCTCACCGGAGATGTGGAGGTCTTCGAGCTTGGTCTCGGCT
ACTTCGTGCTCAAATTCTCTGAGACTGACTATTTAGCACTGGAAGACCTTCCCTGGTCCATCCCCAATCTCTGCATCTACGTCTTTCCATGGACTCCCGATTTCAAACCC
TCCGAGGCCATCAATTCCTTTGTCGATGTCTGGATCCGCCTCCATGAGCTCTCCATCGAGTATTACGACGACGAAATTCTGCAGCGAATTGCGGAGACCATTGGCGGCGT
TCTTGTGAAGATCGATCCGATAACGAAAGATCGCAAGAAATGTAAGTTCGCTCGTATCTGTATTAGAGTAAATTTGTGCGATCCCCTTCCATCGATGATCAAACTTGGTA
GAATTCGACAGGAAATTGAGTATGAGGGTTTTGATTTGTTGTGCCCTAAATGTAGACGTGTTGGTGATCTGAAACATGATTGTTTGAATTTGAGTAATCCTTCTGGTTCT
TCTGGATTCGATCCCCATAGAGATAAACCCCACCACAATAGAACTCGTCCTGTTAAGGAATCTGAATCGATTTCAAACTCGAAGCAGCCATTGATTCCTTCTGAATCTTC
ACCAGCATCAGCTTGGGGTTCTAGATTCCAAGTTCTTGAAAATGACCCAATGCTTGATTTGAAATTGAATGACTCGCCAAGCCTTCCAATGGGTGAATGTGAAAAAGTAG
GTGCAGGTATAAGAATAAGTTCCCCACGTGTTCCTGTGAAGGTTGAAGCAGCTGGAAAGCAAAAGGAGAAATGTGGAGTCTCTGTCTCTGTCTCTGTTCAACCATTGCCC
AACTTGCCAAAAGAATCTTCAACAATAACCATTAAAGCTCCTGAGTTAGAACATGTAGCTCCTTCTGTTGTTGAGGATCGGTTCAAGACTGCAAAAACCAGCAACCCCAC
CATGATTGCAGACCATAAGAACCAACCACAATCACAACCATCATCACCCACAGCAAGCATTCCATCCCTACAACCATGTGAAGAGAAGATCCTCAAGTTCCACTCGGATA
TGTTCCGGGGCTTGACAACAATGAAAGAGATAACCAACACACCTTTTAAAGAGATCAATGTCGATGGTTTCCCCACTGTTTACACCATCGACCCAAAGAAGATCACAAGC
CTTGAAATTGCTTTGTCAGAAGCACAAACAAGAACAACCGCATCATCGAACCAAAATCAATATGCTATAGACATTGTTCCGACTTTGAAAGGTGGAGACGAAGGTGGTGT
TGATTCGGAGGCGGTATCAGGATCAGAATCATGTTCTAAGAAGATGCTGTGCTGGAAGTGTCATGGGATGGACAATGCCAAGCTTGTGCGATCATTGAAAGATCTGATTA
AGCTGCACCAGCCATCCATTGTGTTGATATTTGGCACCAAGATCAGTGGTGCTGATGCGGATCAGGTCGTGCAAGAGCTCGCTTTCTGCGGTTCATACTGCAGAAAACCC
GATGGCTACAATGATGGTGTTTGGCTGTTATTGTCCCGGCAAGATGTGCAAATTGAAGTCAACTCATACAGCCCAAAACAGGTTACTGCATCAGTATATTTTCATTCTGA
AACCAATGTACCAGTGTCCAATCCTATGGATTTAGATACCAAAACATCATCGGGACCGTGGGGATCGACTTTCTTCTATACTTCGACGAACTGGATGACCAACACATTGG
CATACTGA
Protein sequenceShow/hide protein sequence
MASSLDLESHRSTTAATVCNLSPSQTARITQQFDHSLIAWIVGKDIRPRQLAGRLHRHLCLTGDVEVFELGLGYFVLKFSETDYLALEDLPWSIPNLCIYVFPWTPDFKP
SEAINSFVDVWIRLHELSIEYYDDEILQRIAETIGGVLVKIDPITKDRKKCKFARICIRVNLCDPLPSMIKLGRIRQEIEYEGFDLLCPKCRRVGDLKHDCLNLSNPSGS
SGFDPHRDKPHHNRTRPVKESESISNSKQPLIPSESSPASAWGSRFQVLENDPMLDLKLNDSPSLPMGECEKVGAGIRISSPRVPVKVEAAGKQKEKCGVSVSVSVQPLP
NLPKESSTITIKAPELEHVAPSVVEDRFKTAKTSNPTMIADHKNQPQSQPSSPTASIPSLQPCEEKILKFHSDMFRGLTTMKEITNTPFKEINVDGFPTVYTIDPKKITS
LEIALSEAQTRTTASSNQNQYAIDIVPTLKGGDEGGVDSEAVSGSESCSKKMLCWKCHGMDNAKLVRSLKDLIKLHQPSIVLIFGTKISGADADQVVQELAFCGSYCRKP
DGYNDGVWLLLSRQDVQIEVNSYSPKQVTASVYFHSETNVPVSNPMDLDTKTSSGPWGSTFFYTSTNWMTNTLAY