; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015871 (gene) of Chayote v1 genome

Gene IDSed0015871
OrganismSechium edule (Chayote v1)
DescriptionHomeodomain-like superfamily protein
Genome locationLG10:5698126..5700991
RNA-Seq ExpressionSed0015871
SyntenySed0015871
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594922.1 hypothetical protein SDJN03_11475, partial [Cucurbita argyrosperma subsp. sororia]2.0e-17465.6Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNAL+GK+AK+SKS +T     +  K          SN+G KKSSTF +G EG+QTPRRSSRLS    ID   EGKN K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +QTPRRSSRLSYA  IDN L+GKNTK+SKSSIT      K  + PR++N V G QSI K +                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R  +KKT LL+V NVV ++SSH ENVVE  +RRKGNSADHE  AT+G        E+EKKSVA RKRKRED VVGIR+GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ++KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFSS VERVVSPPVLKQVKNKALHEKYIDQLH REAKRKS++KCTK CIS E
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLR AKNALISDA+D IHQLQHLQAN M+++PDFDD  DLYD+V  ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

KAG7026885.1 hypothetical protein SDJN02_10892, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-17465.6Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNAL+GK+AK+SKS +T     +  K          SN+G KKSSTF +G EG+QTPRRSSRLS    ID   EGKN K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +QTPRRSSRLSYA  IDN L+GKNTK+SKSSIT      K  + PR++N V G QSI K +                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R  +KKT LL+V NVV ++SSH ENVVE  +RRKGNSADHE  AT+G        E+EKKSVA RKRKRED VVGIR+GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ++KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFSS VERVVSPPVLKQVKNKALHEKYIDQLH REAKRKS++KCTK CIS E
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLR AKNALISDA+D IHQLQHLQAN M+++PDFDD  DLYD+V  ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

XP_022963424.1 uncharacterized protein LOC111463634 [Cucurbita moschata]1.1e-17565.96Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNALEGK+AK+SKS +T     +  K          SN+G KKSSTF +G EG+QTPRRSSRLS    ID   EGKN+K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +QTPR+SSRLSYAP IDN  +GKNTK+SKSSI       K  + PRL+N V G QSI KS+                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKGDVE------IEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R  +KKT LL+V NVV ++SSH ENVVE  +RRKGNSADHEG AT+G         +EKKSVA RKRKRED VVGIR+GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKGDVE------IEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ +KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFSS VERVVSPPVLKQVKNKALHEKYIDQLH REAKRKSM+KCTK CIS E
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLRAAKNALISDA+D IHQLQHLQAN M+++PD DDN  LYD+V+ ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

XP_023003751.1 uncharacterized protein LOC111497241 [Cucurbita maxima]5.9e-17465.07Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNA EGK+AK+SK+ +T     +  K          SN+G KKSSTF +G EG+QTPRRSSRLS    ID   EGKN+K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +QTPRRSSRLSYAP IDN  +GKNTK+ KSSIT      K  + PRL+N V G QSI KS+                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R L+KKT LL+V NVV +++SH EN+VE  +RRKGNSADHEG AT+G        E+EKKSVA RKRKRED VVGIR+GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ +KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN+N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFS+ VERVVSP VLKQVKNKALHEKYIDQLH REAKRKSM+KCTK CIS +
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLRAAKNALISDA+D IHQLQH+QAN ++++PDFDD  DLYD+V+ ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

XP_023517243.1 uncharacterized protein LOC111781067 [Cucurbita pepo subsp. pepo]2.0e-17766.31Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNALEGK+AK+SKS +T     +  K          SN+GLKKSSTF +G EG+QTPRRSSRLS    ID  LEGKN+K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +++PRRSSRLSYA  IDN L+GKNTK+SKSSITL     K  + PR++N V G QSI K +                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R  +KKT LL+V NVV ++SSH ENVVE  +RRKGNSADHEG AT+G        E+EKKSVA RKRKRED VVGIR GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ++KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFSS VERVVSPPVLKQVKNKALHEKYIDQLH REAKRKSM+KCTK CIS E
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLRAAKNALISDA+D IHQLQHLQAN M+++P+FDDN  LYD+V+ ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

TrEMBL top hitse value%identityAlignment
A0A0A0KIP3 Uncharacterized protein1.6e-14861.63Show/hide
Query:  QTPRRSSRLSHVLTIDCALEGKNAKL-AKSLVTLNTPKKPCKPNVVLSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSS
        Q P R    SH          KN  L     V +NTPK+  K  VV SK ++S+ GSK+ S FENG +  ++PRRS RLS AP IDN LEG+N K+SKSS
Subjt:  QTPRRSSRLSHVLTIDCALEGKNAKL-AKSLVTLNTPKKPCKPNVVLSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSS

Query:  IT-------LKNPIPKVRRPPRLSNGVGGLQSIGKS----------ERDLNKK---------TGLLHVHNVVMKKSSHGENVVERKRRKGNSADHEGTAT
        I+       LKNP P VRR PR SNGVGG +S G S          E+   K+         TG L   NV    SSHG+ V   +R+KGNSADHE  AT
Subjt:  IT-------LKNPIPKVRRPPRLSNGVGGLQSIGKS----------ERDLNKK---------TGLLHVHNVVMKKSSHGENVVERKRRKGNSADHEGTAT

Query:  KG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQI
        K       D E+EKKSV  RKRKRED VVGIR+GWTKEQE +LQRAYYAAKPTP FWKKVSKLV GKSAQ+CFDKVH DH+TPP PRPR RT++TK S +
Subjt:  KG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQI

Query:  DLLSLSESKLLNLDGAVSRKP--SRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRS
        +LL  SE +LLN+DGA SRKP    +KSHNAQK VRYLLEKNF GA+N EADLFS+LEPNINLSN TPLPSKQLSS  DLQG Q  +  +SLS HKKP S
Subjt:  DLLSLSESKLLNLDGAVSRKP--SRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRS

Query:  RFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKG-LKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFD
        RFS+ VERVVSPPVLKQVKN+ LHEKYIDQLHSREAKRKSMSKC K+CIS E G  K +HA  TNDLRAAKNALISDA+D I QLQHL+ NAM+N P F+
Subjt:  RFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKG-LKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFD

Query:  DNGDLYDDVEGENEDE
        D+ D  D+V+ ++ED+
Subjt:  DNGDLYDDVEGENEDE

A0A1S3B194 uncharacterized protein LOC1034849108.9e-15262.36Show/hide
Query:  NGVEGLQTP---RRSSRLSHVLTIDCALEGKN---AKLAKSLVTLNTPKKPCKPNVVLSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKL
        N +    TP   R  S +  V +   +LE  N    K  KS V +NTPK+  K  VV SK + S+ GSK+ S FEN  E    PRRS RLS AP IDN L
Subjt:  NGVEGLQTP---RRSSRLSHVLTIDCALEGKN---AKLAKSLVTLNTPKKPCKPNVVLSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKL

Query:  EGKNTKLSKSSIT-------LKNPIPKVRRPPRLSNGVGGLQSIGKS----------ERDLNKK---------TGLLHVHNVVMKKSSHGENVVERKRRK
        EG+NTK+SKSSI+       LKNP P VRR PR SNGVGG +SIGKS          E+   K+         TG L   NV    SSHGE V   +R++
Subjt:  EGKNTKLSKSSIT-------LKNPIPKVRRPPRLSNGVGGLQSIGKS----------ERDLNKK---------TGLLHVHNVVMKKSSHGENVVERKRRK

Query:  GNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPR
        GNSAD E  ATK       D E+EKKSV  RKRKRED VVGIR+GWTKEQE ALQRAYY AKPTPQFWKKVSKLV GKSAQ+CFDKVH DH+TPP PRPR
Subjt:  GNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPR

Query:  SRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLD
         RT++TKSS  +LL  SE +LLNLDGA SRKPSR+  KSHNAQK VRYLLEKNF GA+N EADLFS+LEPNINLSNHTPLPSKQLSS  DLQG Q  +  
Subjt:  SRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLD

Query:  KSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQA
        +SLS HKKP SRFS+ VERVVSPPVLKQVKN+ LHEKYIDQLH REAKRKSMSKC K+CIS E+G K +H   TNDLRAAKNALISDA+D I Q QHL+A
Subjt:  KSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQA

Query:  NAMSNAPDFDDNGDLYDDVEGENEDE
        NA +N PDF+D  D   +V+ +NED+
Subjt:  NAMSNAPDFDDNGDLYDDVEGENEDE

A0A5D3CMV2 Uncharacterized protein8.9e-15262.36Show/hide
Query:  NGVEGLQTP---RRSSRLSHVLTIDCALEGKN---AKLAKSLVTLNTPKKPCKPNVVLSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKL
        N +    TP   R  S +  V +   +LE  N    K  KS V +NTPK+  K  VV SK + S+ GSK+ S FEN  E    PRRS RLS AP IDN L
Subjt:  NGVEGLQTP---RRSSRLSHVLTIDCALEGKN---AKLAKSLVTLNTPKKPCKPNVVLSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKL

Query:  EGKNTKLSKSSIT-------LKNPIPKVRRPPRLSNGVGGLQSIGKS----------ERDLNKK---------TGLLHVHNVVMKKSSHGENVVERKRRK
        EG+NTK+SKSSI+       LKNP P VRR PR SNGVGG +SIGKS          E+   K+         TG L   NV    SSHGE V   +R++
Subjt:  EGKNTKLSKSSIT-------LKNPIPKVRRPPRLSNGVGGLQSIGKS----------ERDLNKK---------TGLLHVHNVVMKKSSHGENVVERKRRK

Query:  GNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPR
        GNSAD E  ATK       D E+EKKSV  RKRKRED VVGIR+GWTKEQE ALQRAYY AKPTPQFWKKVSKLV GKSAQ+CFDKVH DH+TPP PRPR
Subjt:  GNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPR

Query:  SRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLD
         RT++TKSS  +LL  SE +LLNLDGA SRKPSR+  KSHNAQK VRYLLEKNF GA+N EADLFS+LEPNINLSNHTPLPSKQLSS  DLQG Q  +  
Subjt:  SRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLD

Query:  KSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQA
        +SLS HKKP SRFS+ VERVVSPPVLKQVKN+ LHEKYIDQLH REAKRKSMSKC K+CIS E+G K +H   TNDLRAAKNALISDA+D I Q QHL+A
Subjt:  KSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQA

Query:  NAMSNAPDFDDNGDLYDDVEGENEDE
        NA +N PDF+D  D   +V+ +NED+
Subjt:  NAMSNAPDFDDNGDLYDDVEGENEDE

A0A6J1HK26 uncharacterized protein LOC1114636345.2e-17665.96Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNALEGK+AK+SKS +T     +  K          SN+G KKSSTF +G EG+QTPRRSSRLS    ID   EGKN+K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +QTPR+SSRLSYAP IDN  +GKNTK+SKSSI       K  + PRL+N V G QSI KS+                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKGDVE------IEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R  +KKT LL+V NVV ++SSH ENVVE  +RRKGNSADHEG AT+G         +EKKSVA RKRKRED VVGIR+GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKGDVE------IEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ +KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFSS VERVVSPPVLKQVKNKALHEKYIDQLH REAKRKSM+KCTK CIS E
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLRAAKNALISDA+D IHQLQHLQAN M+++PD DDN  LYD+V+ ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

A0A6J1KQ47 uncharacterized protein LOC1114972412.9e-17465.07Show/hide
Query:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV
        IDNA EGK+AK+SK+ +T     +  K          SN+G KKSSTF +G EG+QTPRRSSRLS    ID   EGKN+K++KS +T     K  K    
Subjt:  IDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV

Query:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------
             +SN GS++SSTFENG+E +QTPRRSSRLSYAP IDN  +GKNTK+ KSSIT      K  + PRL+N V G QSI KS+                
Subjt:  LSKKENSNLGSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSE----------------

Query:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP
           R L+KKT LL+V NVV +++SH EN+VE  +RRKGNSADHEG AT+G        E+EKKSVA RKRKRED VVGIR+GWTKEQEAALQRAYYAAKP
Subjt:  ---RDLNKKTGLLHVHNVVMKKSSHGENVVE-RKRRKGNSADHEGTATKG------DVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKP

Query:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD
        TPQFWKKVSKLV GKSAQ+CFDKVH DHLTPP PRPRSRTQ +KS QI+L SLSE KLLN +GA SRKP R+  +S NAQKTVRYLLEK F  A++ EAD
Subjt:  TPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRR--KSHNAQKTVRYLLEKNFHGALNCEAD

Query:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE
        LFS+LEPN+N SNH+PLPSKQLS TKDLQG Q  + ++SLS HKKP SRFS+ VERVVSP VLKQVKNKALHEKYIDQLH REAKRKSM+KCTK CIS +
Subjt:  LFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQ--VLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGE

Query:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI
        KGLK VHA  TNDLRAAKNALISDA+D IHQLQH+QAN ++++PDFDD  DLYD+V+ ENEDEI
Subjt:  KGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G53440.1 Homeodomain-like superfamily protein2.7e-4432.9Show/hide
Query:  QTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV--LSKK----ENSNLGSKRSSTFENGVEGIQTPRRSSRLS--YAPMIDN---KLE
        Q  RRS+RL  +   D        K  K L   + P K  +  V+  LSK+     +S +   RS       +G  + RRS RLS    P ++    K  
Subjt:  QTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV--LSKK----ENSNLGSKRSSTFENGVEGIQTPRRSSRLS--YAPMIDN---KLE

Query:  GKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSERDLNKKTG-----LLHVHNVVMKKSSHGEN------------------------------
          ++  +KSS T+ +    +RR PR S+G G  +S+ +S   + KK+G          ++  +K   G+                               
Subjt:  GKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSERDLNKKTG-----LLHVHNVVMKKSSHGEN------------------------------

Query:  -----VVERKRRKGNSADHEGTATKGDVEIEKKSVANRKRKREDCVVGIRK-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDK
             + ERKR +    + E  A     E+++K     + + ED   G++        GWT+E E ALQ AY   KP+P FWKKV+K+V GKSAQECFD+
Subjt:  -----VVERKRRKGNSADHEGTATKGDVEIEKKSVANRKRKREDCVVGIRK-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDK

Query:  VHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSST
        V+   +TP   +PR R   T  S I   SLS SKLL    +   K  +R+++ ++K VR+LLEK  H       DLFS LEPN   SN    P ++  S 
Subjt:  VHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSST

Query:  KDLQGIQVLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKD
                      I + P    S     +VSPPVLKQVKNKALHEKYID LH R+AKRK+ S      ++G++ ++ +     + +RAAK+AL  D +D
Subjt:  KDLQGIQVLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKD

Query:  VIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENED
         I +L+ L+A   S++ +F      YD VE + ED
Subjt:  VIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENED

AT3G53440.2 Homeodomain-like superfamily protein2.7e-4432.9Show/hide
Query:  QTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV--LSKK----ENSNLGSKRSSTFENGVEGIQTPRRSSRLS--YAPMIDN---KLE
        Q  RRS+RL  +   D        K  K L   + P K  +  V+  LSK+     +S +   RS       +G  + RRS RLS    P ++    K  
Subjt:  QTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVV--LSKK----ENSNLGSKRSSTFENGVEGIQTPRRSSRLS--YAPMIDN---KLE

Query:  GKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSERDLNKKTG-----LLHVHNVVMKKSSHGEN------------------------------
          ++  +KSS T+ +    +RR PR S+G G  +S+ +S   + KK+G          ++  +K   G+                               
Subjt:  GKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSERDLNKKTG-----LLHVHNVVMKKSSHGEN------------------------------

Query:  -----VVERKRRKGNSADHEGTATKGDVEIEKKSVANRKRKREDCVVGIRK-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDK
             + ERKR +    + E  A     E+++K     + + ED   G++        GWT+E E ALQ AY   KP+P FWKKV+K+V GKSAQECFD+
Subjt:  -----VVERKRRKGNSADHEGTATKGDVEIEKKSVANRKRKREDCVVGIRK-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDK

Query:  VHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSST
        V+   +TP   +PR R   T  S I   SLS SKLL    +   K  +R+++ ++K VR+LLEK  H       DLFS LEPN   SN    P ++  S 
Subjt:  VHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSESKLLNLDGAVSRKPSRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSST

Query:  KDLQGIQVLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKD
                      I + P    S     +VSPPVLKQVKNKALHEKYID LH R+AKRK+ S      ++G++ ++ +     + +RAAK+AL  D +D
Subjt:  KDLQGIQVLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNKALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKD

Query:  VIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENED
         I +L+ L+A   S++ +F      YD VE + ED
Subjt:  VIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGACAATGCGCTCGAGGGGAAGAACGCTAAGCTTTCTAAGAGTTTACTTACTTTGAATACCCCGAAGAAACCGAGCAAACCTAATGCTGTTTTTAGTAAGAAGGA
AGATTCTAATTCGGGGTTGAAAAAGTCTTCGACATTTGGAAATGGGGTTGAGGGTTTACAAACTCCAAGACGATCCTCTAGGTTATCCCATGTGCTGACAATTGACTGTG
CGCTTGAGGGGAAGAATGCTAAGCTTGCAAAGAGTTTGGTTACTTTGAATACTCCAAAGAAACCGTGTAAACCTAATGTTGTTTTGAGTAAAAAGGAAAATTCAAATTTG
GGGTCGAAAAGGTCTTCGACATTTGAAAATGGGGTTGAGGGTATACAAACTCCAAGGCGGTCCTCTAGGTTATCCTATGCGCCGATGATTGACAATAAACTCGAGGGGAA
GAACACTAAGCTTTCCAAGAGTTCAATTACTTTGAAGAATCCAATCCCTAAGGTAAGAAGACCTCCAAGACTCAGTAATGGAGTTGGGGGGCTTCAAAGCATTGGAAAAT
CCGAAAGGGATTTGAATAAGAAAACAGGGCTATTACATGTTCATAATGTTGTTATGAAGAAGAGCTCCCACGGGGAAAATGTGGTTGAACGAAAGAGGAGGAAAGGGAAT
TCTGCTGATCATGAGGGTACTGCAACAAAAGGTGATGTAGAAATCGAAAAGAAGTCAGTGGCTAATAGGAAAAGAAAGCGAGAGGATTGTGTTGTTGGGATTAGAAAAGG
GTGGACTAAAGAACAGGAAGCTGCATTACAGAGAGCTTATTATGCTGCAAAGCCCACTCCCCAATTTTGGAAGAAGGTTTCCAAACTGGTGTCTGGAAAGTCCGCTCAAG
AATGCTTTGATAAAGTTCATTTCGACCATTTAACCCCTCCTCCACCTCGACCTCGATCCCGAACGCAGAACACAAAATCATCCCAAATTGATCTTTTGTCTCTTTCAGAG
AGTAAACTTCTAAATCTTGATGGTGCTGTGTCGAGAAAGCCTAGTCGTAGGAAAAGCCATAATGCGCAGAAAACTGTGCGATATTTGTTAGAGAAGAATTTTCACGGGGC
GCTTAACTGTGAGGCTGATTTGTTCTCAAAACTCGAGCCGAATATTAATCTCTCTAACCATACCCCTCTACCAAGTAAACAACTCTCTAGCACTAAGGATTTACAGGGAA
TTCAAGTATTGGATAAATCCTTGTCCATTCACAAGAAGCCCCGTTCGAGATTTAGCAGCCCTGTCGAGAGAGTCGTTAGTCCTCCAGTACTGAAACAGGTGAAGAACAAG
GCCTTGCACGAGAAGTACATCGACCAGTTACATAGTAGGGAAGCAAAGAGAAAATCAATGTCGAAATGTACAAAAAATTGCATCTCGGGCGAGAAGGGCTTGAAGGTAGT
TCATGCTGCTGGAACTAATGATCTTAGAGCTGCTAAAAATGCCTTGATTTCTGATGCAAAAGATGTCATCCATCAGTTACAACACTTGCAAGCCAATGCCATGAGCAATG
CTCCCGATTTCGATGACAATGGGGATTTATACGACGATGTTGAAGGCGAAAATGAAGATGAAATATGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGACAATGCGCTCGAGGGGAAGAACGCTAAGCTTTCTAAGAGTTTACTTACTTTGAATACCCCGAAGAAACCGAGCAAACCTAATGCTGTTTTTAGTAAGAAGGA
AGATTCTAATTCGGGGTTGAAAAAGTCTTCGACATTTGGAAATGGGGTTGAGGGTTTACAAACTCCAAGACGATCCTCTAGGTTATCCCATGTGCTGACAATTGACTGTG
CGCTTGAGGGGAAGAATGCTAAGCTTGCAAAGAGTTTGGTTACTTTGAATACTCCAAAGAAACCGTGTAAACCTAATGTTGTTTTGAGTAAAAAGGAAAATTCAAATTTG
GGGTCGAAAAGGTCTTCGACATTTGAAAATGGGGTTGAGGGTATACAAACTCCAAGGCGGTCCTCTAGGTTATCCTATGCGCCGATGATTGACAATAAACTCGAGGGGAA
GAACACTAAGCTTTCCAAGAGTTCAATTACTTTGAAGAATCCAATCCCTAAGGTAAGAAGACCTCCAAGACTCAGTAATGGAGTTGGGGGGCTTCAAAGCATTGGAAAAT
CCGAAAGGGATTTGAATAAGAAAACAGGGCTATTACATGTTCATAATGTTGTTATGAAGAAGAGCTCCCACGGGGAAAATGTGGTTGAACGAAAGAGGAGGAAAGGGAAT
TCTGCTGATCATGAGGGTACTGCAACAAAAGGTGATGTAGAAATCGAAAAGAAGTCAGTGGCTAATAGGAAAAGAAAGCGAGAGGATTGTGTTGTTGGGATTAGAAAAGG
GTGGACTAAAGAACAGGAAGCTGCATTACAGAGAGCTTATTATGCTGCAAAGCCCACTCCCCAATTTTGGAAGAAGGTTTCCAAACTGGTGTCTGGAAAGTCCGCTCAAG
AATGCTTTGATAAAGTTCATTTCGACCATTTAACCCCTCCTCCACCTCGACCTCGATCCCGAACGCAGAACACAAAATCATCCCAAATTGATCTTTTGTCTCTTTCAGAG
AGTAAACTTCTAAATCTTGATGGTGCTGTGTCGAGAAAGCCTAGTCGTAGGAAAAGCCATAATGCGCAGAAAACTGTGCGATATTTGTTAGAGAAGAATTTTCACGGGGC
GCTTAACTGTGAGGCTGATTTGTTCTCAAAACTCGAGCCGAATATTAATCTCTCTAACCATACCCCTCTACCAAGTAAACAACTCTCTAGCACTAAGGATTTACAGGGAA
TTCAAGTATTGGATAAATCCTTGTCCATTCACAAGAAGCCCCGTTCGAGATTTAGCAGCCCTGTCGAGAGAGTCGTTAGTCCTCCAGTACTGAAACAGGTGAAGAACAAG
GCCTTGCACGAGAAGTACATCGACCAGTTACATAGTAGGGAAGCAAAGAGAAAATCAATGTCGAAATGTACAAAAAATTGCATCTCGGGCGAGAAGGGCTTGAAGGTAGT
TCATGCTGCTGGAACTAATGATCTTAGAGCTGCTAAAAATGCCTTGATTTCTGATGCAAAAGATGTCATCCATCAGTTACAACACTTGCAAGCCAATGCCATGAGCAATG
CTCCCGATTTCGATGACAATGGGGATTTATACGACGATGTTGAAGGCGAAAATGAAGATGAAATATGA
Protein sequenceShow/hide protein sequence
MIDNALEGKNAKLSKSLLTLNTPKKPSKPNAVFSKKEDSNSGLKKSSTFGNGVEGLQTPRRSSRLSHVLTIDCALEGKNAKLAKSLVTLNTPKKPCKPNVVLSKKENSNL
GSKRSSTFENGVEGIQTPRRSSRLSYAPMIDNKLEGKNTKLSKSSITLKNPIPKVRRPPRLSNGVGGLQSIGKSERDLNKKTGLLHVHNVVMKKSSHGENVVERKRRKGN
SADHEGTATKGDVEIEKKSVANRKRKREDCVVGIRKGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVSGKSAQECFDKVHFDHLTPPPPRPRSRTQNTKSSQIDLLSLSE
SKLLNLDGAVSRKPSRRKSHNAQKTVRYLLEKNFHGALNCEADLFSKLEPNINLSNHTPLPSKQLSSTKDLQGIQVLDKSLSIHKKPRSRFSSPVERVVSPPVLKQVKNK
ALHEKYIDQLHSREAKRKSMSKCTKNCISGEKGLKVVHAAGTNDLRAAKNALISDAKDVIHQLQHLQANAMSNAPDFDDNGDLYDDVEGENEDEI