; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010349 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010349
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold878:388157..393864
RNA-Seq ExpressionMS010349
SyntenyMS010349
Gene Ontology termsNA
InterPro domainsIPR038745 - AT4G37440-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133783.1 uncharacterized protein LOC101222847 isoform X1 [Cucumis sativus]6.1e-18673.8Show/hide
Query:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL
        K KEE LMEVSR +ED+NA Q+++N     ASS Q+ I D+EA S  R++ML+ S++ME+DVIGCS+NC+ GP+ ECNVSTENSSSFGDTVSGTDYGLLL
Subjt:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL

Query:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR
        DDEEVESQLYG +NL+  SNGY EVFPRKKKLT HWRKFISP+MWRCRWLE+QIKKLQ+Q+ KYDRELALYDQRKQS Y +FS +GF VKS GFS+HTQR
Subjt:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR

Query:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM
        HR MKRK RK  EETTD ASYM HHN+FSYYEKKRSLADDM+LEDTFLKLDKT+N++R DIN FGT AT+GW  SMLG NDN LEDIFLKIEAAQSKVH 
Subjt:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM

Query:  LKNRIDKVVNENPMKFASINQL-NFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVK
        LKNRIDKVVNENPMKF+ INQL +   SSDDP+SP DGND LVRSLHEASQH+SEHA  DVLMPE+A K HGEV+LLPDM++S DCG+T+KV MQ+ AVK
Subjt:  LKNRIDKVVNENPMKFASINQL-NFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVK

Query:  EELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKAT
        EELQLS+EVKGQ +E  Q  EEQK+I  AA+S+ADL SK+ EP++ H TK  SA KPN SKKT+KRGRRK GSSK+N+KAT
Subjt:  EELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKAT

XP_008437823.1 PREDICTED: uncharacterized protein LOC103483139 [Cucumis melo]5.2e-18573.8Show/hide
Query:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL
        K KEEALMEVSR +ED NA ++++N     ASS Q+ I D+EA S  R++MLD S++ME+D+IGCS+NC+ GP+ ECNV TENSSSFGDTVSGTDYGLLL
Subjt:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL

Query:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR
        DDEEVESQLYG +NL+  SNGY EVFPRKKKLT HWRKFISP+MWRCRWLE+QIKKLQ+Q+ KYDRELALYDQRKQS Y +FS +GF VKS GFS+HTQR
Subjt:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR

Query:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM
        HR MKRK RKK EETTDVASYM HHN+FSYYEKKRSLADDM+LEDTFLKLDKT+N++R DIN  GT AT+GW  SMLG NDN LEDIFLKIEAAQSKVH 
Subjt:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM

Query:  LKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKE
        LKNRIDKVVNENPMKF++INQL  L SSDDP+SPEDGND LVRSLHEASQH+SEHA  DVLMPE+A K HGEV+LLPDM QS DCG+T+KV MQ+ AVKE
Subjt:  LKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKE

Query:  ELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
        ELQLS+E K Q IE  Q  EEQK++  AA+S+AD +SK+ EP++ H  K  SA KPN SKKT+KRGRRK GSSK+N+KATG
Subjt:  ELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG

XP_022145458.1 uncharacterized protein LOC111014899 [Momordica charantia]1.6e-27199.59Show/hide
Query:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
        MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
Subjt:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG

Query:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
        TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
Subjt:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG

Query:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
        FSSHTQRHR MKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
Subjt:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA

Query:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
         QSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
Subjt:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM

Query:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
        QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
Subjt:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG

XP_023527403.1 uncharacterized protein LOC111790643 [Cucurbita pepo subsp. pepo]1.0e-18074.23Show/hide
Query:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
        MG E   KVKEEALMEV   +ED+NAAQ+     +QSASS Q+ I DMEA S  RT+MLD S++MELD+IGC+D C+ GP+ E NVSTENSSSFGDT+SG
Subjt:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG

Query:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
        TDYGLLLDDEEVES LYG +NL+ MS+GY +VFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQ+ KYDRELAL+DQRKQSVY +FS EG DVKS G
Subjt:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG

Query:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
        FSSHTQRHR MKRKRRKKTEETT+VASYM HHNLFSYYEKKRSLADDM+LEDTF KLDKT+NM+R DIN FG  AT+GW P MLG NDN LEDIFLKIE 
Subjt:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA

Query:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
        AQSKVH LKNRI+KVV ENPMKF+SINQL FL SSDDP+SPEDGN  LVRSLHEASQ ISEHA  DVLMPE+A K HGEV+LLP M+Q+ DCG T+KV M
Subjt:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM

Query:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKP-NPSKKTKKR-GRRKTGSSKRNQKAT
        Q+ AVKEELQ S +VKG  IE PQ+L EQK       SEADL SKN EPN+   TKP S  KP + SKKT+KR GRRKTGSSK+ +KAT
Subjt:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKP-NPSKKTKKR-GRRKTGSSKRNQKAT

XP_038886802.1 uncharacterized protein LOC120076910 [Benincasa hispida]2.1e-19476.23Show/hide
Query:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
        MG E   KVKEE LMEVSR +ED NAAQ++     QSA SCQD I D+EA S   ++MLD S++MELD+IGC++NC+ GP+ ECNVSTE SSSFGDTVSG
Subjt:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG

Query:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
        TDYGLLLDDEEVESQLYG +NL+ MSNGY EVFPRKKKLT HWRKFI PLMWRCRWLELQIKKLQSQ+ KYDRELALYDQRKQSVY +FS EGFDVKS G
Subjt:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG

Query:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
        FSSHTQRHR MKRK RKK EETTD+ASYM HHNLFSYYEKKRSLADDM+LEDTF KLDKT+N++R DIN FGT AT+GW  SMLG +DN L+D+FLKIEA
Subjt:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA

Query:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
        AQSKVH LKNRIDKVVNENPMKF++INQL  L SSDDP+SPEDGND LVRSLHEASQHISEHA  DVLMPE+A K HGEV+LLPDM QSADCG+T KV  
Subjt:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM

Query:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
        Q+ AVKEELQLSE VKGQ IE  Q+LEEQK I  AAVS++DL S + EPN  H TK  SAAKPN SK+T+KRGRRK GSSK+N+KATG
Subjt:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG

TrEMBL top hitse value%identityAlignment
A0A0A0L704 Uncharacterized protein3.0e-18673.8Show/hide
Query:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL
        K KEE LMEVSR +ED+NA Q+++N     ASS Q+ I D+EA S  R++ML+ S++ME+DVIGCS+NC+ GP+ ECNVSTENSSSFGDTVSGTDYGLLL
Subjt:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL

Query:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR
        DDEEVESQLYG +NL+  SNGY EVFPRKKKLT HWRKFISP+MWRCRWLE+QIKKLQ+Q+ KYDRELALYDQRKQS Y +FS +GF VKS GFS+HTQR
Subjt:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR

Query:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM
        HR MKRK RK  EETTD ASYM HHN+FSYYEKKRSLADDM+LEDTFLKLDKT+N++R DIN FGT AT+GW  SMLG NDN LEDIFLKIEAAQSKVH 
Subjt:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM

Query:  LKNRIDKVVNENPMKFASINQL-NFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVK
        LKNRIDKVVNENPMKF+ INQL +   SSDDP+SP DGND LVRSLHEASQH+SEHA  DVLMPE+A K HGEV+LLPDM++S DCG+T+KV MQ+ AVK
Subjt:  LKNRIDKVVNENPMKFASINQL-NFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVK

Query:  EELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKAT
        EELQLS+EVKGQ +E  Q  EEQK+I  AA+S+ADL SK+ EP++ H TK  SA KPN SKKT+KRGRRK GSSK+N+KAT
Subjt:  EELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKAT

A0A1S3AUK2 uncharacterized protein LOC1034831392.5e-18573.8Show/hide
Query:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL
        K KEEALMEVSR +ED NA ++++N     ASS Q+ I D+EA S  R++MLD S++ME+D+IGCS+NC+ GP+ ECNV TENSSSFGDTVSGTDYGLLL
Subjt:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL

Query:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR
        DDEEVESQLYG +NL+  SNGY EVFPRKKKLT HWRKFISP+MWRCRWLE+QIKKLQ+Q+ KYDRELALYDQRKQS Y +FS +GF VKS GFS+HTQR
Subjt:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR

Query:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM
        HR MKRK RKK EETTDVASYM HHN+FSYYEKKRSLADDM+LEDTFLKLDKT+N++R DIN  GT AT+GW  SMLG NDN LEDIFLKIEAAQSKVH 
Subjt:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM

Query:  LKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKE
        LKNRIDKVVNENPMKF++INQL  L SSDDP+SPEDGND LVRSLHEASQH+SEHA  DVLMPE+A K HGEV+LLPDM QS DCG+T+KV MQ+ AVKE
Subjt:  LKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKE

Query:  ELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
        ELQLS+E K Q IE  Q  EEQK++  AA+S+AD +SK+ EP++ H  K  SA KPN SKKT+KRGRRK GSSK+N+KATG
Subjt:  ELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG

A0A5A7U0V8 Uncharacterized protein2.5e-18573.8Show/hide
Query:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL
        K KEEALMEVSR +ED NA ++++N     ASS Q+ I D+EA S  R++MLD S++ME+D+IGCS+NC+ GP+ ECNV TENSSSFGDTVSGTDYGLLL
Subjt:  KVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLL

Query:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR
        DDEEVESQLYG +NL+  SNGY EVFPRKKKLT HWRKFISP+MWRCRWLE+QIKKLQ+Q+ KYDRELALYDQRKQS Y +FS +GF VKS GFS+HTQR
Subjt:  DDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQR

Query:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM
        HR MKRK RKK EETTDVASYM HHN+FSYYEKKRSLADDM+LEDTFLKLDKT+N++R DIN  GT AT+GW  SMLG NDN LEDIFLKIEAAQSKVH 
Subjt:  HRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHM

Query:  LKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKE
        LKNRIDKVVNENPMKF++INQL  L SSDDP+SPEDGND LVRSLHEASQH+SEHA  DVLMPE+A K HGEV+LLPDM QS DCG+T+KV MQ+ AVKE
Subjt:  LKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKE

Query:  ELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
        ELQLS+E K Q IE  Q  EEQK++  AA+S+AD +SK+ EP++ H  K  SA KPN SKKT+KRGRRK GSSK+N+KATG
Subjt:  ELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG

A0A6J1CVZ4 uncharacterized protein LOC1110148997.9e-27299.59Show/hide
Query:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
        MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
Subjt:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG

Query:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
        TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
Subjt:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG

Query:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
        FSSHTQRHR MKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
Subjt:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA

Query:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
         QSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
Subjt:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM

Query:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
        QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG
Subjt:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG

A0A6J1IQ89 uncharacterized protein LOC1114790281.6e-17973.27Show/hide
Query:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG
        MG E   KVKEEA MEV   +ED+NAAQ+     +QSASS Q+ I DMEA S  RT+MLD S++MELD+IGC+D C+ GP+ E NVSTENSSSFGDT+SG
Subjt:  MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSG

Query:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG
        TDYGLLLDDEEVES LYG +NL+ MS+GY EVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQ+ KYDRELAL+DQRKQSVY +FS  G DVKS G
Subjt:  TDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIG

Query:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA
        FSSHTQRHR MKRKRRKKTEETT+VASYM HHNLFSYYEKKRSLADDM+LED F KLDKT+NM+R DIN FG  A +GW PSMLG NDN LE IFLKIEA
Subjt:  FSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA

Query:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM
        AQSKVH LKNRI+KVV ENPMKF+SINQL FL SSDDP+SPEDGN  LVRSLHEAS+ IS+HA  DVLMPE+A K HGEV+LLP  +Q+ DCG T+KV M
Subjt:  AQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQM

Query:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKP-NPSKKTKKR-GRRKTGSSKRNQKATG
        Q+ AVKEELQ S +VK + +E PQ+L EQK I     S+ADL SKN EPN+ H TKP SA KP + SKKT+KR GRRKTGSSK+ +KATG
Subjt:  QNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKP-NPSKKTKKR-GRRKTGSSKRNQKATG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G50040.1 unknown protein1.9e-2833.46Show/hide
Query:  LDVIGCSDNCDEGPNGECNVSTENSSSFGDTV---SGTDYGLLLDDEEVESQLYGGDNL-RRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIK
        +D++   D+ +E    E      +SSSFGD++    G D+G     +E +S L     L     +G   +   KKK    WR+   P+MWRC+W+EL++K
Subjt:  LDVIGCSDNCDEGPNGECNVSTENSSSFGDTV---SGTDYGLLLDDEEVESQLYGGDNL-RRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIK

Query:  KLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKN
        ++QSQA  Y++E+  Y   KQ       +EGFD KSI F  + QR    KR RRK+ EETTDVA+YM +HNLFSY +K+      + ++  +L  D    
Subjt:  KLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKN

Query:  MRRYDINYFGTNATEGWEP----SMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVV-NENPMKFASINQL
           +      T   +  E     S L  +D++L     KI+ AQ K   L+ R+D+++ +  P   +S+ Q+
Subjt:  MRRYDINYFGTNATEGWEP----SMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVV-NENPMKFASINQL

AT3G59670.1 unknown protein3.9e-4533.16Show/hide
Query:  EDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDN----CDEGPNGECNVSTENSSSFGDTVSGTDYGLLLD----DEEVE
        ++++   E K   L SA++ ++ +             + G + +++D++   +N     DE PN     +TE SSSF DT S  +  +LLD    + EVE
Subjt:  EDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDN----CDEGPNGECNVSTENSSSFGDTVSGTDYGLLLD----DEEVE

Query:  SQLYGGDNLRRMSNGYREVFP-RKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSME---GFDVKSIGFSSHTQRHR
        S  +   +L    + +  +F  RKK+LT HWR+FI PLMWR +W+EL+I++L+S+A +Y +EL LYDQ K     + S+    G  +KS+ FS+   + R
Subjt:  SQLYGGDNLRRMSNGYREVFP-RKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSME---GFDVKSIGFSSHTQRHR

Query:  -TMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHML
           KR++RKK E T D+ASYM  HNLFSY E KR  +D M L D F      ++     ++    ++            D++LE++  KIE   S+VH L
Subjt:  -TMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHML

Query:  KNRIDKVVNENPMKFASINQLNFLESSDDPSS--PEDGNDALVR--SLHEASQHISEHAFGDVLM-PESANKNHGEVILLPDMIQS
        K ++D V+++N  +F+S   L+ L +S  PS      GN  ++   +++ ASQH++++  GD++   E    ++G+   +PD+I+S
Subjt:  KNRIDKVVNENPMKFASINQLNFLESSDDPSS--PEDGNDALVR--SLHEASQHISEHAFGDVLM-PESANKNHGEVILLPDMIQS

AT4G37440.1 unknown protein2.2e-4034.68Show/hide
Query:  DNMELDVIGCSDN-------CDEGPNGECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISP-LMWRC
        D  E+D++ C+DN       CD+G +G        SSSFG T S  +     +D+EV+S +    +L         ++ RK+KLT HWR+F+ P LMWRC
Subjt:  DNMELDVIGCSDN-------CDEGPNGECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISP-LMWRC

Query:  RWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSI-GFSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDT
        +W+EL+ K+LQ+QA KYD+E+  Y Q K+    N   E   VK++     +TQ+ R MKRK RK+ EET DV SY  +HNLFSYY+ ++SLA D+AL D 
Subjt:  RWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSI-GFSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDT

Query:  FLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLH
           LDK KN    D   F    +E   P      D  LE I LKIEAA+S+   LK R+DKV++ENP  F   N +N L ++D  +S E     L     
Subjt:  FLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLH

Query:  EASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKEELQLSEEVKGQSIEQPQEL--EEQKTI---PPAAVSEADLASKNTE
        +    ISE      +   S + +H    + P+  ++ D             +  E+  S+  +G+SI   + L   EQ +I   P   V +     +N E
Subjt:  EASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKEELQLSEEVKGQSIEQPQEL--EEQKTI---PPAAVSEADLASKNTE

Query:  PNLQHDTKP----LSAAKPNP--------SKKTKKRGRRKTGSS
           + ++ P    +S  KP          S + +KRG+R++GS+
Subjt:  PNLQHDTKP----LSAAKPNP--------SKKTKKRGRRKTGSS

AT4G37440.2 unknown protein3.3e-4436.91Show/hide
Query:  DNMELDVIGCSDN-------CDEGPNGECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISP-LMWRC
        D  E+D++ C+DN       CD+G +G        SSSFG T S  +     +D+EV+S +    +L         ++ RK+KLT HWR+F+ P LMWRC
Subjt:  DNMELDVIGCSDN-------CDEGPNGECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISP-LMWRC

Query:  RWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSI-GFSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDT
        +W+EL+ K+LQ+QA KYD+E+  Y Q K+    N   E   VK++     +TQ+ R MKRK RK+ EET DV SY  +HNLFSYY+ ++SLA D+AL D 
Subjt:  RWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSI-GFSSHTQRHRTMKRKRRKKTEETTDVASYMGHHNLFSYYEKKRSLADDMALEDT

Query:  FLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDAL-----
           LDK KN    D   F    +E   P      D  LE I LKIEAA+S+   LK R+DKV++ENP  F   N +N L ++D  +S E     L     
Subjt:  FLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDAL-----

Query:  -----------VRSLHEASQHIS--EHAFGDVLMPE-SANKNHGEVILLPDM------IQSADCGSTRKVQMQNCAVKEELQ
                   V+S   +S H+S  +    D+L+ E  A+K      ++PD         S + G +R V+ ++ A+ + L+
Subjt:  -----------VRSLHEASQHIS--EHAFGDVLMPE-SANKNHGEVILLPDM------IQSADCGSTRKVQMQNCAVKEELQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACCTGAGATTGTGCCTAAAGTGAAGGAGGAAGCTTTGATGGAGGTTTCGAGGGGTATGGAGGACAAAAATGCTGCTCAGGAGGAAAAAAATAACTTTTTGCAGAG
TGCAAGTAGTTGTCAGGACAAAATTCTTGACATGGAAGCTATATCTGTTGGACGAACTGTCATGTTAGATGGAAGTGACAATATGGAGCTTGATGTTATTGGGTGTTCAG
ATAATTGTGATGAAGGTCCTAATGGGGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGAGATACTGTATCTGGGACAGATTATGGCTTGTTATTGGATGACGAA
GAAGTTGAATCCCAATTATATGGTGGTGATAACTTGCGGCGCATGTCCAATGGATACAGAGAAGTGTTTCCTAGGAAGAAAAAGTTGACAGTTCATTGGAGAAAGTTTAT
AAGTCCCCTTATGTGGCGCTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAAGCGTTCAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGT
CCGTCTACGGAAACTTCTCAATGGAAGGTTTTGACGTGAAGTCAATAGGATTCTCAAGTCACACTCAAAGACACAGGACTATGAAAAGAAAGAGAAGGAAGAAAACTGAA
GAGACAACTGACGTAGCTTCATATATGGGACACCATAATCTATTCTCCTATTATGAGAAGAAGAGGTCTCTTGCTGACGACATGGCTTTGGAAGATACTTTTCTTAAACT
AGACAAGACGAAGAATATGAGACGTTATGACATCAATTACTTTGGGACAAATGCAACGGAAGGATGGGAACCTTCTATGTTGGGAGGTAACGATAACATTTTGGAAGACA
TCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAGTTCACATGTTGAAGAACAGAATTGACAAGGTGGTGAACGAAAATCCCATGAAGTTTGCTTCAATCAATCAGCTAAAC
TTTCTTGAATCAAGCGATGATCCCTCTTCACCTGAAGATGGAAATGATGCCTTAGTAAGGTCTTTGCATGAAGCATCACAACACATATCTGAGCATGCGTTTGGAGATGT
ACTTATGCCTGAAAGCGCAAATAAAAATCACGGAGAGGTCATTCTACTTCCTGATATGATTCAGAGTGCTGATTGTGGAAGTACTCGGAAAGTTCAGATGCAAAATTGTG
CAGTCAAGGAAGAGTTGCAACTTTCCGAAGAGGTTAAAGGTCAGTCTATTGAACAGCCTCAGGAATTGGAGGAGCAGAAAACCATTCCTCCAGCTGCAGTTTCTGAAGCT
GACTTAGCCTCAAAGAACACTGAGCCCAACTTGCAACACGATACAAAACCGCTTTCTGCTGCCAAACCCAATCCATCAAAGAAAACAAAAAAGCGGGGAAGGCGAAAAAC
AGGTTCAAGTAAGCGGAATCAGAAAGCAACAGGT
mRNA sequenceShow/hide mRNA sequence
ATGGGACCTGAGATTGTGCCTAAAGTGAAGGAGGAAGCTTTGATGGAGGTTTCGAGGGGTATGGAGGACAAAAATGCTGCTCAGGAGGAAAAAAATAACTTTTTGCAGAG
TGCAAGTAGTTGTCAGGACAAAATTCTTGACATGGAAGCTATATCTGTTGGACGAACTGTCATGTTAGATGGAAGTGACAATATGGAGCTTGATGTTATTGGGTGTTCAG
ATAATTGTGATGAAGGTCCTAATGGGGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGAGATACTGTATCTGGGACAGATTATGGCTTGTTATTGGATGACGAA
GAAGTTGAATCCCAATTATATGGTGGTGATAACTTGCGGCGCATGTCCAATGGATACAGAGAAGTGTTTCCTAGGAAGAAAAAGTTGACAGTTCATTGGAGAAAGTTTAT
AAGTCCCCTTATGTGGCGCTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAAGCGTTCAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGT
CCGTCTACGGAAACTTCTCAATGGAAGGTTTTGACGTGAAGTCAATAGGATTCTCAAGTCACACTCAAAGACACAGGACTATGAAAAGAAAGAGAAGGAAGAAAACTGAA
GAGACAACTGACGTAGCTTCATATATGGGACACCATAATCTATTCTCCTATTATGAGAAGAAGAGGTCTCTTGCTGACGACATGGCTTTGGAAGATACTTTTCTTAAACT
AGACAAGACGAAGAATATGAGACGTTATGACATCAATTACTTTGGGACAAATGCAACGGAAGGATGGGAACCTTCTATGTTGGGAGGTAACGATAACATTTTGGAAGACA
TCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAGTTCACATGTTGAAGAACAGAATTGACAAGGTGGTGAACGAAAATCCCATGAAGTTTGCTTCAATCAATCAGCTAAAC
TTTCTTGAATCAAGCGATGATCCCTCTTCACCTGAAGATGGAAATGATGCCTTAGTAAGGTCTTTGCATGAAGCATCACAACACATATCTGAGCATGCGTTTGGAGATGT
ACTTATGCCTGAAAGCGCAAATAAAAATCACGGAGAGGTCATTCTACTTCCTGATATGATTCAGAGTGCTGATTGTGGAAGTACTCGGAAAGTTCAGATGCAAAATTGTG
CAGTCAAGGAAGAGTTGCAACTTTCCGAAGAGGTTAAAGGTCAGTCTATTGAACAGCCTCAGGAATTGGAGGAGCAGAAAACCATTCCTCCAGCTGCAGTTTCTGAAGCT
GACTTAGCCTCAAAGAACACTGAGCCCAACTTGCAACACGATACAAAACCGCTTTCTGCTGCCAAACCCAATCCATCAAAGAAAACAAAAAAGCGGGGAAGGCGAAAAAC
AGGTTCAAGTAAGCGGAATCAGAAAGCAACAGGT
Protein sequenceShow/hide protein sequence
MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLDGSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLLDDE
EVESQLYGGDNLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQRKQSVYGNFSMEGFDVKSIGFSSHTQRHRTMKRKRRKKTE
ETTDVASYMGHHNLFSYYEKKRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEAAQSKVHMLKNRIDKVVNENPMKFASINQLN
FLESSDDPSSPEDGNDALVRSLHEASQHISEHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKEELQLSEEVKGQSIEQPQELEEQKTIPPAAVSEA
DLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSSKRNQKATG