; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi02G001036 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi02G001036
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationchr2:29451338..29455683
RNA-Seq ExpressionBhi02G001036
SyntenyBhi02G001036
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062546.1 ankyrin-2-like [Cucumis melo var. makuwa]3.3e-9597.27Show/hide
Query:  MPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCG
        MPSSLSSSSSCPS HES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEE GHKLCG
Subjt:  MPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCG

Query:  TLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        TLPALDLYFAVNRQLSPSPVSTPQSS STSSLG RSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  TLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

XP_004142530.1 uncharacterized protein LOC101212719 [Cucumis sativus]1.1e-9594.79Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKKHLQR+PSSLSSSSSCPS HESEDELKHMPLAPP+LKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKG IELGFAFNE
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTS-TSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        E GHKLCGTLPALDLYFAVNRQLSPSPVSTPQSS S +SSLG RSSSFESPRSE DTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTS-TSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

XP_008462705.1 PREDICTED: uncharacterized protein LOC103501007 [Cucumis melo]4.5e-10097.38Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKKHLQRMPSSLSSSSSCPS HES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        E GHKLCGTLPALDLYFAVNRQLSPSPVSTPQSS STSSLG RSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

XP_023533045.1 uncharacterized protein LOC111795047 [Cucurbita pepo subsp. pepo]8.7e-9694.21Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKK LQRMPSSLSSSSSCPS HESEDELKHMPLAPPR KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRN STDRDDLTDEDWNELKGCIELGFAFNE
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE
        EDGHKLCGTLPALDLYFAVNRQLSPSPVSTP SSTS+SSLG RSSSFESP+SE +TW+VCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE

XP_038878789.1 uncharacterized protein LOC120070939 [Benincasa hispida]1.5e-103100Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

TrEMBL top hitse value%identityAlignment
A0A0A0LY38 Uncharacterized protein5.5e-9694.79Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKKHLQR+PSSLSSSSSCPS HESEDELKHMPLAPP+LKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKG IELGFAFNE
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTS-TSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        E GHKLCGTLPALDLYFAVNRQLSPSPVSTPQSS S +SSLG RSSSFESPRSE DTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTS-TSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

A0A1S3CHL5 uncharacterized protein LOC1035010072.2e-10097.38Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKKHLQRMPSSLSSSSSCPS HES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        E GHKLCGTLPALDLYFAVNRQLSPSPVSTPQSS STSSLG RSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

A0A5A7V807 Ankyrin-2-like1.6e-9597.27Show/hide
Query:  MPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCG
        MPSSLSSSSSCPS HES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEE GHKLCG
Subjt:  MPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCG

Query:  TLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
        TLPALDLYFAVNRQLSPSPVSTPQSS STSSLG RSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK
Subjt:  TLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK

A0A6J1H8L6 uncharacterized protein LOC1114615428.0e-9593.16Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKK LQRMPSSLSSSSSCPS HESEDELKHMPLAPPR KNKK LSKQLSMCETPRDLAWEKRRRQMLRPRN STDRDDLTDEDWNELKGCIELGFAF E
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE
        EDGHKLCGTLPALDLYFAVNRQLSPSPVSTP SSTS+SSLG RSSSFESP+SE +TW+VCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE

A0A6J1JGY3 uncharacterized protein LOC1114850032.3e-9493.16Show/hide
Query:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE
        MDKK  QRM SSLSSSSSCPS HESEDELKHMPLAPPR KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRN STDRDDLTDEDWNELKGCIELGFAF E
Subjt:  MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNE

Query:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE
        EDGHKLCGTLPALDLYFAVNRQLSPSPVSTP SSTS+SSLG RSSSFESP+SE +TWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE
Subjt:  EDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08790.1 Protein of unknown function (DUF1685)1.1e-5163.83Show/hide
Query:  RMPSSLS-SSSSCPSFHESE-DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP-RNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGH
        R+  +LS S+SS  SF++SE +EL+ MPL PP+ K KKRLSKQLSM ET RD+AWE+RRRQML      +   DDLTDED +ELKG IELGF FNEE G 
Subjt:  RMPSSLS-SSSSCPSFHESE-DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP-RNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGH

Query:  KLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTS-----SLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQS
         L  TLPALDLYFAV RQ+  SPVSTP S  S+S     SLG RSSSF SP S+ D+ +V SPG+DP+QVK +LRHWAQAVACSV+QS
Subjt:  KLCGTLPALDLYFAVNRQLSPSPVSTPQSSTSTS-----SLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQS

AT2G43340.1 Protein of unknown function (DUF1685)1.7e-2035.84Show/hide
Query:  SLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLP
        S SS SSC      E+E+ +        K  K+L K+ S       +        + R ++       LTD+D  ELKGC++LGF FN E+  +LC TLP
Subjt:  SLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLP

Query:  ALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSV
        AL+L ++++++     +       S+SS  ++SS  +SP S   +W++ SPG++P  VKA+L+ WAQAVAC+V
Subjt:  ALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSV

AT3G04700.1 Protein of unknown function (DUF1685)1.6e-4259.12Show/hide
Query:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPR------NGSTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V
        K K RLSKQLSMCETPRD+AWE+RRRQM+  +       G++D    + +LTDED NELKG IELGF FNEE G KLC TLPALDLYFAVNRQLSP P  
Subjt:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPR------NGSTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V

Query:  STPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQS
        S+ +SS++++S    S      +++ D+ ++  PG+DP+Q+K +LRHWAQAVACSVMQS
Subjt:  STPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQS

AT3G04710.3 ankyrin repeat family protein8.6e-4157.69Show/hide
Query:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPR------NGSTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V
        K K RLSKQLSMCETPRD+AWE+RRRQM+  +       G++D    + +LTDED NELKG IELGF FNEE G KLC TLPALDLYFAVNRQLSP P  
Subjt:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPR------NGSTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V

Query:  STPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSV
        S+ +SS++++S    S      +++ D+ ++  PG+DP+Q+K +LRHWAQAVACS+
Subjt:  STPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSV

AT5G28690.1 Protein of unknown function (DUF1685)4.9e-4457.75Show/hide
Query:  SLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPR--------NGSTDRDDLTDEDWNELKGCIELGFAFNEEDG
        +L  SSS PS  ES  ++K   +AP   K K+RLSKQLSM ETPRD+AWEKRRRQML+ +            D  DLTDED  ELKG IELGF F+EE G
Subjt:  SLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPR--------NGSTDRDDLTDEDWNELKGCIELGFAFNEEDG

Query:  HKLCGTLPALDLYFAVNRQLS--PSPVSTPQSSTSTSSLGRRSSSFE-SPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQS
         KLC TLPALDLYFAVNRQLS  PSP S+     S SS    SSS   SP+++ D+ ++  PG++P+QVK +LRHWAQAVACS+MQS
Subjt:  HKLCGTLPALDLYFAVNRQLS--PSPVSTPQSSTSTSSLGRRSSSFE-SPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAAGAAACACTTGCAAAGGATGCCTTCATCACTATCTTCGTCGTCATCGTGCCCATCGTTCCACGAATCGGAGGACGAGCTAAAGCACATGCCGTTGGCACCACC
AAGGTTGAAGAACAAGAAACGTTTGTCGAAGCAACTTTCAATGTGTGAGACGCCAAGGGACCTTGCATGGGAGAAGCGGCGAAGACAGATGCTTCGCCCGAGGAATGGCT
CGACGGATAGGGACGACTTGACCGATGAGGACTGGAATGAGCTCAAAGGCTGCATAGAGCTAGGGTTTGCATTCAATGAGGAAGATGGGCACAAGCTGTGTGGTACGTTG
CCGGCCCTTGACCTTTACTTTGCCGTCAATCGACAGCTATCGCCAAGCCCCGTGTCGACGCCGCAGAGCAGCACCTCGACGTCGTCACTTGGCAGAAGGTCTTCATCATT
TGAAAGCCCCAGGAGTGAGTTTGATACATGGAGGGTTTGTAGCCCAGGGGAAGACCCAAAGCAAGTGAAGGCAAAATTAAGACACTGGGCTCAAGCTGTGGCATGTTCAG
TAATGCAATCATTGGGAGAAAAATAA
mRNA sequenceShow/hide mRNA sequence
GTTTCCTTCTCTCCATTAAAGCCCTCCCTTGCTCCTGTGCCCTCACTCGGTCGGGCTGGCTCTGGCCTTCTTCCTAAACAAATTCCCAACTCGGCCTCCTTCCCGTCCCC
GACCCCGGCCCTCTCTCCTCACGCGCGTGGCTGGCGGGTCGTCACATTCCCCCACCGCCACTCGCCTTTTTGTTTGTTTGTTTTCTTTCTTTAGACATAACCCTAATGGA
TAAGAAACACTTGCAAAGGATGCCTTCATCACTATCTTCGTCGTCATCGTGCCCATCGTTCCACGAATCGGAGGACGAGCTAAAGCACATGCCGTTGGCACCACCAAGGT
TGAAGAACAAGAAACGTTTGTCGAAGCAACTTTCAATGTGTGAGACGCCAAGGGACCTTGCATGGGAGAAGCGGCGAAGACAGATGCTTCGCCCGAGGAATGGCTCGACG
GATAGGGACGACTTGACCGATGAGGACTGGAATGAGCTCAAAGGCTGCATAGAGCTAGGGTTTGCATTCAATGAGGAAGATGGGCACAAGCTGTGTGGTACGTTGCCGGC
CCTTGACCTTTACTTTGCCGTCAATCGACAGCTATCGCCAAGCCCCGTGTCGACGCCGCAGAGCAGCACCTCGACGTCGTCACTTGGCAGAAGGTCTTCATCATTTGAAA
GCCCCAGGAGTGAGTTTGATACATGGAGGGTTTGTAGCCCAGGGGAAGACCCAAAGCAAGTGAAGGCAAAATTAAGACACTGGGCTCAAGCTGTGGCATGTTCAGTAATG
CAATCATTGGGAGAAAAATAAAGGAAAAAAATTGAGGAAATATATTGTACAGATCAGAGTAAAAAGGGAAGAAAATAAATTTAAAAAAAAAAAAAATGAGGCAGAAGGGG
TTTGGTTTCAAACCTTGAATGAATTAGGGTTTGGTTCTGCTTCTTCAATGCTCTTTCTTGTAGTTTTAACAGCAAAAATATCTCTCTGGTTGGAGCTAGCTTATTATATG
AGGGCCAAAGGAAGAGCCAGAGGCATTAATGGGAACTTTGGTATATAATTAATTTAGGGAAAAAATATGTTTTTTTCTTCATTCTCATTATTCCATGTTCATTGTAATTC
TATAAATATATATACTATATATGTGTGTGTTTATCTTTGTTGTTACACGTGAAATATGAACACATCAAACTATATATAAATCCAAAGTTTAAAAATCCATGGATTCAAGT
TTTTTATTTATTTATTTTCTTTTTAATTTTTGGGTGGAGAGATTCGAAATTCAAATCTTTATACTATCTCAATCTATACTCATCTTTGGCAGGTGTCTTTATTTTTAAAT
ATAAAGATTGTTATCATAATAAAATGAATTAAGTGGTACTCGAAAATATAACCTAATTTAAATAGAGGAAATTTATAATGGGTGGCATGCCTTCAACATATGGAAAATTG
TAGATATAACAAAATTCTCAAATTTGGATTTTCGTTTATTCCGTATTTCAATTTTACATTTTTAACAAACCAATGTTTAATCCGTATTGTTATCTATTTTGTATTTTTTC
ATTGATTTGTATACTTTTTTGCATCTTTTGTTTGTAACAATATAAGTTATTACTCACTAATTTTTGTATTTGCTACATTTTTTATTTCTTTTGTAGTTTTTTTCCTTTTT
AAATGATGTTTCTTTCGTCTTATGTACAAATGTACAACAGATAAGGAAGATTCAACTTAAAAGCTCACTAGCACTCAATTACAATTTGACAATCATTTTTATGATTTTTT
TAAAACACTTTAATAATTAGTCATTTTCACCCATATATCAAAAGTATATCAAGTATATCAACAATGTATCAAGTGTATATCAGACATATACTTGTATTATATCGAGCTAT
AAGTCTAATTTTTTAAACTTATTTTACCAAATCTACAAGAGGCCCTTAAATAAAAAGGGAAAACACTAGTTAACAAAGTAAAAAAGCAAAGGTGAATATGAAGCAAAATT
TTAGGCTCTGCTTGATAACCATTTCGTTTTTTGTTTTTTGGAAATTAAGCCTATAGATATTACTCGTACCTTCAAATTTCTTTTTTCCTTATCTACCTTTTACAAATTAT
TTAAAAAACCAAGCCAAAATTTGAAAACTAAAAATGTAACTTTTAAAAACTTGTTTTTATTTTTAAAATTTGGTTAAAAATTCAATCATGGTACTAAAGAACGATGCAAA
TTATTGTAGGAAATGGTAAGAAATAGGCTTAATTTTCAAAAATTAAAAAAAAAATGAATAGATTTTTAAAGCCATTGATGAAGTAATTCAATTTCTAAGTGCTTGTCACT
AGATTCATTTGGGGTAATTTATATATATATATATGGATCTTAGCACTTATTTCATCTTGAAATAGAAGCACAATAGGAGTAACATTGGGGTCGCACACTTACTTGGTTTT
CTACACTGGAAATGGACAAGTGTTCAATCACATGAAATTACCACAAATGAGATGATAAGTTATGGTGCAATGATTCATCAAAAGGTGGCAATACATGATTATTGTGACCA
TTTGGCCCTCTCTTGATTATCAACATGCAACTTTCGATTTTACTATAAGTGCATTAATCATATATACATGCTAAAATTAAAATTGATTAAAATAGTAAAACCAAATATTT
AAGAGAAAGAAATCAAAACATACGTTAAGTAATAAACCTAAAAGTCAATTAAAACATAAAATAAACATGGCTAGAATCTTAGGAAAGAATGCCTAGGCTAATCATTGAAA
GCAAAAGTAAATGAACAAAATAACTTACAAAATGTATGATTAGTATGAGAGATTTGCTGATTGTCTCTATATTTTTCTAAGGCAAATTTATAGTAGTTTTTGTAGTTGTC
ACAAGTGAAGTATTAACATTCGACACAACATTACTAATTCTTGTAGAGTATTTTATTTTTCTAATCTAATGCAGAATATTTTTCTTTCCTACTCTGGAAAATCTAAATTC
TTTCCTTATTTGGCCATGGTTCTAAAACAAGCTTTATGACTTTAATTAAGAGCTGTCACATTTTTTTTTTATTGTGAGAAAGAGTTTTTGCTGTTGTTGGTCCAATTGTA
CAACATTATGTTTTAATCTTTGCTCACTGTATTCAGCTATACAAATTCATTGTTTGAATCATTCAAGAACTTGAAGATTGTTGGATTTAGTGTCCTAAATCTCGTACATC
TTGTAGTTTGTAATTTATAAATACAAATTATTTATTTAATAAAATAAGAAGTATTCTATTTCACATTTAGATAACATTAACTAAATCCAATAAACTAAGATCCAATGTTA
TTTAATGAAACTTAAACAGTATGTAGTAGACATACAGGTGAATCATGTTTGAATAATAACCTAAAAGACCTGCAATATATGGTTAAGGTTGGGTGTATTATCTTTGTGGC
ACTAGGATATGAACCACTTTGTAATTGTTACATGAATTGTAAGTATTACAGACGATATGATCCATATTGTTTATATAGAGACATGTGAGTGGAGGTATCCTATACAAAGA
GTTTGTATAAGATCGTACCTTGAAATGATTGGTTTCTTATTATAACATCATTGCTAGAAGAAACTTTCATTTCACCAAAATGACCATAAGTGACTTAAGCTTCATCCTAA
ATGAGTTGTAAACTCCTGCCTATGAGAGCAGTCCTTTGATCTCTGCATGGGTGATAATGACAAGATTCGCTGACTTAATATGCCTACCATTTTGAAAATTTATCCGATTT
GGGAGTTGGAAACACAATTACAAAAGAAGAAATTCACTCCTCGATTGTTAGAGTAAGTAGATAAATTGCTCCCCTAAAGGCTGATTTTGAGGCTTGAACAATGAGACGCC
TCACCCTCTCCTGGCCTCAGAGGGGTTTAGTTATAGTTGAACTATGACGAATTATTCATTAGAGGGATTAATGGTACTTAATAAGTTAGATGTAATTACAGAAGTAAAAC
CGTAATTTTGACCCGTTGTACTTACGAACAATTTGTGAAGGGTCAACGCACTGTTGATTAGTTATATCCAATAAACACAGAAATATATCTATAGTACGAAGAGTGCAGTT
GCCAGTTTTTAGTAGAGAGACTGACAGTTAATGAAGGTTGATTAATTTAA
Protein sequenceShow/hide protein sequence
MDKKHLQRMPSSLSSSSSCPSFHESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPRNGSTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTL
PALDLYFAVNRQLSPSPVSTPQSSTSTSSLGRRSSSFESPRSEFDTWRVCSPGEDPKQVKAKLRHWAQAVACSVMQSLGEK