CuGenDBv2

Spg032551 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg032551
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DNA-directed RNA polymerase V subunit 5C-like
Genome location: scaffold3:22870853..22876180
RNA-Seq Expression: Spg032551
Synteny: Spg032551
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
MCH80834.1 hypothetical protein [Trifolium medium] (e value: 2.6e-14; identity: 34.56%)
Query:  IAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIM
        I +S+A+INN Y L+     E   L+D + D+ L+AV+  +  PG +W    N     +K   L+ E  +W   +K+ I+PTTH+ T++  R++L++CIM
Subjt:  IAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIM

Query:  KSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC
        +  PI+VGRI+ + +   + + +G  + P L+T+LC
Subjt:  KSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 2.4e-12; identity: 28.99%)
Query:  MTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHD
        +T+ +    +  G  +++S   IN V+ L D P  E +  ++ + +  L  VL  VA  GA+W+    GA   ++S L    A +W H++K+ ++PTTH 
Subjt:  MTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHD

Query:  ATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEEVRTMWKDFDEQWWACVTSTRNRRLQGSTHQPTAPASQ
         T+S  R++L++ ++    I+VGR++   +R  AA+  GA F P L+T LC  +      +EE      + D    A +T             PT    Q
Subjt:  ATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEEVRTMWKDFDEQWWACVTSTRNRRLQGSTHQPTAPASQ

Query:  PTETSPA
        P+ + PA
Subjt:  PTETSPA

PON62892.1 hypothetical protein PanWU01x14_135680 [Parasponia andersonii] (e value: 5.3e-15; identity: 33.74%)
Query:  MTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHD
        MT       +  G  +  S+  IN +Y L D P  E +  ++ + +  L+ VL  VA  GA+W+    GA   L+S  L   A +W H++K+R++PTTH 
Subjt:  MTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHD

Query:  ATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEE
         T+S +RV+L+Y ++    I+VG+I+   +   AA+  GA F P L+T +CC +      +EE
Subjt:  ATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEE

XP_038876674.1 chromatin assembly factor 1 subunit A-like, partial [Benincasa hispida] (e value: 1.7e-13; identity: 32.45%)
Query:  REPSEQPAAASSRKRKAP--TAAKDKGKKKMGASKEAEAKDWEKFQDERGFSCPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLD
        RE  EQ      ++R+A     AK+KGK    +SK A           R F         + ++ ++     T++FS+  IN +Y+++D P A GN ++D
Subjt:  REPSEQPAAASSRKRKAP--TAAKDKGKKKMGASKEAEAKDWEKFQDERGFSCPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLD

Query:  CLDDDFLSAVLLAVARPGAKWDGN-GAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLR
           +  +   L  + + G KW  +     TL S+ L  E  LW++ VK R++ TTHD T+S  RV+  YCI++S PIDVG+++   LR
Subjt:  CLDDDFLSAVLLAVARPGAKWDGN-GAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLR

XP_038904385.1 uncharacterized protein LOC120090747 [Benincasa hispida] (e value: 6.4e-13; identity: 35.77%)
Query:  GHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWDGN-GAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYC
        G  + FS+  IN +YK++DIP+A GN ++D   ++ +   L  + + G +W  +    +TL S  L  EA LW++ VK RI+PT+HD T+S  RV+  YC
Subjt:  GHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWDGN-GAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYC

Query:  IMKSTPIDVGRILVETLRRTAAK
        I     IDV  ++    +  A +
Subjt:  IMKSTPIDVGRILVETLRRTAAK

TrEMBL top hits (e value, % identity, alignment)
A0A2P5CPE8 Uncharacterized protein (e value: 2.5e-15; identity: 33.74%)
Query:  MTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHD
        MT       +  G  +  S+  IN +Y L D P  E +  ++ + +  L+ VL  VA  GA+W+    GA   L+S  L   A +W H++K+R++PTTH 
Subjt:  MTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHD

Query:  ATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEE
         T+S +RV+L+Y ++    I+VG+I+   +   AA+  GA F P L+T +CC +      +EE
Subjt:  ATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEE

A0A392M2J7 Uncharacterized protein (Fragment) (e value: 1.3e-14; identity: 34.56%)
Query:  IAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIM
        I +S+A+INN Y L+     E   L+D + D+ L+AV+  +  PG +W    N     +K   L+ E  +W   +K+ I+PTTH+ T++  R++L++CIM
Subjt:  IAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD--GNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIM

Query:  KSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC
        +  PI+VGRI+ + +   + + +G  + P L+T+LC
Subjt:  KSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC

A0A5D1ZYC3 Uncharacterized protein (e value: 5.9e-12; identity: 26.63%)
Query:  KEAEAKDWEKFQDERG----------FSCPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKW-DG
        K+  A  WE F D R           ++    + +TE I Q+K     T+  +S  +N+++ L D+   E   + + +  DFL  VL  V   G++W   
Subjt:  KEAEAKDWEKFQDERG----------FSCPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKW-DG

Query:  NGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC
           + + + + L+  A +W ++V+   MP  H +T+S+++++L+Y I+    I++G+I+++ +   A    G+ + P L+TSLC
Subjt:  NGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC

A0A5D2MA47 Uncharacterized protein (e value: 3.4e-12; identity: 28.26%)
Query:  KEAEAKDWEKFQDERGFS----------CPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKW-DG
        K+  A  WE+F +   FS              Q  TE I ++K      +  +S  IN+++ L D+   E   +++ ++ DFL  VL  V   G++W   
Subjt:  KEAEAKDWEKFQDERGFS----------CPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKW-DG

Query:  NGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC
           + + + + L+  A +W ++V+   MP +H  T+S++R++L+Y I+    I+VG+I+++ +   A K  G+ + P L+TSLC
Subjt:  NGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLC

A0A5D2Q4A4 Uncharacterized protein (e value: 1.7e-11; identity: 26.06%)
Query:  KEAEAKDWEKFQDERG---------FSCPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD-GN
        K+  A  W++F D R          F   L      ++  RK +   T  F S  IN+++ L D+   E   +++ ++ DFL  VL  +   G++W    
Subjt:  KEAEAKDWEKFQDERG---------FSCPLPQTMTEQISQRKWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWD-GN

Query:  GAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNL
          + + + + L+  A +W H+V+N  MP +H +T+ +++++ +Y I+    I+VG+I+++ +   A K  G+ + P L+TS+C  +++
Subjt:  GAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPIDVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCAGACTCTCAAAATCCATCCCTCGGTGGTTATGCTGACGACGCTCCTCCATCTTCACCGAAGGTGGAAGGACCGCATCGGGACGTCATTATCTCATCGGATGAAGA
TAGCCGGCAAGACATTGATCACATGGAGGAAGACGAACCGGAAGTCATAGATGGGGAACAAGCCATCGAGAGCTCGGAAGGTCAGCTCACCGCTGATCCATCACCCTCCG
GTGGAAACACCACTGAAGTCCTCAACATTACGCCACTAGATACCTCAACCTTCACCCTCCAACCCGTCAACCTCCAGCCTATACCAAATCTTCTCACCCCTCGCAACCTA
ACTCCATTCTCTTCTCCCTTCCCGTCGGTTGCCCTCCTTCCACCTCCGAACTCCTCCACTGCCCTGGCCCCAAACACCAACTCTTCCTCTTCCTCCACGACCGCATCCGT
TTCCACCTCCCTCCCACTGGTAACTCCCACATGCCCTACTCTACCACAATTATCAGAGCAGCAGATGGCCGCATGGAATGACCTGTTAGGAGAATTGGGCGAGGAAGGCG
AAGAAATTCGCCGAAGAGAAATGAGGGAGCATGTCACCGCAGATGAAACTGCTCGCAAACTCATCGCAGAAACAAGTGCAGAAAAAGATAAAGAAGGATCAAGCAATGCG
GAGAACGCAACTGCCGCAGACATCGCCGCAACTAACTTAGCCATCATTGCGGCAGCCCTTGACGTTGAATCCTCGGATTCCGAAGATGAAGTGCCCCTAATTCTTCGCCG
CAGCCTCCAACCCACAGGGGTAATCATCCGAGAGCCATCTGAACAGCCTGCGGCGGCCTCTAGCAGAAAAAGGAAAGCGCCCACCGCAGCCAAAGACAAGGGCAAAAAGA
AAATGGGCGCAAGCAAAGAGGCAGAGGCCAAAGATTGGGAGAAATTCCAAGACGAACGAGGATTCAGTTGCCCCCTCCCGCAGACAATGACTGAGCAAATTTCCCAGCGC
AAATGGGAAAACGGCCACACCATCGCCTTCTCCTCCGCCCACATCAACAATGTCTATAAACTACGAGACATTCCCAACGCAGAAGGCAATGCACTCCTTGACTGTCTAGA
TGATGACTTCCTGAGCGCAGTACTGCTTGCGGTGGCCAGACCTGGGGCCAAATGGGATGGAAATGGAGCTGCAAGGACCCTCAAATCGCAACTGCTCGAGTTTGAGGCCA
CGCTTTGGATGCACTGGGTAAAGAATCGCATCATGCCCACAACGCATGATGCGACATTGAGCTTGCAGCGAGTAATCCTTGTCTACTGCATAATGAAGAGCACCCCAATC
GACGTTGGCCGCATCCTCGTAGAAACCTTGCGCCGGACTGCTGCAAAACCAAAGGGCGCAAAATTCTGTCCCGGACTGGTCACATCCCTCTGCTGCACATCAAATCTATT
GCCCGCATCAGATGAAGAAGTGCGAACAATGTGGAAGGACTTTGATGAGCAATGGTGGGCATGTGTTACAAGCACTCGCAATAGGCGTCTGCAAGGAAGCACGCATCAGC
CAACGGCCCCCGCATCTCAACCCACCGAGACTTCGCCTGCGCCTGAGGCACATAAGAAGAAAAGGCGCAAGCAAACCGCAGGCAGGTACCATCGCCTAGCCGCACAGACG
CCTCAAAATACTGCTCCAGCGCAGGAAAGCACGGTCTGCGCTCAAAATCAAGAATCCTTGCCACAATCGCAAGAAGACCTGCGGCCACCACCACCTCCACCACCGCCACC
GCAACCACCAATTGTCCAGCACACGCCTCCTGCGCAGGCCACGGATGAATTGCGACCAGAGGGGAATGCTGAAGACCCATTACCGAACCCCACTCGCAAGGAGGAAGTTC
CTACGCCGAACCCTACGCTGCAAGCCTCTGCGAGTACCACTCAGGCTGGGGCCGATCTCACGATTGATGTGGACGCAGCTCTTATCTTCTCGCAGATGCGGAAAATAATT
CACGATAGTATAGAGCCCTTGCGACTGCAGCAAGCGACTCCTTTGCTTTCAATCCGTGCATCTGACTTTCTCAATTTTCTTAACCGCATTACTTCAATTGGTTTAACCTT
TATCTTCATTTCCTCAGACCTTAAGGAAAATTTCTTTGCATTTTCGACTTGCCGCAACTCAGCCTTCATCTTCTTTAGTCCATGCGCTTCAATTTCTTCTCAGCGCAGGC
ACACATCCCAAGATGATCAAGAGACAGAAAAGATGGATAAAAAGACAATGCTGACACAACTGACGCAATGTGCTAAATCCCTGACAGGAGCAAAATTATGGATTTTGGTG
AAAAAATGCCCAAAAGGAGTCATTCAAGCCCAAATTCCATACATGGGTCAAGGGGGAGTCCAAAGAAGTCAACCATGTGGAAAAGGTCCAAAAAGAATTGAAGTTGGCCC
AAGAATTGAAGCCCAAACCGCGCTTGAAAGTTCTGGAATGCGCCTAGCATCGAGACGCTGTAGGGACAGCGTCGACGCTGTCCCATTTCTTGGCCGGCAAGTCGATGACG
TCACAGCGTCGCGACGCTGTGCCAATAGCGTTGTGATGTTGCCCCATTTCTGA
mRNA sequence
ATGTCAGACTCTCAAAATCCATCCCTCGGTGGTTATGCTGACGACGCTCCTCCATCTTCACCGAAGGTGGAAGGACCGCATCGGGACGTCATTATCTCATCGGATGAAGA
TAGCCGGCAAGACATTGATCACATGGAGGAAGACGAACCGGAAGTCATAGATGGGGAACAAGCCATCGAGAGCTCGGAAGGTCAGCTCACCGCTGATCCATCACCCTCCG
GTGGAAACACCACTGAAGTCCTCAACATTACGCCACTAGATACCTCAACCTTCACCCTCCAACCCGTCAACCTCCAGCCTATACCAAATCTTCTCACCCCTCGCAACCTA
ACTCCATTCTCTTCTCCCTTCCCGTCGGTTGCCCTCCTTCCACCTCCGAACTCCTCCACTGCCCTGGCCCCAAACACCAACTCTTCCTCTTCCTCCACGACCGCATCCGT
TTCCACCTCCCTCCCACTGGTAACTCCCACATGCCCTACTCTACCACAATTATCAGAGCAGCAGATGGCCGCATGGAATGACCTGTTAGGAGAATTGGGCGAGGAAGGCG
AAGAAATTCGCCGAAGAGAAATGAGGGAGCATGTCACCGCAGATGAAACTGCTCGCAAACTCATCGCAGAAACAAGTGCAGAAAAAGATAAAGAAGGATCAAGCAATGCG
GAGAACGCAACTGCCGCAGACATCGCCGCAACTAACTTAGCCATCATTGCGGCAGCCCTTGACGTTGAATCCTCGGATTCCGAAGATGAAGTGCCCCTAATTCTTCGCCG
CAGCCTCCAACCCACAGGGGTAATCATCCGAGAGCCATCTGAACAGCCTGCGGCGGCCTCTAGCAGAAAAAGGAAAGCGCCCACCGCAGCCAAAGACAAGGGCAAAAAGA
AAATGGGCGCAAGCAAAGAGGCAGAGGCCAAAGATTGGGAGAAATTCCAAGACGAACGAGGATTCAGTTGCCCCCTCCCGCAGACAATGACTGAGCAAATTTCCCAGCGC
AAATGGGAAAACGGCCACACCATCGCCTTCTCCTCCGCCCACATCAACAATGTCTATAAACTACGAGACATTCCCAACGCAGAAGGCAATGCACTCCTTGACTGTCTAGA
TGATGACTTCCTGAGCGCAGTACTGCTTGCGGTGGCCAGACCTGGGGCCAAATGGGATGGAAATGGAGCTGCAAGGACCCTCAAATCGCAACTGCTCGAGTTTGAGGCCA
CGCTTTGGATGCACTGGGTAAAGAATCGCATCATGCCCACAACGCATGATGCGACATTGAGCTTGCAGCGAGTAATCCTTGTCTACTGCATAATGAAGAGCACCCCAATC
GACGTTGGCCGCATCCTCGTAGAAACCTTGCGCCGGACTGCTGCAAAACCAAAGGGCGCAAAATTCTGTCCCGGACTGGTCACATCCCTCTGCTGCACATCAAATCTATT
GCCCGCATCAGATGAAGAAGTGCGAACAATGTGGAAGGACTTTGATGAGCAATGGTGGGCATGTGTTACAAGCACTCGCAATAGGCGTCTGCAAGGAAGCACGCATCAGC
CAACGGCCCCCGCATCTCAACCCACCGAGACTTCGCCTGCGCCTGAGGCACATAAGAAGAAAAGGCGCAAGCAAACCGCAGGCAGGTACCATCGCCTAGCCGCACAGACG
CCTCAAAATACTGCTCCAGCGCAGGAAAGCACGGTCTGCGCTCAAAATCAAGAATCCTTGCCACAATCGCAAGAAGACCTGCGGCCACCACCACCTCCACCACCGCCACC
GCAACCACCAATTGTCCAGCACACGCCTCCTGCGCAGGCCACGGATGAATTGCGACCAGAGGGGAATGCTGAAGACCCATTACCGAACCCCACTCGCAAGGAGGAAGTTC
CTACGCCGAACCCTACGCTGCAAGCCTCTGCGAGTACCACTCAGGCTGGGGCCGATCTCACGATTGATGTGGACGCAGCTCTTATCTTCTCGCAGATGCGGAAAATAATT
CACGATAGTATAGAGCCCTTGCGACTGCAGCAAGCGACTCCTTTGCTTTCAATCCGTGCATCTGACTTTCTCAATTTTCTTAACCGCATTACTTCAATTGGTTTAACCTT
TATCTTCATTTCCTCAGACCTTAAGGAAAATTTCTTTGCATTTTCGACTTGCCGCAACTCAGCCTTCATCTTCTTTAGTCCATGCGCTTCAATTTCTTCTCAGCGCAGGC
ACACATCCCAAGATGATCAAGAGACAGAAAAGATGGATAAAAAGACAATGCTGACACAACTGACGCAATGTGCTAAATCCCTGACAGGAGCAAAATTATGGATTTTGGTG
AAAAAATGCCCAAAAGGAGTCATTCAAGCCCAAATTCCATACATGGGTCAAGGGGGAGTCCAAAGAAGTCAACCATGTGGAAAAGGTCCAAAAAGAATTGAAGTTGGCCC
AAGAATTGAAGCCCAAACCGCGCTTGAAAGTTCTGGAATGCGCCTAGCATCGAGACGCTGTAGGGACAGCGTCGACGCTGTCCCATTTCTTGGCCGGCAAGTCGATGACG
TCACAGCGTCGCGACGCTGTGCCAATAGCGTTGTGATGTTGCCCCATTTCTGA
Protein sequence
MSDSQNPSLGGYADDAPPSSPKVEGPHRDVIISSDEDSRQDIDHMEEDEPEVIDGEQAIESSEGQLTADPSPSGGNTTEVLNITPLDTSTFTLQPVNLQPIPNLLTPRNL
TPFSSPFPSVALLPPPNSSTALAPNTNSSSSSTTASVSTSLPLVTPTCPTLPQLSEQQMAAWNDLLGELGEEGEEIRRREMREHVTADETARKLIAETSAEKDKEGSSNA
ENATAADIAATNLAIIAAALDVESSDSEDEVPLILRRSLQPTGVIIREPSEQPAAASSRKRKAPTAAKDKGKKKMGASKEAEAKDWEKFQDERGFSCPLPQTMTEQISQR
KWENGHTIAFSSAHINNVYKLRDIPNAEGNALLDCLDDDFLSAVLLAVARPGAKWDGNGAARTLKSQLLEFEATLWMHWVKNRIMPTTHDATLSLQRVILVYCIMKSTPI
DVGRILVETLRRTAAKPKGAKFCPGLVTSLCCTSNLLPASDEEVRTMWKDFDEQWWACVTSTRNRRLQGSTHQPTAPASQPTETSPAPEAHKKKRRKQTAGRYHRLAAQT
PQNTAPAQESTVCAQNQESLPQSQEDLRPPPPPPPPPQPPIVQHTPPAQATDELRPEGNAEDPLPNPTRKEEVPTPNPTLQASASTTQAGADLTIDVDAALIFSQMRKII
HDSIEPLRLQQATPLLSIRASDFLNFLNRITSIGLTFIFISSDLKENFFAFSTCRNSAFIFFSPCASISSQRRHTSQDDQETEKMDKKTMLTQLTQCAKSLTGAKLWILV
KKCPKGVIQAQIPYMGQGGVQRSQPCGKGPKRIEVGPRIEAQTALESSGMRLASRRCRDSVDAVPFLGRQVDDVTASRRCANSVVMLPHF