; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04440 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04440
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein MNN4-like
Genome locationClcChr08:13434568..13437351
RNA-Seq ExpressionClc08G04440
SyntenyClc08G04440
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833181.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]3.1e-6891.16Show/hide
Query:  LVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNA
        +VEEPTKEAAKSR KQYSGLYTEVGFFPEPIELPAFIIQGV+AL WRQFCESGQVIQPTAVEAFYEGTIH KAH+VKV+DEVISFEPQEINALFDLPN A
Subjt:  LVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNA

Query:  AAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIA
         AEGN+IMSTPTDAE+NDAL+IIAKPGSEWNTS KGIQSLAPNCLIA
Subjt:  AAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIA

WP_217833214.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]5.6e-6289.19Show/hide
Query:  QYSNLPTPPTTEPSARPTQPQTEATTSTHHQEEPYHLQHASAPLPVVDLNLDDLLRYLDDGILHPIMGDLDEIRRKEMESLQRQQELAQQVSQLGQQVTQ
        Q+SNLPTPPTTEPSARPTQPQ EATTSTHHQEEPYHL+HASAPLP VDLNLDDLLRYLDDGILHP+MGDLDEIRRKEMESLQRQ+ELAQQVSQLGQQ+TQ
Subjt:  QYSNLPTPPTTEPSARPTQPQTEATTSTHHQEEPYHLQHASAPLPVVDLNLDDLLRYLDDGILHPIMGDLDEIRRKEMESLQRQQELAQQVSQLGQQVTQ

Query:  MAQQQMELRSFVQRQAQRQDEQFSTLMNYIYEVFVQRIPAPVIPPSLQ
        MAQQQ+ELRSFVQRQAQRQDEQFSTLMNYIYEVF+QRIP   +  +LQ
Subjt:  MAQQQMELRSFVQRQAQRQDEQFSTLMNYIYEVFVQRIPAPVIPPSLQ

XP_038876674.1 chromatin assembly factor 1 subunit A-like, partial [Benincasa hispida]7.1e-2531.05Show/hide
Query:  VREEEAEEDVLLLR----RKNEKGKDAGTSEAVTPLVAASPKKRKEEIAAVEQSINEAARKAEFLRRIAEIA-TELEVEWEVSDAEKPPKAERIKKNIKK
        V EEEA  + +++      + E G+D  T E  TP V A+ K   +    V + I            +AE+A  E+E + EV + +K  K  + +K  + 
Subjt:  VREEEAEEDVLLLR----RKNEKGKDAGTSEAVTPLVAASPKKRKEEIAAVEQSINEAARKAEFLRRIAEIA-TELEVEWEVSDAEKPPKAERIKKNIKK

Query:  IREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYS-----G
        + E +++EKE    +E+  K KKE+E    KE   E++KK E                 RR       + +KGKE L E   +E  K R K+        
Subjt:  IREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYS-----G

Query:  LYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDA
        L  E G   E    PA +++                        FY G +H     V +K E +SF  ++IN ++ + +N  A GNKI+  PT+ ++ DA
Subjt:  LYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDA

Query:  LSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGL
        L ++ + G +W+ SLKG+ +LA   L+ E  LW+Y +K+ L+ TTHD ++SRD+ M  YCI++ I +DVG++IA Q+R L
Subjt:  LSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGL

XP_038898613.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120086174 [Benincasa hispida]5.1e-2329.94Show/hide
Query:  VAASPKKRKEEIAAVEQSINEAARKAEFLRRIAEIATELEVEWEVSDAEKPPKAERIKKNIKKIREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPER
        V   P   +EE   V     E A K E L     +   L  E +  + +     +     +KK+ E +K++K     K+K+ KK++ KE           
Subjt:  VAASPKKRKEEIAAVEQSINEAARKAEFLRRIAEIATELEVEWEVSDAEKPPKAERIKKNIKKIREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPER

Query:  KKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAV
        K+  +VA+      R  D   +   PAS  KR+ K KE                    +  E+GF+P P  LP  I   +   GW  F +   +I P  V
Subjt:  KKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAV

Query:  EAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLL
          FY G +H     V  K E + F  + IN L+ + +N  A GNKI+  PT+  + +AL ++A+PG+ W  S KGI++L    L+++  LW+Y +K+ L+
Subjt:  EAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLL

Query:  PTTHDASISRDQAM
        PTTHD ++S+DQ +
Subjt:  PTTHDASISRDQAM

XP_038904385.1 uncharacterized protein LOC120090747 [Benincasa hispida]5.3e-2829.85Show/hide
Query:  WEVSDAEKPPKAERIKKNIKKIREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVE
        +E +  +K  K ++  +  ++ REEK+ +KE    + + P    E      +E +  ++KK+   +   + IR +       +  +P+ R  K      E
Subjt:  WEVSDAEKPPKAERIKKNIKKIREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVE

Query:  EPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAE
            +   S  K+   +  E+GF P    LP F    V   GW  F +   +I PT V AFY+G +H     V +K  ++ F  ++IN L+ + +   A 
Subjt:  EPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAE

Query:  GNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGLFFKP
        GNKI+  P + ++ DAL  + + G++W+ SLKGI++LA + L+ EA LW+Y +KR ++PT+HD ++SRD+ M  YCI   I +DV  +IA Q +      
Subjt:  GNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGLFFKP

Query:  RAQSLRQILKDSTNLLDAANPKKRPLKSQPSPPQP
         A   +Q    S   ++A      PL    SP +P
Subjt:  RAQSLRQILKDSTNLLDAANPKKRPLKSQPSPPQP

TrEMBL top hitse value%identityAlignment
A0A061FAJ6 Uncharacterized protein1.2e-1427.6Show/hide
Query:  AAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNAL----GWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEG
        +A++  + Y+ L  +V      IE+P    + +N L     W QFC    V+    V  FY   +     +  V+ + + F  Q IN L   PN    E 
Subjt:  AAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNAL----GWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEG

Query:  NKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQI
         + +    D   N+ +S +   G++W TS     S   + +  E  +WL+F+   LLP+TH + +++D+A++IY I+    +DVG++I+  I
Subjt:  NKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQI

A0A2P5AGA5 Uncharacterized protein (Fragment)4.2e-1530.84Show/hide
Query:  VKRNEKGKEKLVEEPTKEAAKSR---NKQYSGLYTEVGFFPEPIE----LPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVI
        +KR  +   K V+  T EAA++R   N Q   L  E GF  +  E    LP FI Q +    W+QFC   +      V  FY        + V V+   +
Subjt:  VKRNEKGKEKLVEEPTKEAAKSR---NKQYSGLYTEVGFFPEPIE----LPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVI

Query:  SFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQE
        S+  + INA+F L  +   E ++ +   T+ +L   L  +A  G+EWN S +G  +   + L   A +W +F+K  LLPTTH  ++S+D+ ++++ ++  
Subjt:  SFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQE

Query:  IQLDVGRIIAPQIR
          ++VGR+I  +IR
Subjt:  IQLDVGRIIAPQIR

A0A2P5BCG4 Uncharacterized protein (Fragment)3.2e-1530.05Show/hide
Query:  VKRNEKGKEKLVEEPTKEAAK--SRNKQYSGLYTEVGFFPEPIE----LPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVIS
        +KR  +   K V+  T+ AA     N Q   L  E GF  +  E    LP FI Q +    W+QFC   +      V  FY      + + V V+   +S
Subjt:  VKRNEKGKEKLVEEPTKEAAK--SRNKQYSGLYTEVGFFPEPIE----LPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVIS

Query:  FEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEI
        +  + INA+F L  +   E ++ +   T  +L   L  +A  G+EWN S +G  +   + L   A +W +F+K  LLPTTH  ++S+D+ ++++ ++   
Subjt:  FEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEI

Query:  QLDVGRIIAPQIR
         ++VGR+I  +IR
Subjt:  QLDVGRIIAPQIR

A0A5A7TZE0 Protein MNN4-like4.7e-2229.69Show/hide
Query:  VEQSINEAARKA-EFLRRIAEIATELEVEWEVSDAEKPPKAERIKKNIKKIREEKKKE-KEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGV
        VE    + ARKA E   ++ +   +++V+ +   A    K ER +K   +  +E +KE +E++ +++++ +K+ +K+ V   +   +R+KKN+       
Subjt:  VEQSINEAARKA-EFLRRIAEIATELEVEWEVSDAEKPPKAERIKKNIKKIREEKKKE-KEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGV

Query:  VIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKA
        V R+ ++ ++ +     +   E GK  ++E+                    G FP   +LP F+   + AL W+QF E    I+P+ +  FY G+I+ + 
Subjt:  VIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKA

Query:  HMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWN-TSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRD
        H   VK ++++F P+ +N L+ L          I   P+D ++ +AL  +A PG +W+ T +K  Q L P+ L   A++WL FIK++L+PT HD +IS +
Subjt:  HMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWN-TSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRD

Query:  QAMVIYCIMQEIQLDVGRII
        + M++YCIM+EI L+V  II
Subjt:  QAMVIYCIMQEIQLDVGRII

A0A5D3DVQ6 Uncharacterized protein2.9e-1628.29Show/hide
Query:  KEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQ
        +++ P +KKK          +   DA   RR      ++ E+ +E  +E+    A + ++        E GFF   ++L  F++  + ALGW++F     
Subjt:  KEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQYSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQ

Query:  VIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWN-TSLKGIQSLAPNCLIAEANLWL
         I+   V+ FY G I  + H   VK+                              P+D ++ +AL  +A    +W+ TS+K  +    N L  EA++WL
Subjt:  VIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKPGSEWN-TSLKGIQSLAPNCLIAEANLWL

Query:  YFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGLFFKPR
         FIK+ L+PT HD +IS ++ M++YCIM+EI +DV  II   I+     PR
Subjt:  YFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGLFFKPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCAGAAACCCACTTCCTCCAAAAGCCCTAAACCCATGGCCTCATCCTCGAGGCAGGATAGTCAGAATCTGTCCACTACCTCCACCACACCTGAACCTCTCGC
TGTTGTTCGCCCCAACAGTCCTGAGTTGGAGGCGACATCCCCATCTTCACCACGCTCCAGGAACCTGTCTGAAGCCCAAAAGGTTGGGAGGAATGACGAGACCACCGTCT
TTGGAGATATTCTAGACACGATCGATGAAGAAGACGATGACTGGTTGGACGGGTTGGTCGACCAGAGGATGAAGAACGAATCGCGTGGGTCAGTCCTTGCGGTAAGGGGT
CTTGAGGAATCTGACTCCGATGAAAGACCCATTGGCAAGGTATTAGCGGCAGTGGAAAAGAAGAGGTTCACTAAGGAAAAGAAAGTCAGACCAAATAGGGTCAGGGAAGA
AGAAGCCGAGGAAGATGTCCTTCTGCTAAGAAGGAAAAACGAAAAAGGAAAGGACGCGGGGACATCTGAGGCTGTCACGCCATTGGTCGCCGCATCCCCTAAGAAAAGAA
AAGAGGAAATCGCGGCTGTTGAGCAATCTATCAATGAAGCCGCGAGAAAAGCTGAATTTCTCCGTCGCATAGCAGAGATTGCGACGGAGTTAGAAGTAGAGTGGGAAGTA
AGTGATGCGGAGAAGCCACCAAAAGCTGAACGCATCAAGAAAAATATAAAGAAGATAAGGGAGGAGAAGAAGAAAGAAAAAGAGGTAGCCTTCATAAAGGAGAAGCTACC
GAAGAAGAAGAAAGAAAAAGAGGTAGTCTTCACCAAGGAGAAACTACCGGAGAGAAAGAAGAAAAATGAGGTGGCTAAGGCACCTGGAGTAGTTATCCGGGATATGGACG
CAGGCAGAACAAGGAGAACGCCTGCTAGTCCCGTGAAGAGGAATGAGAAAGGAAAAGAGAAATTGGTTGAGGAACCAACAAAGGAGGCGGCGAAGAGCAGAAATAAACAA
TATAGTGGGCTCTACACTGAAGTGGGATTCTTCCCAGAGCCTATTGAGCTACCCGCCTTCATCATCCAAGGAGTCAACGCATTGGGTTGGAGACAATTTTGTGAAAGTGG
CCAAGTCATCCAACCCACTGCCGTGGAGGCGTTTTACGAAGGAACAATTCACCGCAAGGCACATATGGTCAAAGTTAAAGATGAGGTGATTTCCTTCGAGCCTCAAGAGA
TTAACGCATTGTTTGATCTGCCCAACAATGCGGCGGCGGAAGGTAACAAAATAATGTCAACGCCTACAGACGCCGAATTGAATGATGCCCTCTCAATCATCGCCAAACCG
GGGTCAGAGTGGAACACTTCCCTGAAGGGCATTCAATCATTGGCGCCTAACTGCTTGATCGCTGAGGCCAACCTGTGGCTGTATTTCATCAAACGGTCGCTGCTTCCCAC
AACGCATGACGCCTCAATTTCAAGGGACCAGGCCATGGTCATCTACTGCATCATGCAGGAAATTCAGCTGGATGTGGGACGCATCATTGCCCCACAGATACGGGGGCTGT
TCTTCAAGCCAAGGGCGCAGAGTTTGAGACAAATACTGAAGGACTCAACAAATCTGCTGGACGCGGCCAACCCAAAGAAGAGGCCGCTAAAGTCACAACCATCACCCCCC
CAACCTCCTCAACCAAAGAAGAGGAAATTGGTCAAGAAAAATTTTGAGATTCAGGCGTCGAGTTCACAACCTGGTGAAGCCGAAGTTCCGCTCGACGCCTACACCCAGGC
CTTGACCATTTATACTCCTCCCAGCGCCCCAATCCCCGAGGAGCCCTCTTCACCACCAACTTCCCCTTCTCCTTCCCCTAAAATCCAAAATGAACCGCTCCATCTCCTTA
CCACCCAACGCGGCCTTCCGACGCCTCTTCAGATCCCTATCGCTGATTTAAATGAAGAGACGAAGGAAGTGTCGCCGCCTTCTTCGTCTCACCTGGAGTTAACCCTCTCC
CCGCCTCAAGAGTCAGCGCCCTTTCATTTCTTGTCGCCTCGCCATGAACCTCAATATTCCAATCTTCCAACCCCACCAACTACCGAACCTTCTGCCCGCCCAACTCAACC
GCAGACAGAAGCAACCACCAGCACTCATCATCAAGAAGAACCGTATCACTTGCAGCACGCCTCTGCTCCGCTGCCGGTTGTTGATCTAAATTTGGATGATCTGCTGAGGT
ATTTGGATGATGGAATCCTTCACCCCATCATGGGAGATTTGGATGAAATAAGGCGCAAGGAGATGGAAAGTCTCCAGCGGCAACAAGAGTTAGCTCAACAAGTCTCACAA
TTGGGTCAGCAAGTGACTCAAATGGCGCAGCAACAAATGGAGCTTCGGAGTTTTGTTCAACGCCAAGCTCAGCGTCAGGACGAACAGTTCAGCACTTTGATGAACTATAT
TTATGAAGTGTTCGTCCAACGCATCCCAGCGCCGGTTATCCCTCCTTCCCTCCAACAACCCCTATCCAAATCCTCCCCTCGTCGACAACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCCAGAAACCCACTTCCTCCAAAAGCCCTAAACCCATGGCCTCATCCTCGAGGCAGGATAGTCAGAATCTGTCCACTACCTCCACCACACCTGAACCTCTCGC
TGTTGTTCGCCCCAACAGTCCTGAGTTGGAGGCGACATCCCCATCTTCACCACGCTCCAGGAACCTGTCTGAAGCCCAAAAGGTTGGGAGGAATGACGAGACCACCGTCT
TTGGAGATATTCTAGACACGATCGATGAAGAAGACGATGACTGGTTGGACGGGTTGGTCGACCAGAGGATGAAGAACGAATCGCGTGGGTCAGTCCTTGCGGTAAGGGGT
CTTGAGGAATCTGACTCCGATGAAAGACCCATTGGCAAGGTATTAGCGGCAGTGGAAAAGAAGAGGTTCACTAAGGAAAAGAAAGTCAGACCAAATAGGGTCAGGGAAGA
AGAAGCCGAGGAAGATGTCCTTCTGCTAAGAAGGAAAAACGAAAAAGGAAAGGACGCGGGGACATCTGAGGCTGTCACGCCATTGGTCGCCGCATCCCCTAAGAAAAGAA
AAGAGGAAATCGCGGCTGTTGAGCAATCTATCAATGAAGCCGCGAGAAAAGCTGAATTTCTCCGTCGCATAGCAGAGATTGCGACGGAGTTAGAAGTAGAGTGGGAAGTA
AGTGATGCGGAGAAGCCACCAAAAGCTGAACGCATCAAGAAAAATATAAAGAAGATAAGGGAGGAGAAGAAGAAAGAAAAAGAGGTAGCCTTCATAAAGGAGAAGCTACC
GAAGAAGAAGAAAGAAAAAGAGGTAGTCTTCACCAAGGAGAAACTACCGGAGAGAAAGAAGAAAAATGAGGTGGCTAAGGCACCTGGAGTAGTTATCCGGGATATGGACG
CAGGCAGAACAAGGAGAACGCCTGCTAGTCCCGTGAAGAGGAATGAGAAAGGAAAAGAGAAATTGGTTGAGGAACCAACAAAGGAGGCGGCGAAGAGCAGAAATAAACAA
TATAGTGGGCTCTACACTGAAGTGGGATTCTTCCCAGAGCCTATTGAGCTACCCGCCTTCATCATCCAAGGAGTCAACGCATTGGGTTGGAGACAATTTTGTGAAAGTGG
CCAAGTCATCCAACCCACTGCCGTGGAGGCGTTTTACGAAGGAACAATTCACCGCAAGGCACATATGGTCAAAGTTAAAGATGAGGTGATTTCCTTCGAGCCTCAAGAGA
TTAACGCATTGTTTGATCTGCCCAACAATGCGGCGGCGGAAGGTAACAAAATAATGTCAACGCCTACAGACGCCGAATTGAATGATGCCCTCTCAATCATCGCCAAACCG
GGGTCAGAGTGGAACACTTCCCTGAAGGGCATTCAATCATTGGCGCCTAACTGCTTGATCGCTGAGGCCAACCTGTGGCTGTATTTCATCAAACGGTCGCTGCTTCCCAC
AACGCATGACGCCTCAATTTCAAGGGACCAGGCCATGGTCATCTACTGCATCATGCAGGAAATTCAGCTGGATGTGGGACGCATCATTGCCCCACAGATACGGGGGCTGT
TCTTCAAGCCAAGGGCGCAGAGTTTGAGACAAATACTGAAGGACTCAACAAATCTGCTGGACGCGGCCAACCCAAAGAAGAGGCCGCTAAAGTCACAACCATCACCCCCC
CAACCTCCTCAACCAAAGAAGAGGAAATTGGTCAAGAAAAATTTTGAGATTCAGGCGTCGAGTTCACAACCTGGTGAAGCCGAAGTTCCGCTCGACGCCTACACCCAGGC
CTTGACCATTTATACTCCTCCCAGCGCCCCAATCCCCGAGGAGCCCTCTTCACCACCAACTTCCCCTTCTCCTTCCCCTAAAATCCAAAATGAACCGCTCCATCTCCTTA
CCACCCAACGCGGCCTTCCGACGCCTCTTCAGATCCCTATCGCTGATTTAAATGAAGAGACGAAGGAAGTGTCGCCGCCTTCTTCGTCTCACCTGGAGTTAACCCTCTCC
CCGCCTCAAGAGTCAGCGCCCTTTCATTTCTTGTCGCCTCGCCATGAACCTCAATATTCCAATCTTCCAACCCCACCAACTACCGAACCTTCTGCCCGCCCAACTCAACC
GCAGACAGAAGCAACCACCAGCACTCATCATCAAGAAGAACCGTATCACTTGCAGCACGCCTCTGCTCCGCTGCCGGTTGTTGATCTAAATTTGGATGATCTGCTGAGGT
ATTTGGATGATGGAATCCTTCACCCCATCATGGGAGATTTGGATGAAATAAGGCGCAAGGAGATGGAAAGTCTCCAGCGGCAACAAGAGTTAGCTCAACAAGTCTCACAA
TTGGGTCAGCAAGTGACTCAAATGGCGCAGCAACAAATGGAGCTTCGGAGTTTTGTTCAACGCCAAGCTCAGCGTCAGGACGAACAGTTCAGCACTTTGATGAACTATAT
TTATGAAGTGTTCGTCCAACGCATCCCAGCGCCGGTTATCCCTCCTTCCCTCCAACAACCCCTATCCAAATCCTCCCCTCGTCGACAACAATGA
Protein sequenceShow/hide protein sequence
MASQKPTSSKSPKPMASSSRQDSQNLSTTSTTPEPLAVVRPNSPELEATSPSSPRSRNLSEAQKVGRNDETTVFGDILDTIDEEDDDWLDGLVDQRMKNESRGSVLAVRG
LEESDSDERPIGKVLAAVEKKRFTKEKKVRPNRVREEEAEEDVLLLRRKNEKGKDAGTSEAVTPLVAASPKKRKEEIAAVEQSINEAARKAEFLRRIAEIATELEVEWEV
SDAEKPPKAERIKKNIKKIREEKKKEKEVAFIKEKLPKKKKEKEVVFTKEKLPERKKKNEVAKAPGVVIRDMDAGRTRRTPASPVKRNEKGKEKLVEEPTKEAAKSRNKQ
YSGLYTEVGFFPEPIELPAFIIQGVNALGWRQFCESGQVIQPTAVEAFYEGTIHRKAHMVKVKDEVISFEPQEINALFDLPNNAAAEGNKIMSTPTDAELNDALSIIAKP
GSEWNTSLKGIQSLAPNCLIAEANLWLYFIKRSLLPTTHDASISRDQAMVIYCIMQEIQLDVGRIIAPQIRGLFFKPRAQSLRQILKDSTNLLDAANPKKRPLKSQPSPP
QPPQPKKRKLVKKNFEIQASSSQPGEAEVPLDAYTQALTIYTPPSAPIPEEPSSPPTSPSPSPKIQNEPLHLLTTQRGLPTPLQIPIADLNEETKEVSPPSSSHLELTLS
PPQESAPFHFLSPRHEPQYSNLPTPPTTEPSARPTQPQTEATTSTHHQEEPYHLQHASAPLPVVDLNLDDLLRYLDDGILHPIMGDLDEIRRKEMESLQRQQELAQQVSQ
LGQQVTQMAQQQMELRSFVQRQAQRQDEQFSTLMNYIYEVFVQRIPAPVIPPSLQQPLSKSSPRRQQ