; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G060600 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G060600
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptiontranscription elongation factor B polypeptide 3
Genome locationCicolChr04:5228995..5229705
RNA-Seq ExpressionCcUC04G060600
SyntenyCcUC04G060600
Gene Ontology termsGO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0070449 - elongin complex (cellular component)
GO:0035529 - NADH pyrophosphatase activity (molecular function)
GO:0047631 - ADP-ribose diphosphatase activity (molecular function)
GO:0051287 - NAD binding (molecular function)
InterPro domainsIPR010684 - RNA polymerase II transcription factor SIII, subunit A


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582214.1 hypothetical protein SDJN03_22216, partial [Cucurbita argyrosperma subsp. sororia]9.6e-5256.16Show/hide
Query:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG  DLN L+ ILPHCT+DQLMHIEN SKGRDLTPVT+KLWK FYEKKFG++  + V+ERMKH KESF WKQ+YE KM+ELE+KA +
Subjt:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK

Query:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV
        IE +YIQNC+ EK +K++RQ+  C  S                      +K+LKK K + + C+V S  NNK+  ++GG T        SKI KKA++E 
Subjt:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV

Query:  LTRIETKNLIAFRRNVIQK
           IETKNLIAFRRN IQK
Subjt:  LTRIETKNLIAFRRNVIQK

XP_022979565.1 uncharacterized protein LOC111479243 [Cucurbita maxima]1.1e-5055.25Show/hide
Query:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG  DLN L+ ILPHCT++QLM IENSSKGRDLTPVT+KLWK FYE+KFGK+  + V+E MKH KESF+WKQ+YE K++ELE+KA +
Subjt:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK

Query:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV
        IE +YIQNCQ EKA+K++RQ+  C  S                      +K+LKKSK   + C+V S+ +N K+   GGT         SKI KKA++E 
Subjt:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV

Query:  LTRIETKNLIAFRRNVIQK
        L  IETKN+IAFRRN +QK
Subjt:  LTRIETKNLIAFRRNVIQK

XP_022979571.1 uncharacterized protein LOC111479252 [Cucurbita maxima]1.5e-5255.71Show/hide
Query:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG  DLN L+ ILPHCT+DQLMHIEN SKGRDLTP+T+KLWK FYE+KFGK+D + V++RMKH KESF WKQ+YE K++ELE KA +
Subjt:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK

Query:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV
        +E +YI+NCQ EKA+K++RQ+  C  S                      +K+LKK K   + C+V S+ NN K+   GGT         SKI KKA++E 
Subjt:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV

Query:  LTRIETKNLIAFRRNVIQK
        L  IETKNLIAFRRN IQK
Subjt:  LTRIETKNLIAFRRNVIQK

XP_038885873.1 uncharacterized protein LOC120076176 [Benincasa hispida]4.6e-7870.04Show/hide
Query:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEW
        M EEVSKI +SFL++ SINEAIDSL+FLGDVG  DL++L+RILPHCT+DQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSD VI++MK+KKESF+W
Subjt:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEW

Query:  KQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGG-SSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK
        KQ+YEAKME LEKKA +IE +Y QNCQKE A+K++R+IIFC   SS  NKK+R EG +    CNT E+KILKK  RE Q C+V S          GGTTK
Subjt:  KQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGG-SSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK

Query:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK
        P H TK SKI KKAKRE L  IETKN+IAFRRN +QK
Subjt:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK

XP_038885879.1 uncharacterized protein LOC120076185 [Benincasa hispida]2.0e-7367.93Show/hide
Query:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEW
        M EEVSKI +SFL++ SINEAID+L+FL DVG  DL++L RILPHCT+DQL+HIENSSKGRDLT VTDKLWKNFY KKFGKND D  IERMK+KKESF+W
Subjt:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEW

Query:  KQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGG-SSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK
        KQ+YEAKME LEKKA +IE +Y QNCQKE A+K++R+IIFC   SS  NKK R EG +    CNT E+KILKK  RE Q C+V S          GGTTK
Subjt:  KQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGG-SSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK

Query:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK
        P H TK SKI KKAKRE L  IETKN+IAFRRN +QK
Subjt:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK

TrEMBL top hitse value%identityAlignment
A0A0A0KJR8 Nudix hydrolase domain-containing protein4.1e-4055.37Show/hide
Query:  NLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKK
        N  +N+AIDS+KFLGDVG  DL  L+ IL HCT DQL+HIEN SKGRDLTP+T+KLWKNFYE+KFGK+D D V+     K E+F+W  +Y AKM+ELE +
Subjt:  NLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKK

Query:  AKKIETQYIQNCQKEKAQKENRQIIFCGG-SSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKK
        AKKIE + IQ+ QKEKA+K++RQI+FCG   S ++ K     +   F  NT ++  LKK+KRE    +V ST +NK+
Subjt:  AKKIETQYIQNCQKEKAQKENRQIIFCGG-SSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKK

A0A1S4E1U0 transcription elongation factor B polypeptide 3 isoform X21.5e-3945.09Show/hide
Query:  LNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELE
        L +L +N+AID+++FLGDVG  D++LL+RILPHCT+DQLMH+E SS+GRDLTPVTDKLWK FYE++FGK  +  VIERM+ K+ +F W Q+YEAKM+++E
Subjt:  LNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELE

Query:  KKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKK
        K   K   +  Q+  KE A+K++RQI  C    P + K+ F G  + +G N   TK                                      +KI KK
Subjt:  KKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKK

Query:  AKREVLTRIETKNLIAFRRNVIQK
        AK EVL   E KN+ A+RRN +QK
Subjt:  AKREVLTRIETKNLIAFRRNVIQK

A0A5D3BGR1 Transcription elongation factor B polypeptide 3 isoform X21.5e-3945.09Show/hide
Query:  LNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELE
        L +L +N+AID+++FLGDVG  D++LL+RILPHCT+DQLMH+E SS+GRDLTPVTDKLWK FYE++FGK  +  VIERM+ K+ +F W Q+YEAKM+++E
Subjt:  LNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELE

Query:  KKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKK
        K   K   +  Q+  KE A+K++RQI  C    P + K+ F G  + +G N   TK                                      +KI KK
Subjt:  KKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKK

Query:  AKREVLTRIETKNLIAFRRNVIQK
        AK EVL   E KN+ A+RRN +QK
Subjt:  AKREVLTRIETKNLIAFRRNVIQK

A0A6J1IR58 uncharacterized protein LOC1114792527.1e-5355.71Show/hide
Query:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG  DLN L+ ILPHCT+DQLMHIEN SKGRDLTP+T+KLWK FYE+KFGK+D + V++RMKH KESF WKQ+YE K++ELE KA +
Subjt:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK

Query:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV
        +E +YI+NCQ EKA+K++RQ+  C  S                      +K+LKK K   + C+V S+ NN K+   GGT         SKI KKA++E 
Subjt:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV

Query:  LTRIETKNLIAFRRNVIQK
        L  IETKNLIAFRRN IQK
Subjt:  LTRIETKNLIAFRRNVIQK

A0A6J1ITM2 uncharacterized protein LOC1114792435.1e-5155.25Show/hide
Query:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG  DLN L+ ILPHCT++QLM IENSSKGRDLTPVT+KLWK FYE+KFGK+  + V+E MKH KESF+WKQ+YE K++ELE+KA +
Subjt:  INEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEELEKKAKK

Query:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV
        IE +YIQNCQ EKA+K++RQ+  C  S                      +K+LKKSK   + C+V S+ +N K+   GGT         SKI KKA++E 
Subjt:  IETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREV

Query:  LTRIETKNLIAFRRNVIQK
        L  IETKN+IAFRRN +QK
Subjt:  LTRIETKNLIAFRRNVIQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G42780.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684); Has 187 Blast hits to 186 proteins in 77 species: Archae - 0; Bacteria - 0; Metazoa - 104; Fungi - 29; Plants - 38; Viruses - 0; Other Eukaryotes - 16 (source: NCBI BLink).4.0e-2432.91Show/hide
Query:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMK-HKKESFE
        ++E ++K   S L +L + +AID++K++G VG  D  LL++IL HCT++QL HIE+++   DL+P+TDK WK FY+K +G+ D   +IE ++ +K   F+
Subjt:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMK-HKKESFE

Query:  WKQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK
        W+ +YE K+  +++K K++  +  +  + E  +K++RQ   C  + P                          SKR       P   N+    N G    
Subjt:  WKQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK

Query:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK
               S I KKAK ++L   E KNL A +RN IQK
Subjt:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK

AT2G42780.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684).4.0e-2432.91Show/hide
Query:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMK-HKKESFE
        ++E ++K   S L +L + +AID++K++G VG  D  LL++IL HCT++QL HIE+++   DL+P+TDK WK FY+K +G+ D   +IE ++ +K   F+
Subjt:  MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMK-HKKESFE

Query:  WKQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK
        W+ +YE K+  +++K K++  +  +  + E  +K++RQ   C  + P                          SKR       P   N+    N G    
Subjt:  WKQVYEAKMEELEKKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTK

Query:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK
               S I KKAK ++L   E KNL A +RN IQK
Subjt:  PRHNTKPSKIWKKAKREVLTRIETKNLIAFRRNVIQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGAAGAAGTAAGTAAAATAACTACATCATTTCTTAATAATCTTTCCATTAACGAAGCTATAGATAGTTTGAAGTTTCTTGGAGATGTTGGCGTTGCTGATTTAAA
TCTTCTAGACCGTATTTTGCCACATTGTACTATTGACCAATTGATGCATATAGAGAACTCTTCTAAAGGAAGGGATCTCACACCTGTGACCGACAAGTTGTGGAAAAACT
TTTATGAAAAGAAGTTTGGTAAGAACGATTCTGATTTTGTGATTGAGAGGATGAAACACAAGAAAGAATCATTTGAATGGAAGCAAGTGTATGAAGCAAAGATGGAAGAG
TTAGAAAAGAAGGCGAAGAAAATTGAGACTCAATATATACAAAACTGTCAAAAGGAAAAAGCTCAAAAAGAAAACCGTCAAATAATATTTTGTGGGGGTTCTTCTCCAAT
CAATAAGAAACAAAGATTTGAAGGAAAACTGAATGAGTTCGGATGCAATACCAACGAGACCAAGATTTTGAAGAAGTCAAAGAGAGAAGAACAAAGTTGCGAAGTTCCAT
CTACGATCAATAATAAGAAGAAACAAAACTTTGGAGGGACAACCAAACCTAGACACAATACTAAGCCAAGCAAGATATGGAAGAAGGCAAAGAGAGAAGTGTTGACTCGT
ATAGAGACGAAGAACCTAATAGCTTTTCGAAGAAATGTGATACAAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATGAAGAAGTAAGTAAAATAACTACATCATTTCTTAATAATCTTTCCATTAACGAAGCTATAGATAGTTTGAAGTTTCTTGGAGATGTTGGCGTTGCTGATTTAAA
TCTTCTAGACCGTATTTTGCCACATTGTACTATTGACCAATTGATGCATATAGAGAACTCTTCTAAAGGAAGGGATCTCACACCTGTGACCGACAAGTTGTGGAAAAACT
TTTATGAAAAGAAGTTTGGTAAGAACGATTCTGATTTTGTGATTGAGAGGATGAAACACAAGAAAGAATCATTTGAATGGAAGCAAGTGTATGAAGCAAAGATGGAAGAG
TTAGAAAAGAAGGCGAAGAAAATTGAGACTCAATATATACAAAACTGTCAAAAGGAAAAAGCTCAAAAAGAAAACCGTCAAATAATATTTTGTGGGGGTTCTTCTCCAAT
CAATAAGAAACAAAGATTTGAAGGAAAACTGAATGAGTTCGGATGCAATACCAACGAGACCAAGATTTTGAAGAAGTCAAAGAGAGAAGAACAAAGTTGCGAAGTTCCAT
CTACGATCAATAATAAGAAGAAACAAAACTTTGGAGGGACAACCAAACCTAGACACAATACTAAGCCAAGCAAGATATGGAAGAAGGCAAAGAGAGAAGTGTTGACTCGT
ATAGAGACGAAGAACCTAATAGCTTTTCGAAGAAATGTGATACAAAAGTAG
Protein sequenceShow/hide protein sequence
MYEEVSKITTSFLNNLSINEAIDSLKFLGDVGVADLNLLDRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKFGKNDSDFVIERMKHKKESFEWKQVYEAKMEE
LEKKAKKIETQYIQNCQKEKAQKENRQIIFCGGSSPINKKQRFEGKLNEFGCNTNETKILKKSKREEQSCEVPSTINNKKKQNFGGTTKPRHNTKPSKIWKKAKREVLTR
IETKNLIAFRRNVIQK