; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0865 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0865
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionethylene-responsive transcription factor ERF109-like
Genome locationMC01:14466434..14466925
RNA-Seq ExpressionMC01g0865
SyntenyMC01g0865
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595039.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia]6.38e-6562.11Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSS-GPTPTNHNNTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ---------------------
        D+APF     L VEEEHSI+VSALTHV+S GR     T       F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                     
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSS-GPTPTNHNNTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ---------------------

Query:  ---EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
           ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF TAEQAARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  ---EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

KAG7027062.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. argyrosperma]4.50e-6562.11Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSS-GPTPTNHNNTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ---------------------
        D+APF     L VEEEHSI+VSALTHV+S GR     T       F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                     
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSS-GPTPTNHNNTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ---------------------

Query:  ---EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
           ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF+TAEQAARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  ---EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

XP_022963153.1 ethylene-responsive transcription factor ERF109-like [Cucurbita moschata]7.82e-6662.83Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------
        D+APF     L VEEEHSI+VSALTHV+S GR  G T T +     F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                    
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------

Query:  ----EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
            ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF+TAEQAARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  ----EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

XP_023003418.1 ethylene-responsive transcription factor ERF109-like [Cucurbita maxima]6.88e-6663.64Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------
        D+ PF     L VEEEHSI+VSALTHV+S GR  G T T +     F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                    
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------

Query:  EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
        ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF+TAEQAARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

XP_023518712.1 ethylene-responsive transcription factor ERF109-like [Cucurbita pepo subsp. pepo]1.04e-6562.96Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------
        D+APF     L VEEEHSI+VSALTHV+S GR  G T T +     F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                    
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------

Query:  --EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
          ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF+TAE+AARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  --EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

TrEMBL top hitse value%identityAlignment
A0A1S3B0B5 ethylene-responsive transcription factor ERF109-like7.11e-5553.55Show/hide
Query:  FTSNSDTAPFRLPLPVEEEHSIIVSALTHVLSQGRSS---------GPTPTNHNNTFYNAAAPSTVWFPSG-DTCEVCGIVGCLGCNYFQE---------
        F+ N  TAPF   L VEEE+SI+VSALTHVL+  R             +  +HN+   N  + + +WFPS  D C+ C   GCLGCNYF++         
Subjt:  FTSNSDTAPFRLPLPVEEEHSIIVSALTHVLSQGRSS---------GPTPTNHNNTFYNAAAPSTVWFPSG-DTCEVCGIVGCLGCNYFQE---------

Query:  -DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRG-DKAKLNFPASDYE
         D+ +D K + ++++++K  FRGVRQR WGKWAAEIRDPRR  RVWLGTF TAE+AARAYDRAAIEFRG  +AKLNFPASDY+
Subjt:  -DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRG-DKAKLNFPASDYE

A0A5D3CM46 Ethylene-responsive transcription factor ERF109-like protein2.87e-5453.01Show/hide
Query:  FTSNSDTAPFRLPLPVEEEHSIIVSALTHVLSQGRSS---------GPTPTNHNNTFYNAAAPSTVWFPSG-DTCEVCGIVGCLGCNYFQE---------
        F+ N  TAPF   L VEEE+SI+VSALTHVL+  R             +  +HN+   N  + + +WFPS  D C+ C   GCLGCNYF++         
Subjt:  FTSNSDTAPFRLPLPVEEEHSIIVSALTHVLSQGRSS---------GPTPTNHNNTFYNAAAPSTVWFPSG-DTCEVCGIVGCLGCNYFQE---------

Query:  -DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRG-DKAKLNFPASDYE
         D+ +D K + ++++++K  FRGVRQR WGKWAAEIRDPRR  RVWLGTF TAE+AARAYDRAAI+FRG  +AKLNFPASDY+
Subjt:  -DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRG-DKAKLNFPASDYE

A0A6J1HGX3 ethylene-responsive transcription factor ERF109-like3.79e-6662.83Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------
        D+APF     L VEEEHSI+VSALTHV+S GR  G T T +     F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                    
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------

Query:  ----EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
            ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF+TAEQAARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  ----EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

A0A6J1KWF8 ethylene-responsive transcription factor ERF109-like3.33e-6663.64Show/hide
Query:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------
        D+ PF     L VEEEHSI+VSALTHV+S GR  G T T +     F +A  P+  +WFPSGDTCEVC IVGCLGCNYFQ                    
Subjt:  DTAPFR--LPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHN--NTFYNAAAPST-VWFPSGDTCEVCGIVGCLGCNYFQ--------------------

Query:  EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE
        ED+  DQ    +     KKRR+KR FRGVRQRPWGKWAAEIRDPRRATRVWLGTF+TAEQAARAYDRAAIEFRGDKAKLNFPASDY+
Subjt:  EDQSSDQKRRNE-----KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYE

A0A6P4AMT7 ethylene-responsive transcription factor ERF109-like4.52e-4952.13Show/hide
Query:  TSNSDTAPFR-LPLPVEEEHSIIVSALTHVLSQGR------------------SSGPTPTNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYFQED
        T+N    PF  + +  ++EHSI+VSAL H +S G                   SS P+ +  N    +     T+ FP GD C  C I GCLGCN+F   
Subjt:  TSNSDTAPFR-LPLPVEEEHSIIVSALTHVLSQGR------------------SSGPTPTNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYFQED

Query:  QSSDQKR------RNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYEHD
          +D KR       +E KRR K  +RGVRQRPWGKWAAEIRDPRRA RVWLGTF TAEQAARAYD+AAIEFRG +AKLNFP SDY HD
Subjt:  QSSDQKR------RNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYEHD

SwissProt top hitse value%identityAlignment
P93007 Ethylene-responsive transcription factor ERF1122.0e-2268.42Show/hide
Query:  DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        D  S      EK    +R +RGVRQRPWGKWAAEIRDP +A RVWLGTF+TAE+AA AYD+AA EFRG KAKLNFP
Subjt:  DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

Q70II3 Ethylene-responsive transcription factor ERF1101.8e-2348.97Show/hide
Query:  SIIVSALTHVLS-------QGRSSGPTPTNHNNTFYNA-AAPSTVWFPSGDTCEVCGIVGCLGCNYFQEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKW
        S +VSALT V+S       +G  S  +   H   +    +AP    F   D+            N  +E  S   K   E+ R  KR +RGVRQRPWGKW
Subjt:  SIIVSALTHVLS-------QGRSSGPTPTNHNNTFYNA-AAPSTVWFPSGDTCEVCGIVGCLGCNYFQEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKW

Query:  AAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        AAEIRDP RA RVWLGTF+TAE AARAYD AA+ FRG+KAKLNFP
Subjt:  AAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

Q7G1L2 Ethylene-responsive transcription factor RAP2-64.4e-2281.36Show/hide
Query:  REFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        +++RGVRQRPWGKWAAEIRDP +ATRVWLGTF TAE AARAYD AA+ FRG KAKLNFP
Subjt:  REFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

Q9LYU3 Ethylene-responsive transcription factor ERF1132.0e-2267.95Show/hide
Query:  QEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        QE   SDQ + ++ + R +R +RGVRQRPWGKWAAEIRDP++A RVWLGTF TAE+AA AYDRAA++F+G KAKLNFP
Subjt:  QEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

Q9SZ06 Ethylene-responsive transcription factor ERF1091.3e-3449.72Show/hide
Query:  LPVEEEHSIIVSALTHVLSQGRSSGPTP-TNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYF-------------QEDQSSDQKRRNE-------
        L  E+E S+IVSAL HV+S    + P    + ++T  +A  P        DTC+VC I GCLGCNYF             +E+ +S   RR E       
Subjt:  LPVEEEHSIIVSALTHVLSQGRSSGPTP-TNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYF-------------QEDQSSDQKRRNE-------

Query:  --------KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDY
                K++  K  +RGVRQRPWGK+AAEIRDP+RATRVWLGTF TAE AARAYDRAAI FRG +AKLNFP  DY
Subjt:  --------KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDY

Arabidopsis top hitse value%identityAlignment
AT2G33710.1 Integrase-type DNA-binding superfamily protein1.4e-2368.42Show/hide
Query:  DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        D  S      EK    +R +RGVRQRPWGKWAAEIRDP +A RVWLGTF+TAE+AA AYD+AA EFRG KAKLNFP
Subjt:  DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

AT2G33710.2 Integrase-type DNA-binding superfamily protein1.4e-2368.42Show/hide
Query:  DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        D  S      EK    +R +RGVRQRPWGKWAAEIRDP +A RVWLGTF+TAE+AA AYD+AA EFRG KAKLNFP
Subjt:  DQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

AT4G34410.1 redox responsive transcription factor 19.3e-3649.72Show/hide
Query:  LPVEEEHSIIVSALTHVLSQGRSSGPTP-TNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYF-------------QEDQSSDQKRRNE-------
        L  E+E S+IVSAL HV+S    + P    + ++T  +A  P        DTC+VC I GCLGCNYF             +E+ +S   RR E       
Subjt:  LPVEEEHSIIVSALTHVLSQGRSSGPTP-TNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYF-------------QEDQSSDQKRRNE-------

Query:  --------KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDY
                K++  K  +RGVRQRPWGK+AAEIRDP+RATRVWLGTF TAE AARAYDRAAI FRG +AKLNFP  DY
Subjt:  --------KKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDY

AT5G13330.1 related to AP2 6l1.4e-2367.95Show/hide
Query:  QEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        QE   SDQ + ++ + R +R +RGVRQRPWGKWAAEIRDP++A RVWLGTF TAE+AA AYDRAA++F+G KAKLNFP
Subjt:  QEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP

AT5G50080.1 ethylene response factor 1101.3e-2448.97Show/hide
Query:  SIIVSALTHVLS-------QGRSSGPTPTNHNNTFYNA-AAPSTVWFPSGDTCEVCGIVGCLGCNYFQEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKW
        S +VSALT V+S       +G  S  +   H   +    +AP    F   D+            N  +E  S   K   E+ R  KR +RGVRQRPWGKW
Subjt:  SIIVSALTHVLS-------QGRSSGPTPTNHNNTFYNA-AAPSTVWFPSGDTCEVCGIVGCLGCNYFQEDQSSDQKRRNEKKRRVKREFRGVRQRPWGKW

Query:  AAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP
        AAEIRDP RA RVWLGTF+TAE AARAYD AA+ FRG+KAKLNFP
Subjt:  AAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTCACTTCCAACTCCGATACGGCGCCGTTCCGGTTGCCACTCCCGGTCGAGGAGGAGCACTCCATCATCGTCTCTGCGCTCACCCACGTGCTCAGCCAAGGCAGAAGTAG
TGGGCCCACGCCCACCAACCACAACAACACATTTTATAACGCTGCTGCTCCTAGTACTGTCTGGTTTCCTAGTGGCGACACGTGCGAGGTGTGCGGGATTGTTGGGTGCT
TGGGATGCAACTACTTCCAGGAGGACCAAAGTAGTGATCAGAAGAGGAGGAATGAGAAGAAGAGGAGGGTGAAGAGAGAATTCAGGGGGGTCAGGCAGAGGCCGTGGGGG
AAATGGGCGGCCGAGATTCGAGACCCGCGGCGGGCCACGAGGGTGTGGCTCGGCACATTCAACACGGCCGAGCAAGCCGCAAGGGCCTACGACAGAGCTGCCATTGAGTT
CCGCGGAGATAAGGCAAAGCTCAACTTTCCAGCTTCTGACTACGAACACGAC
mRNA sequenceShow/hide mRNA sequence
TTCACTTCCAACTCCGATACGGCGCCGTTCCGGTTGCCACTCCCGGTCGAGGAGGAGCACTCCATCATCGTCTCTGCGCTCACCCACGTGCTCAGCCAAGGCAGAAGTAG
TGGGCCCACGCCCACCAACCACAACAACACATTTTATAACGCTGCTGCTCCTAGTACTGTCTGGTTTCCTAGTGGCGACACGTGCGAGGTGTGCGGGATTGTTGGGTGCT
TGGGATGCAACTACTTCCAGGAGGACCAAAGTAGTGATCAGAAGAGGAGGAATGAGAAGAAGAGGAGGGTGAAGAGAGAATTCAGGGGGGTCAGGCAGAGGCCGTGGGGG
AAATGGGCGGCCGAGATTCGAGACCCGCGGCGGGCCACGAGGGTGTGGCTCGGCACATTCAACACGGCCGAGCAAGCCGCAAGGGCCTACGACAGAGCTGCCATTGAGTT
CCGCGGAGATAAGGCAAAGCTCAACTTTCCAGCTTCTGACTACGAACACGAC
Protein sequenceShow/hide protein sequence
FTSNSDTAPFRLPLPVEEEHSIIVSALTHVLSQGRSSGPTPTNHNNTFYNAAAPSTVWFPSGDTCEVCGIVGCLGCNYFQEDQSSDQKRRNEKKRRVKREFRGVRQRPWG
KWAAEIRDPRRATRVWLGTFNTAEQAARAYDRAAIEFRGDKAKLNFPASDYEHD