; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007904 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007904
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTF-B3 domain-containing protein
Genome locationChr10:17010475..17012030
RNA-Seq ExpressionHG10007904
SyntenyHG10007904
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003340 - B3 DNA binding domain
IPR005508 - B3 domain-containing protein At2g31720-like
IPR015300 - DNA-binding pseudobarrel domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043114.1 putative B3 domain-containing protein [Cucumis melo var. makuwa]1.1e-6357.83Show/hide
Query:  VRYTYNNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP-----------------NNNNENVAP-VADGARD
        V Y YNNKIV+KEVYE    + +M        +SSEP     NQFP STS S+V++QPP +NE PP                 NNNNENVA  VAD    
Subjt:  VRYTYNNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP-----------------NNNNENVAP-VADGARD

Query:  RIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLL
        R+ P+AVI+N+IG CS PFVKQLTK+D++D  GRL L KEFVNR+LVPMFN DE L +GI V V+DSEGR+Y+MTF++W+SKLYVLTK WK+FY  N L 
Subjt:  RIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLL

Query:  QSGEYVTVWMFRHVVNHKLCFAIIQGRVEG
        +  E++T+WMFRHVVNHKLCFAI++G VEG
Subjt:  QSGEYVTVWMFRHVVNHKLCFAIIQGRVEG

KAG6589940.1 putative B3 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]7.5e-4950Show/hide
Query:  VRYTYNNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGARDRIWPI-----AVIENVIG
        V Y +NN+I+EK+VYE A I + +        +  EP+     Q  GSTS  + +E     NEAP  N +E+VA   +   + + P+     AVI N+IG
Subjt:  VRYTYNNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGARDRIWPI-----AVIENVIG

Query:  ACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRH
        A S P+ K+LTKSD+S+G GRL ++KEFV  HL+PMFN DED   GIDV V++ EGRQY+M FK+W+SKLYVLTKGW +FY+TNNL   G YVT+WMFRH
Subjt:  ACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRH

Query:  VVNHKLCFAIIQGRVEGP
        V  HK+CFA+  G+   P
Subjt:  VVNHKLCFAIIQGRVEGP

KAG7023605.1 putative B3 domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.2e-4356.69Show/hide
Query:  NEAPPNNNNENVAPVADGARDRIWPI-----AVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNM
        NEAP  N +E+VA   +   + + P+     AVI N+IGA S P+ K+LTKSD+S+G GRL ++KEFV  HL+PMFN DED   GIDV V++ EGRQY+M
Subjt:  NEAPPNNNNENVAPVADGARDRIWPI-----AVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNM

Query:  TFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVNHKLCFAIIQGRVEGP
         FK+W+SKLYVLTKGW +FY+TNNL + G YVT+WMFRHV  HK+CFA+  G+   P
Subjt:  TFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVNHKLCFAIIQGRVEGP

KGN56101.1 hypothetical protein Csa_010492 [Cucumis sativus]3.1e-6360Show/hide
Query:  NVRYTYNNKIVEKEVYEGASIIYDMN-SKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP------------NNNNENVAPVADGARD--RI
        +V YTYNNKIV ++VY+ + I+YDM  SKSV+  E  EP  RF NQ   STSTS+VNEQ P +NE  P            N NNENVA      RD  R 
Subjt:  NVRYTYNNKIVEKEVYEGASIIYDMN-SKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP------------NNNNENVAPVADGARD--RI

Query:  WPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQS
         P+AVI+N+IG CS PFVKQLTK+D++D  GRL L KEFVN +L PMFNDDEDL  GI VIV+D EGR+Y+M FK+W+SKLYVLTK WKEFY+TN+L Q 
Subjt:  WPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQS

Query:  GEYVTVWMFRHVVNHKLCFAIIQGR
        GE+++VWMFRHVV  KLCFAI++G+
Subjt:  GEYVTVWMFRHVVNHKLCFAIIQGR

XP_031736688.1 uncharacterized protein LOC116402065 [Cucumis sativus]7.8e-6258.82Show/hide
Query:  VRYTYNNKIVEKEVYEGASIIYDM-NSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQP-------PAINEAPPNNNNENVAPVADGARD--RIWPIAVI
        VRYTYNNKIVEK VYE +  +  M NS+ +K  E  +P  R  N+F  STS SI+N+QP       P +NEA PNNNNENVA VA   RD  R+ P+AVI
Subjt:  VRYTYNNKIVEKEVYEGASIIYDM-NSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQP-------PAINEAPPNNNNENVAPVADGARD--RIWPIAVI

Query:  ENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTV
        +N+IG CS PFVKQL K+D+++  GRL L KEFV R+L+PMFN  E+L +GI V V+DSEGR+Y+M FK W+SKLYVLTK W +FY++NNL + GE+++V
Subjt:  ENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTV

Query:  WMFRHVVNHKLCFAIIQGRVE
        WMFRHVVN KLCFAI++G  E
Subjt:  WMFRHVVNHKLCFAIIQGRVE

TrEMBL top hitse value%identityAlignment
A0A0A0L7L0 TF-B3 domain-containing protein1.5e-6360Show/hide
Query:  NVRYTYNNKIVEKEVYEGASIIYDMN-SKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP------------NNNNENVAPVADGARD--RI
        +V YTYNNKIV ++VY+ + I+YDM  SKSV+  E  EP  RF NQ   STSTS+VNEQ P +NE  P            N NNENVA      RD  R 
Subjt:  NVRYTYNNKIVEKEVYEGASIIYDMN-SKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP------------NNNNENVAPVADGARD--RI

Query:  WPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQS
         P+AVI+N+IG CS PFVKQLTK+D++D  GRL L KEFVN +L PMFNDDEDL  GI VIV+D EGR+Y+M FK+W+SKLYVLTK WKEFY+TN+L Q 
Subjt:  WPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQS

Query:  GEYVTVWMFRHVVNHKLCFAIIQGR
        GE+++VWMFRHVV  KLCFAI++G+
Subjt:  GEYVTVWMFRHVVNHKLCFAIIQGR

A0A0A0LC65 TF-B3 domain-containing protein1.6e-4163.2Show/hide
Query:  IAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGE
        +AVI+N+IG CS PFVKQL K+D+++  GRL L KEFV R+L+PMFN  E+L +GI V V+DSEGR+Y+M FK W+SKLYVLTK W +FY++NNL + GE
Subjt:  IAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGE

Query:  YVTVWMFRHVVNHKLCFAIIQGRVE
        +++VWMFRHVVN KLCFAI++G  E
Subjt:  YVTVWMFRHVVNHKLCFAIIQGRVE

A0A2P6RSD2 Putative transcription factor B3-Domain family3.9e-2741.51Show/hide
Query:  GSTSTSIVNEQPPAINEAPPNNNNENVAPVADGARDRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIV
        G    ++  E+  A+NEA P           D     + P+  +  +I  CS PF KQLT SD+ D   RL + KE V +HL+P+ N+ EDL  GIDV  
Subjt:  GSTSTSIVNEQPPAINEAPPNNNNENVAPVADGARDRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIV

Query:  FDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVNHKLCFAI
        +D  G++Y M FK W SK++VLT GWK F + + L+Q+  +VTVWMFR++    LCF I
Subjt:  FDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVNHKLCFAI

A0A5D3BAS8 Putative B3 domain-containing protein5.2e-6457.83Show/hide
Query:  VRYTYNNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP-----------------NNNNENVAP-VADGARD
        V Y YNNKIV+KEVYE    + +M        +SSEP     NQFP STS S+V++QPP +NE PP                 NNNNENVA  VAD    
Subjt:  VRYTYNNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPP-----------------NNNNENVAP-VADGARD

Query:  RIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLL
        R+ P+AVI+N+IG CS PFVKQLTK+D++D  GRL L KEFVNR+LVPMFN DE L +GI V V+DSEGR+Y+MTF++W+SKLYVLTK WK+FY  N L 
Subjt:  RIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLL

Query:  QSGEYVTVWMFRHVVNHKLCFAIIQGRVEG
        +  E++T+WMFRHVVNHKLCFAI++G VEG
Subjt:  QSGEYVTVWMFRHVVNHKLCFAIIQGRVEG

A0A6J1CTE7 putative B3 domain-containing protein At4g031705.7e-3441.81Show/hide
Query:  NNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENV-----------------APV------ADGARD
        N K  + EVYE A  +Y M+  +        PT R      G T      ++   IN+   NNNN NV                 AP        DG   
Subjt:  NNKIVEKEVYEGASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENV-----------------APV------ADGARD

Query:  R-IWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFN---DDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQT
        R + P+  I +VIG CSR + K+LT SD+++  GRL + KEFV ++L PMFN   ++E+L  GI+V V++ EGR++ MTFKIW+SKLYVL KGWK F + 
Subjt:  R-IWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFN---DDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQT

Query:  NNL--LQSGEYVTVWMFRHVVNHKLCFAIIQG
        NNL    +GE++T+WMFRH   H LCFAII G
Subjt:  NNL--LQSGEYVTVWMFRHVVNHKLCFAIIQG

SwissProt top hitse value%identityAlignment
P82280 AP2/ERF and B3 domain-containing transcription repressor RAV24.1e-0533.33Show/hide
Query:  FVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIW-SSKLYVLTKGWKEFYQTNNLLQSGEYVT
        F K +T SD+   L RL + K+   +H  P+ +    +  G+ +   D  G+ +   +  W SS+ YVLTKGW  F +  N L++G+ VT
Subjt:  FVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIW-SSKLYVLTKGWKEFYQTNNLLQSGEYVT

Q9LM90 B3 domain-containing protein At1g206009.0e-1339.78Show/hide
Query:  SRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNNLLQSGEYVT
        SRP  KQL  SD+     RL L KE V   ++P   + ED + G+DV V+  +G    M FK+W+  K  VLT GW +F     LL++ ++VT
Subjt:  SRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNNLLQSGEYVT

Q9ZR14 Putative B3 domain-containing protein At4g031704.2e-1831.56Show/hide
Query:  ERGEDFDGRQNVRYTYNNKIVEKEVYEGASI-IYDMNSKSVKQS-------ESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGAR
        +R ED D   ++    ++++++ E YE AS+ +   +  + KQS       E ++P  RFK Q     +    +E+   + E       +      D   
Subjt:  ERGEDFDGRQNVRYTYNNKIVEKEVYEGASI-IYDMNSKSVKQS-------ESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGAR

Query:  DRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNN
         R++            SRP  KQL  SD+      L L KE V   ++P   D E+ + GIDV V+  +G+   M FK+W+  K  VLT GWK+F +   
Subjt:  DRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNN

Query:  LLQSGEYVTVWMFRHVVNHKLCFAI
        L  + ++VTVWMFRH+   KLCFAI
Subjt:  LLQSGEYVTVWMFRHVVNHKLCFAI

Q9ZR15 Putative B3 domain-containing protein At4g031603.1e-1335.51Show/hide
Query:  RPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVN
        R   +QL  SD+ D   RL L KE V + +  +   ++    G++V V+   G  + M  KIW     VLT GWK F ++  L +  +++T+WMFRH   
Subjt:  RPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVN

Query:  HKLCFAI
         ++CFAI
Subjt:  HKLCFAI

Arabidopsis top hitse value%identityAlignment
AT1G20600.1 AP2/B3-like transcriptional factor family protein6.4e-1439.78Show/hide
Query:  SRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNNLLQSGEYVT
        SRP  KQL  SD+     RL L KE V   ++P   + ED + G+DV V+  +G    M FK+W+  K  VLT GW +F     LL++ ++VT
Subjt:  SRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNNLLQSGEYVT

AT1G68840.1 related to ABI3/VP1 22.9e-0633.33Show/hide
Query:  FVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIW-SSKLYVLTKGWKEFYQTNNLLQSGEYVT
        F K +T SD+   L RL + K+   +H  P+ +    +  G+ +   D  G+ +   +  W SS+ YVLTKGW  F +  N L++G+ VT
Subjt:  FVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIW-SSKLYVLTKGWKEFYQTNNLLQSGEYVT

AT1G68840.2 related to ABI3/VP1 22.9e-0633.33Show/hide
Query:  FVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIW-SSKLYVLTKGWKEFYQTNNLLQSGEYVT
        F K +T SD+   L RL + K+   +H  P+ +    +  G+ +   D  G+ +   +  W SS+ YVLTKGW  F +  N L++G+ VT
Subjt:  FVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIW-SSKLYVLTKGWKEFYQTNNLLQSGEYVT

AT4G03160.1 BEST Arabidopsis thaliana protein match is: AP2/B3-like transcriptional factor family protein (TAIR:AT4G03170.1)2.2e-1435.51Show/hide
Query:  RPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVN
        R   +QL  SD+ D   RL L KE V + +  +   ++    G++V V+   G  + M  KIW     VLT GWK F ++  L +  +++T+WMFRH   
Subjt:  RPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVN

Query:  HKLCFAI
         ++CFAI
Subjt:  HKLCFAI

AT4G03170.1 AP2/B3-like transcriptional factor family protein3.0e-1931.56Show/hide
Query:  ERGEDFDGRQNVRYTYNNKIVEKEVYEGASI-IYDMNSKSVKQS-------ESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGAR
        +R ED D   ++    ++++++ E YE AS+ +   +  + KQS       E ++P  RFK Q     +    +E+   + E       +      D   
Subjt:  ERGEDFDGRQNVRYTYNNKIVEKEVYEGASI-IYDMNSKSVKQS-------ESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGAR

Query:  DRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNN
         R++            SRP  KQL  SD+      L L KE V   ++P   D E+ + GIDV V+  +G+   M FK+W+  K  VLT GWK+F +   
Subjt:  DRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRHLVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWS-SKLYVLTKGWKEFYQTNN

Query:  LLQSGEYVTVWMFRHVVNHKLCFAI
        L  + ++VTVWMFRH+   KLCFAI
Subjt:  LLQSGEYVTVWMFRHVVNHKLCFAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAGAAGATCACACAAAACCCAGTTGAACGTGAAGGAGAAGTCTATACACAAAACCCAATCGAACACGAAGGAGAAGACTATGGACAAAACGCAGTCGAACGCAG
AGGAGAAGGAAAAAAAAAGAAGCGGAAAAGAAAAATGAAAGAGAAGGTCCACAAGCTCCAGAGATTTGCAAAGAGAAGCTGTAGAGAGAGAACGAAGGAGAAGAATGAAA
GAGATGAAGATTTTGAGCGCCCTAATGAAAGAGGGGAGGATTTCGACGGGCGCCAAAATGTACGTTACACATACAATAACAAAATAGTTGAGAAAGAGGTCTACGAAGGT
GCTTCGATTATTTACGACATGAATAGTAAAAGCGTGAAGCAATCTGAGTCGTCAGAGCCGACAGGTCGTTTCAAAAATCAATTTCCAGGTTCAACTTCTACTTCCATAGT
AAACGAGCAGCCTCCTGCAATCAACGAAGCTCCACCTAACAACAACAATGAAAATGTGGCGCCAGTGGCCGATGGAGCTCGAGATAGAATTTGGCCAATAGCAGTGATCG
AAAATGTAATTGGAGCATGTAGTCGGCCATTTGTGAAGCAATTAACGAAAAGCGACATGTCAGATGGGCTAGGGCGATTGACACTTCAAAAGGAATTTGTGAATCGTCAT
TTGGTTCCAATGTTCAATGATGATGAGGATTTGATTCACGGGATTGATGTTATAGTTTTTGATAGTGAAGGGAGACAATACAACATGACGTTCAAGATTTGGTCTTCTAA
GCTTTATGTGCTCACGAAAGGTTGGAAGGAGTTTTACCAGACTAATAATTTATTGCAATCTGGTGAGTATGTCACCGTTTGGATGTTCCGACATGTCGTTAATCACAAGC
TTTGCTTTGCTATCATTCAAGGAAGGGTTGAAGGGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGAGAAGATCACACAAAACCCAGTTGAACGTGAAGGAGAAGTCTATACACAAAACCCAATCGAACACGAAGGAGAAGACTATGGACAAAACGCAGTCGAACGCAG
AGGAGAAGGAAAAAAAAAGAAGCGGAAAAGAAAAATGAAAGAGAAGGTCCACAAGCTCCAGAGATTTGCAAAGAGAAGCTGTAGAGAGAGAACGAAGGAGAAGAATGAAA
GAGATGAAGATTTTGAGCGCCCTAATGAAAGAGGGGAGGATTTCGACGGGCGCCAAAATGTACGTTACACATACAATAACAAAATAGTTGAGAAAGAGGTCTACGAAGGT
GCTTCGATTATTTACGACATGAATAGTAAAAGCGTGAAGCAATCTGAGTCGTCAGAGCCGACAGGTCGTTTCAAAAATCAATTTCCAGGTTCAACTTCTACTTCCATAGT
AAACGAGCAGCCTCCTGCAATCAACGAAGCTCCACCTAACAACAACAATGAAAATGTGGCGCCAGTGGCCGATGGAGCTCGAGATAGAATTTGGCCAATAGCAGTGATCG
AAAATGTAATTGGAGCATGTAGTCGGCCATTTGTGAAGCAATTAACGAAAAGCGACATGTCAGATGGGCTAGGGCGATTGACACTTCAAAAGGAATTTGTGAATCGTCAT
TTGGTTCCAATGTTCAATGATGATGAGGATTTGATTCACGGGATTGATGTTATAGTTTTTGATAGTGAAGGGAGACAATACAACATGACGTTCAAGATTTGGTCTTCTAA
GCTTTATGTGCTCACGAAAGGTTGGAAGGAGTTTTACCAGACTAATAATTTATTGCAATCTGGTGAGTATGTCACCGTTTGGATGTTCCGACATGTCGTTAATCACAAGC
TTTGCTTTGCTATCATTCAAGGAAGGGTTGAAGGGCCTTGA
Protein sequenceShow/hide protein sequence
MKEKITQNPVEREGEVYTQNPIEHEGEDYGQNAVERRGEGKKKKRKRKMKEKVHKLQRFAKRSCRERTKEKNERDEDFERPNERGEDFDGRQNVRYTYNNKIVEKEVYEG
ASIIYDMNSKSVKQSESSEPTGRFKNQFPGSTSTSIVNEQPPAINEAPPNNNNENVAPVADGARDRIWPIAVIENVIGACSRPFVKQLTKSDMSDGLGRLTLQKEFVNRH
LVPMFNDDEDLIHGIDVIVFDSEGRQYNMTFKIWSSKLYVLTKGWKEFYQTNNLLQSGEYVTVWMFRHVVNHKLCFAIIQGRVEGP