; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022558 (gene) of Snake gourd v1 genome

Gene IDTan0022558
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHTH myb-type domain-containing protein
Genome locationLG01:103289210..103291238
RNA-Seq ExpressionTan0022558
SyntenyTan0022558
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]7.2e-9075.3Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAG NAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-9075.71Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

XP_022937359.1 uncharacterized protein LOC111443670 isoform X1 [Cucurbita moschata]2.5e-9075.71Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

XP_022937362.1 uncharacterized protein LOC111443670 isoform X2 [Cucurbita moschata]2.5e-9075.71Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

XP_023534838.1 uncharacterized protein LOC111796458 isoform X2 [Cucurbita pepo subsp. pepo]2.7e-8975.3Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAG NAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087031.5e-8574.7Show/hide
Query:  MIERKEKRKKGTISKEDS-STLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIE
        MIERKEK+KKG IS ED  STLLERYSVRTILTLLREVAQ SE RIDWDKL K TSTGISN REYQ+LWRHLAYRHTLLENMDC+T PLDDDSDLDFEIE
Subjt:  MIERKEKRKKGTISKEDS-STLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIE

Query:  AFPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT----------------------------------RQPI-STPS-ATEVFDVNGAAGSN
        +FPSV++ES NEAAA VKVLIAN IPSESDIPSSS VEAPLT                                  RQPI STP+ +TEVFDVNGAAG N
Subjt:  AFPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT----------------------------------RQPI-STPS-ATEVFDVNGAAGSN

Query:  AASRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        AASRKRRKPWSKAED ELIAAVQK GEGNWANILK DFKG+RTASQLSQ
Subjt:  AASRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X21.2e-9075.71Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X11.2e-9075.71Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDSS +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK ED EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X21.3e-8974.9Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDS  +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK +D EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X11.3e-8974.9Show/hide
Query:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA
        MIE KEK+KKGTIS EDS  +LERYSVRTI TLLREVA  SE RIDWDKL K TSTGISNVREYQLLWRHLAYRHTLLEN+D VTDPLD DSDLDFEIE 
Subjt:  MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEA

Query:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA
        FPSVSNES NEAAACVKVLIANGIPSESD+PSSS VEAPLT                                   RQP+ TPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------------------------------RQPISTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        SRKRRKPWSK +D EL+AAV+KYGEGNWANILK DFKGDRTASQLSQ
Subjt:  SRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein3.1e-5149.79Show/hide
Query:  RKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSNE
        R+K  I++ D +TLL RY + TIL +L+E++  SE ++DW+ L KKT+TGI+N REYQLLWRHL+YRH LL   D    PLDDDSD++ E+EA P+VS+E
Subjt:  RKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSNE

Query:  SSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------RQPISTP-----------------SATEVFDVNGAAGSNAASRKRRKPWSKAEDA
        +S EA A VKV+ A+ + SESDI   S VEAPLT           ++P  +P                 ++TE  + NG+AG + A R++RK WS  ED 
Subjt:  SSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------RQPISTP-----------------SATEVFDVNGAAGSNAASRKRRKPWSKAEDA

Query:  ELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        EL AAV++ GEGNWA+I+K DF+G+RTASQLSQ
Subjt:  ELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

AT1G09710.2 Homeodomain-like superfamily protein3.1e-5149.79Show/hide
Query:  RKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSNE
        R+K  I++ D +TLL RY + TIL +L+E++  SE ++DW+ L KKT+TGI+N REYQLLWRHL+YRH LL   D    PLDDDSD++ E+EA P+VS+E
Subjt:  RKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSNE

Query:  SSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------RQPISTP-----------------SATEVFDVNGAAGSNAASRKRRKPWSKAEDA
        +S EA A VKV+ A+ + SESDI   S VEAPLT           ++P  +P                 ++TE  + NG+AG + A R++RK WS  ED 
Subjt:  SSNEAAACVKVLIANGIPSESDIPSSSEVEAPLT-----------RQPISTP-----------------SATEVFDVNGAAGSNAASRKRRKPWSKAEDA

Query:  ELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        EL AAV++ GEGNWA+I+K DF+G+RTASQLSQ
Subjt:  ELIAAVQKYGEGNWANILKEDFKGDRTASQLSQ

AT1G58220.1 Homeodomain-like superfamily protein5.0e-4950Show/hide
Query:  KRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSN
        K++K  IS+ D +TLL+RY   TIL LL+E+A  +E +++W++L KKTSTGI++ REYQLLWRHLAYR +L+  +      LDDDSD++ E+EA P VS 
Subjt:  KRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSN

Query:  ESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLTRQ--------------------------PISTPSATEVFDVNGAAGSNAASRKRRKPWSKAEDAE
        +   EA A VKV+ A+ +PSESDIP  S VEAPLT                            P+  P A E  + NG A S+ A RKRRK WS  ED E
Subjt:  ESSNEAAACVKVLIANGIPSESDIPSSSEVEAPLTRQ--------------------------PISTPSATEVFDVNGAAGSNAASRKRRKPWSKAEDAE

Query:  LIAAVQKYGEGNWANILKEDFKGDRTASQLSQ
        LIAAV+++GEG+WA I KE+F+G+RTASQLSQ
Subjt:  LIAAVQKYGEGNWANILKEDFKGDRTASQLSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAGAGGAAAGAGAAGCGAAAGAAAGGGACAATTAGTAAGGAAGATAGTTCCACTCTATTGGAAAGGTATTCAGTAAGGACGATACTGACATTGCTTCGAGAGGT
CGCCCAGGCTTCCGAAGGGAGAATTGATTGGGACAAGTTGGCGAAGAAGACGTCGACTGGAATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATC
GTCACACGTTACTGGAGAACATGGATTGTGTTACAGATCCACTGGATGATGATAGTGACTTAGATTTTGAAATTGAAGCTTTTCCATCTGTCAGCAACGAGTCCTCGAAT
GAAGCTGCAGCATGTGTGAAGGTATTGATTGCTAATGGTATACCAAGTGAGTCAGATATTCCAAGTAGTTCTGAAGTTGAGGCCCCATTGACTAGACAGCCTATTTCAAC
CCCATCAGCAACCGAAGTATTTGACGTGAATGGAGCAGCTGGTAGTAATGCAGCTTCTCGAAAAAGAAGAAAACCCTGGTCAAAGGCAGAGGATGCAGAATTGATTGCTG
CCGTGCAAAAGTATGGTGAAGGGAACTGGGCGAATATCTTGAAAGAAGACTTCAAGGGGGATAGAACAGCTTCACAGCTATCTCAGGTGTTCTTACATACTTTATTTCCC
AATGCAATGAACTCTGTTTTTGTACATTTTGCTCTATTCTCGACCATTTCTTTTAATTTAATCAATTATCCTATATTATATCAATTTCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAGAGGAAAGAGAAGCGAAAGAAAGGGACAATTAGTAAGGAAGATAGTTCCACTCTATTGGAAAGGTATTCAGTAAGGACGATACTGACATTGCTTCGAGAGGT
CGCCCAGGCTTCCGAAGGGAGAATTGATTGGGACAAGTTGGCGAAGAAGACGTCGACTGGAATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATC
GTCACACGTTACTGGAGAACATGGATTGTGTTACAGATCCACTGGATGATGATAGTGACTTAGATTTTGAAATTGAAGCTTTTCCATCTGTCAGCAACGAGTCCTCGAAT
GAAGCTGCAGCATGTGTGAAGGTATTGATTGCTAATGGTATACCAAGTGAGTCAGATATTCCAAGTAGTTCTGAAGTTGAGGCCCCATTGACTAGACAGCCTATTTCAAC
CCCATCAGCAACCGAAGTATTTGACGTGAATGGAGCAGCTGGTAGTAATGCAGCTTCTCGAAAAAGAAGAAAACCCTGGTCAAAGGCAGAGGATGCAGAATTGATTGCTG
CCGTGCAAAAGTATGGTGAAGGGAACTGGGCGAATATCTTGAAAGAAGACTTCAAGGGGGATAGAACAGCTTCACAGCTATCTCAGGTGTTCTTACATACTTTATTTCCC
AATGCAATGAACTCTGTTTTTGTACATTTTGCTCTATTCTCGACCATTTCTTTTAATTTAATCAATTATCCTATATTATATCAATTTCAGTAA
Protein sequenceShow/hide protein sequence
MIERKEKRKKGTISKEDSSTLLERYSVRTILTLLREVAQASEGRIDWDKLAKKTSTGISNVREYQLLWRHLAYRHTLLENMDCVTDPLDDDSDLDFEIEAFPSVSNESSN
EAAACVKVLIANGIPSESDIPSSSEVEAPLTRQPISTPSATEVFDVNGAAGSNAASRKRRKPWSKAEDAELIAAVQKYGEGNWANILKEDFKGDRTASQLSQVFLHTLFP
NAMNSVFVHFALFSTISFNLINYPILYQFQ