; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017991 (gene) of Snake gourd v1 genome

Gene IDTan0017991
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein DOUBLE-STRAND BREAK FORMATION
Genome locationLG08:75262700..75267856
RNA-Seq ExpressionTan0017991
SyntenyTan0017991
Gene Ontology termsGO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domainsIPR044969 - Protein DOUBLE-STRAND BREAK FORMATION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022964954.1 uncharacterized protein LOC111464906 isoform X1 [Cucurbita moschata]2.1e-9873.99Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRR DDSTLRILEF S SKD  SLMD KS +KELL FES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS K A+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD
               V AL+SEYLK++VTERN+K SS CTR  TASTLFRNGIRNHNAKRLHEYQ LEG TS+SYK+Q  D
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD

XP_022970619.1 uncharacterized protein LOC111469552 isoform X1 [Cucurbita maxima]1.4e-9974.36Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRRFDDSTLRILEF S SKD    MD KS +KELLRFES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS KRA+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD
               V AL+SEYLK+KVTERN+K SS CTR  TASTLFRNGIRNHNAK+LHEYQ LEG TS+SYK+Q  D
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD

XP_022970621.1 uncharacterized protein LOC111469552 isoform X2 [Cucurbita maxima]2.5e-9675Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRRFDDSTLRILEF S SKD    MD KS +KELLRFES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS KRA+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTS
               V AL+SEYLK+KVTERN+K SS CTR  TASTLFRNGIRNHNAK+LHEYQ LEG TS
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTS

XP_023520165.1 uncharacterized protein LOC111783465 isoform X2 [Cucurbita pepo subsp. pepo]9.3e-9973.63Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRRFDDSTLRILEF S SKD  SLMD KS +KELLRFES S+IRET +KTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIA++AYEQALS LQQSDTANYTSHGS K A+V+EKIKRLK+ +LKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD
               V AL+SEYLK+KVTERN+K SS CTR  TASTLFRNGIRNHNAK+LHEYQ LEG TS+SYK+Q  D
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD

XP_038895344.1 protein DOUBLE-STRAND BREAK FORMATION isoform X1 [Benincasa hispida]7.1e-9973.63Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        MS S  EQ+SLFRSRL SRRFDDSTLRILEF   SKDA SLMD KS LKE LRFES S+IRETAEKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FR LKSFNQ WLQVSHAEWLNFAEHS+ AGFFSIAI+AYEQALS LQQ+DT NYTSHGS KR +V+EKIKRLK+ AL+SAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD
               V AL+SEYL +KVTERN K SS CTR  TASTLFRNG RNHNAK+LHEYQVLEG TS+S+K+Q  D
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD

TrEMBL top hitse value%identityAlignment
A0A1S3CL48 uncharacterized protein LOC103502216 isoform X11.4e-8971.94Show/hide
Query:  FSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFRELKSFN
        +SLF SRL SRRFDDSTLRILE    SKDATSL D KSS  ELLRFES S+IRETAEKTD+QKL+V EFLVRAFALVGDIESCLALRYEALNFR LKSFN
Subjt:  FSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFRELKSFN

Query:  QQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFLTLFV
        Q WLQVSHAEWLNFAEHS+HAGFFSIAI+AYEQALS LQQSDTANYTSHGSFK  +V+EKI RLK+ AL  +GSHS                       V
Subjt:  QQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFLTLFV

Query:  VALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEG
         AL+S+YLK+KVTER++K SS CTR  TASTLFRNGIRN+NA++LHEY+ + G
Subjt:  VALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEG

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X21.8e-9574.62Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRR DDSTLRILEF S SKD  SLMD KS +KELL FES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS K A+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTS
               V AL+SEYLK++VTERN+K SS CTR  TASTLFRNGIRNHNAKRLHEYQ LEG TS
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTS

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X11.0e-9873.99Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRR DDSTLRILEF S SKD  SLMD KS +KELL FES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS K A+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD
               V AL+SEYLK++VTERN+K SS CTR  TASTLFRNGIRNHNAKRLHEYQ LEG TS+SYK+Q  D
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD

A0A6J1I136 uncharacterized protein LOC111469552 isoform X21.2e-9675Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRRFDDSTLRILEF S SKD    MD KS +KELLRFES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS KRA+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTS
               V AL+SEYLK+KVTERN+K SS CTR  TASTLFRNGIRNHNAK+LHEYQ LEG TS
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTS

A0A6J1I645 uncharacterized protein LOC111469552 isoform X16.9e-10074.36Show/hide
Query:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN
        M  SV EQ+SLF SRL SRRFDDSTLRILEF S SKD    MD KS +KELLRFES S+IRET EKTD+QKL+V EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD
        FRELKSFNQ  LQVSHAEWLNFAEHS++AGFFSIAI+AYEQALS LQQSDTANYTSHGS KRA+V+EKIKRLK+ ALKSAGSHS                
Subjt:  FRELKSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICD

Query:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD
               V AL+SEYLK+KVTERN+K SS CTR  TASTLFRNGIRNHNAK+LHEYQ LEG TS+SYK+Q  D
Subjt:  FFFLTLFVVALSSEYLKRKVTERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGD

SwissProt top hitse value%identityAlignment
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION3.2e-3341.79Show/hide
Query:  VVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFREL
        + +Q  LF +R+  RRFD+ +LRILE   V+ +  S ++ +S L++ +R ES  +  E   ++   KL V EF  RAFAL+GD+ESCLA+RYEALN R+L
Subjt:  VVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFREL

Query:  KSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFL
        KS +  WL VSH+EW  FA  SM  GF SIA +A E AL  L++       S  +    D  EK++RL++ A     SHSG+F      L   +C+    
Subjt:  KSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFL

Query:  T
        T
Subjt:  T

Arabidopsis top hitse value%identityAlignment
AT1G07060.1 unknown protein2.2e-3441.79Show/hide
Query:  VVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFREL
        + +Q  LF +R+  RRFD+ +LRILE   V+ +  S ++ +S L++ +R ES  +  E   ++   KL V EF  RAFAL+GD+ESCLA+RYEALN R+L
Subjt:  VVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFREL

Query:  KSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFL
        KS +  WL VSH+EW  FA  SM  GF SIA +A E AL  L++       S  +    D  EK++RL++ A     SHSG+F      L   +C+    
Subjt:  KSFNQQWLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFL

Query:  T
        T
Subjt:  T


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCGTTCGGTTGTGGAGCAATTCTCTCTCTTTCGCTCGCGGCTCGGGAGCCGAAGATTTGATGATTCTACTTTGCGAATTCTGGAATTTCTTTCCGTTTCTAAAGA
CGCGACGTCGTTGATGGATGCCAAATCCAGCTTAAAGGAATTACTCAGATTTGAATCTCCATCTGTCATTCGTGAAACCGCTGAGAAAACTGATGAACAAAAGCTTATAG
TCTTCGAATTTCTCGTCCGAGCTTTCGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATACGAGGCCTTGAATTTTCGGGAACTGAAGTCTTTTAATCAGCAG
TGGCTTCAAGTTTCACACGCAGAATGGTTAAACTTCGCTGAGCATTCAATGCACGCTGGCTTTTTTTCAATTGCAATACAGGCATATGAGCAAGCGCTGTCGCGCCTTCA
GCAGAGTGATACTGCAAACTACACATCACATGGTTCCTTTAAACGCGCGGACGTTGTTGAAAAGATAAAGAGACTAAAAAATGATGCTCTGAAATCCGCTGGTTCCCATT
CTGGCCTCTTCTCGGTTGACAAATTGATTCTCATGGATGAGATTTGTGACTTCTTCTTTCTGACCCTCTTTGTTGTTGCTCTCTCATCTGAGTATTTGAAGAGGAAAGTA
ACTGAAAGGAACAAAAAGAATTCTTCACCCTGCACAAGAACTCTTACAGCAAGCACTCTATTCAGAAATGGTATCAGAAACCATAATGCGAAAAGGCTGCATGAATATCA
AGTTTTGGAGGGGTTCACCAGTCAATCCTACAAACTCCAGTGTGGTGATCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCGTTCGGTTGTGGAGCAATTCTCTCTCTTTCGCTCGCGGCTCGGGAGCCGAAGATTTGATGATTCTACTTTGCGAATTCTGGAATTTCTTTCCGTTTCTAAAGA
CGCGACGTCGTTGATGGATGCCAAATCCAGCTTAAAGGAATTACTCAGATTTGAATCTCCATCTGTCATTCGTGAAACCGCTGAGAAAACTGATGAACAAAAGCTTATAG
TCTTCGAATTTCTCGTCCGAGCTTTCGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATACGAGGCCTTGAATTTTCGGGAACTGAAGTCTTTTAATCAGCAG
TGGCTTCAAGTTTCACACGCAGAATGGTTAAACTTCGCTGAGCATTCAATGCACGCTGGCTTTTTTTCAATTGCAATACAGGCATATGAGCAAGCGCTGTCGCGCCTTCA
GCAGAGTGATACTGCAAACTACACATCACATGGTTCCTTTAAACGCGCGGACGTTGTTGAAAAGATAAAGAGACTAAAAAATGATGCTCTGAAATCCGCTGGTTCCCATT
CTGGCCTCTTCTCGGTTGACAAATTGATTCTCATGGATGAGATTTGTGACTTCTTCTTTCTGACCCTCTTTGTTGTTGCTCTCTCATCTGAGTATTTGAAGAGGAAAGTA
ACTGAAAGGAACAAAAAGAATTCTTCACCCTGCACAAGAACTCTTACAGCAAGCACTCTATTCAGAAATGGTATCAGAAACCATAATGCGAAAAGGCTGCATGAATATCA
AGTTTTGGAGGGGTTCACCAGTCAATCCTACAAACTCCAGTGTGGTGATCACTGATCAGCCCTACATATAGTCTTCCTACATACCCATCTAACTGGATAAATATCCCAGG
ATCCTGATGCGTCCTTTAGGCAATACATTACTGGACATTTTATCCAAAGATCAAGCTAACTGCTGCAAACATGTACTTTATTACTGGGATTCGATCGCAGAAGTTCGCTG
GCAAAATAAAAATGCTTCCACTGAAATTCCAGTTTCACTCCATTCAAGTAATTCAGTTCTATGCAAGTCTAAGTCGCCCACTCCCCTTCTGACAGGTCCCTACTATCAAT
AGTTTCCTCTCTATTCTTCATTATTCTAGTACTAATATTTTTCAATAGGACAGTCTTTCATGTGGTAAAATCCACTTGAAGTTCACATGGAAGCCTGCAAATTCCTCTTC
ATTCCTCCTTTTAGGACAATGACATGTCTATTTCCTGACCTTTTCCTCCCACAAATTGTTTAGGTTTAG
Protein sequenceShow/hide protein sequence
MSRSVVEQFSLFRSRLGSRRFDDSTLRILEFLSVSKDATSLMDAKSSLKELLRFESPSVIRETAEKTDEQKLIVFEFLVRAFALVGDIESCLALRYEALNFRELKSFNQQ
WLQVSHAEWLNFAEHSMHAGFFSIAIQAYEQALSRLQQSDTANYTSHGSFKRADVVEKIKRLKNDALKSAGSHSGLFSVDKLILMDEICDFFFLTLFVVALSSEYLKRKV
TERNKKNSSPCTRTLTASTLFRNGIRNHNAKRLHEYQVLEGFTSQSYKLQCGDH