; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023391 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023391
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein DOUBLE-STRAND BREAK FORMATION
Genome locationchr7:47861734..47865843
RNA-Seq ExpressionLag0023391
SyntenyLag0023391
Gene Ontology termsGO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domainsIPR044969 - Protein DOUBLE-STRAND BREAK FORMATION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022964954.1 uncharacterized protein LOC111464906 isoform X1 [Cucurbita moschata]4.5e-9671.78Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RR DDS+LRILEF S SKD+ SLMDVKS +KELL FESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +  E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK++ TERNRK SSSC R +FTASTLFRNGIRNHNAK+LHEYQALEG T+E YK+Q  D
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

XP_022970619.1 uncharacterized protein LOC111469552 isoform X1 [Cucurbita maxima]6.3e-9872.82Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RRFDDS+LRILEF S SKD+   MDVKS +KELLRFESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +R E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK+K TERNRK SSSC R +FTASTLFRNGIRNHNAKKLHEYQALEG T+E YK+Q  D
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

XP_022970621.1 uncharacterized protein LOC111469552 isoform X2 [Cucurbita maxima]5.0e-9573.38Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RRFDDS+LRILEF S SKD+   MDVKS +KELLRFESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +R E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTN
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK+K TERNRK SSSC R +FTASTLFRNGIRNHNAKKLHEYQALEG T+
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTN

XP_023520165.1 uncharacterized protein LOC111783465 isoform X2 [Cucurbita pepo subsp. pepo]4.1e-9772.13Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RRFDDS+LRILEF S SKD+ SLMDVKS +KELLRFESLSIIRET +KTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    +KAYEQALS LQ SDTAN +SHGS +  E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
        VIEKIKRLKDH+LKSAGSHSVQ LTSEYLK+K TERNRK SSSC R +FTASTLFRNGIRNHNAKKLHEYQALEG T+E YK+Q  D
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

XP_038895344.1 protein DOUBLE-STRAND BREAK FORMATION isoform X1 [Benincasa hispida]3.1e-9771.78Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        MS SAAEQ+SLFRSRLR+RRFDDS+LRILEF   SKD+ SLMDVKS LKE LRFESLSIIRETAEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FR LKSFNQ WL+VSHAEWLNFAEHSL+AGFF IA                                    IKAYEQALS LQ +DT N +SHGS +R E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
        VIEKIKRLKDHAL+SAGSHSVQ LTSEYL +K TERN K SSSC R + TASTLFRNG RNHNAKKLHEYQ LEG T+E +K+QF D
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

TrEMBL top hitse value%identityAlignment
A0A6J1DKU7 uncharacterized protein LOC111022017 isoform X18.9e-9067.38Show/hide
Query:  AEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELK
        +E FSLFRSRLR+RR DDS+L+ILEF+SVSKD  SL++ KS LKELLRFESLSIIRET EKTDDQKLLVLEFLVRAFALVGD ESCLALRYEAL+FRE+K
Subjt:  AEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELK

Query:  SFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKI
        S NQ+WL+VSH EWLNFAEHS+ +GF  IA                                    IKAYE ALSRLQ SDT N +SH   +  EVIEKI
Subjt:  SFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKI

Query:  KRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
         RLKDHALKSA SHSVQ LTSEYLK+K TERNRKDSS C RT FTASTLFR+GIRNHNA+KL EYQ L  F +E Y +QFGD
Subjt:  KRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X21.7e-9372.3Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RR DDS+LRILEF S SKD+ SLMDVKS +KELL FESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +  E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTN
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK++ TERNRK SSSC R +FTASTLFRNGIRNHNAK+LHEYQALEG T+
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTN

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X12.2e-9671.78Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RR DDS+LRILEF S SKD+ SLMDVKS +KELL FESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +  E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK++ TERNRK SSSC R +FTASTLFRNGIRNHNAK+LHEYQALEG T+E YK+Q  D
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

A0A6J1I136 uncharacterized protein LOC111469552 isoform X22.4e-9573.38Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RRFDDS+LRILEF S SKD+   MDVKS +KELLRFESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +R E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTN
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK+K TERNRK SSSC R +FTASTLFRNGIRNHNAKKLHEYQALEG T+
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTN

A0A6J1I645 uncharacterized protein LOC111469552 isoform X13.1e-9872.82Show/hide
Query:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN
        M  S AEQ+SLF SRLR+RRFDDS+LRILEF S SKD+   MDVKS +KELLRFESLSIIRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEALN
Subjt:  MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALN

Query:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE
        FRELKSFNQ  L+VSHAEWLNFAEHSL AGFF IA                                    IKAYEQALS LQ SDTAN +SHGS +R E
Subjt:  FRELKSFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTE

Query:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD
        VIEKIKRLKDHALKSAGSHSVQ LTSEYLK+K TERNRK SSSC R +FTASTLFRNGIRNHNAKKLHEYQALEG T+E YK+Q  D
Subjt:  VIEKIKRLKDHALKSAGSHSVQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGD

SwissProt top hitse value%identityAlignment
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION9.4e-2838.6Show/hide
Query:  AEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELK
        A+Q  LF +R+++RRFD+ SLRILE   V+ +  S ++V+S L++ +R ES+ I  E   ++   KL VLEF  RAFAL+GD+ESCLA+RYEALN R+LK
Subjt:  AEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELK

Query:  SFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKI
        S +  WL VSH+EW  FA  S++ GF  IA                                     KA E AL  L+        S  + +  +  EK+
Subjt:  SFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKI

Query:  KRLKDHALKSAGSHS
        +RL+D A     SHS
Subjt:  KRLKDHALKSAGSHS

Arabidopsis top hitse value%identityAlignment
AT1G07060.1 unknown protein6.6e-2938.6Show/hide
Query:  AEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELK
        A+Q  LF +R+++RRFD+ SLRILE   V+ +  S ++V+S L++ +R ES+ I  E   ++   KL VLEF  RAFAL+GD+ESCLA+RYEALN R+LK
Subjt:  AEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELK

Query:  SFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKI
        S +  WL VSH+EW  FA  S++ GF  IA                                     KA E AL  L+        S  + +  +  EK+
Subjt:  SFNQQWLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKI

Query:  KRLKDHALKSAGSHS
        +RL+D A     SHS
Subjt:  KRLKDHALKSAGSHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAATTCTGCTGCGGAGCAATTCTCTCTCTTTCGCTCGCGGCTCAGAAACCGAAGATTCGATGATTCTTCTTTGCGAATTCTGGAATTTTTGTCCGTTTCCAAAGA
CTCGACGTCGTTGATGGATGTCAAATCCAGCTTAAAAGAATTACTCAGATTTGAATCTCTATCTATCATTCGTGAAACCGCTGAGAAAACTGATGATCAAAAGCTTTTAG
TCCTCGAATTTCTTGTTCGAGCTTTCGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATATGAGGCCTTGAATTTTCGGGAACTGAAGTCTTTTAATCAGCAA
TGGCTTGAAGTTTCACACGCGGAATGGTTAAACTTCGCTGAGCATTCGTTGCAGGCTGGCTTCTTTCCCATTGCCATAAAGGACAAGAAAACGGCCATTGCTGCTGATAA
TCTCTCTACCATATGGGCAACTAGCTCAATATGGATTAGTGTAAGAGAGGAACAGGGGTGGGGTTATGAAGACATTAAGGCATATGAGCAAGCACTGTCACGCCTTCAGC
TGAGTGATACTGCAAACAACTCATCACATGGTTCCTTTGAACGCACAGAAGTTATTGAGAAGATAAAGAGACTCAAAGATCATGCTCTGAAATCTGCTGGTTCCCATTCT
GTTCAAGTTCTCACATCTGAGTATTTGAAAAGGAAAGGAACAGAAAGGAACAGAAAGGATTCTTCATCCTGCAGAAGAACTCAGTTTACAGCAAGCACTCTATTCAGAAA
TGGTATCAGAAACCATAACGCGAAAAAGCTGCATGAATATCAAGCTTTGGAAGGGTTCACCAATGAATGGTACAAAGTCCAGTTTGGCGATCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCAATTCTGCTGCGGAGCAATTCTCTCTCTTTCGCTCGCGGCTCAGAAACCGAAGATTCGATGATTCTTCTTTGCGAATTCTGGAATTTTTGTCCGTTTCCAAAGA
CTCGACGTCGTTGATGGATGTCAAATCCAGCTTAAAAGAATTACTCAGATTTGAATCTCTATCTATCATTCGTGAAACCGCTGAGAAAACTGATGATCAAAAGCTTTTAG
TCCTCGAATTTCTTGTTCGAGCTTTCGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATATGAGGCCTTGAATTTTCGGGAACTGAAGTCTTTTAATCAGCAA
TGGCTTGAAGTTTCACACGCGGAATGGTTAAACTTCGCTGAGCATTCGTTGCAGGCTGGCTTCTTTCCCATTGCCATAAAGGACAAGAAAACGGCCATTGCTGCTGATAA
TCTCTCTACCATATGGGCAACTAGCTCAATATGGATTAGTGTAAGAGAGGAACAGGGGTGGGGTTATGAAGACATTAAGGCATATGAGCAAGCACTGTCACGCCTTCAGC
TGAGTGATACTGCAAACAACTCATCACATGGTTCCTTTGAACGCACAGAAGTTATTGAGAAGATAAAGAGACTCAAAGATCATGCTCTGAAATCTGCTGGTTCCCATTCT
GTTCAAGTTCTCACATCTGAGTATTTGAAAAGGAAAGGAACAGAAAGGAACAGAAAGGATTCTTCATCCTGCAGAAGAACTCAGTTTACAGCAAGCACTCTATTCAGAAA
TGGTATCAGAAACCATAACGCGAAAAAGCTGCATGAATATCAAGCTTTGGAAGGGTTCACCAATGAATGGTACAAAGTCCAGTTTGGCGATCACTGA
Protein sequenceShow/hide protein sequence
MSNSAAEQFSLFRSRLRNRRFDDSSLRILEFLSVSKDSTSLMDVKSSLKELLRFESLSIIRETAEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALNFRELKSFNQQ
WLEVSHAEWLNFAEHSLQAGFFPIAIKDKKTAIAADNLSTIWATSSIWISVREEQGWGYEDIKAYEQALSRLQLSDTANNSSHGSFERTEVIEKIKRLKDHALKSAGSHS
VQVLTSEYLKRKGTERNRKDSSSCRRTQFTASTLFRNGIRNHNAKKLHEYQALEGFTNEWYKVQFGDH