; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015905 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015905
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein DOUBLE-STRAND BREAK FORMATION
Genome locationtig00006297:515286..519046
RNA-Seq ExpressionSgr015905
SyntenySgr015905
Gene Ontology termsGO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domainsIPR044969 - Protein DOUBLE-STRAND BREAK FORMATION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154860.1 uncharacterized protein LOC111022017 isoform X1 [Momordica charantia]1.7e-10784.68Show/hide
Query:  AEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFRELK
        +E FSLFR+RLRSRR DDSTL+ILEFVSVSKDVKSLIE KSRL+ELLRFES S+IRETVEKTDDQKLLVLEFLVRAFALVGD ESCLALRYEAL FRE+K
Subjt:  AEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFRELK

Query:  SSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAESTKKNS
        SSNQKWLQVSHVEWLNFAEHS+H+GF SIAIKAYE ALS LQQSDT NCTSH   KCVEV+EKI RLKDHALKSAASHSVQALTSEYLKKKV E  +K+S
Subjt:  SSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAESTKKNS

Query:  SFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHS
        SFCTRT FTASTLFRSGIRNHNA+KL EYQGL  F SESY +Q GD S
Subjt:  SFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHS

XP_022964954.1 uncharacterized protein LOC111464906 isoform X1 [Cucurbita moschata]5.8e-10381.5Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRR DDSTLRILEF S SKD  SL++VKS ++ELL FES S+IRETVEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALS LQQSDTAN TSHGS KC EV+EKIKRLKDHALKSA SHSVQALTSEYLKK+V E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY
         +K SS CTR KFTASTLFR+GIRNHNAK+LHEYQ L+G  SESYKIQ+ D SY
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY

XP_022970619.1 uncharacterized protein LOC111469552 isoform X1 [Cucurbita maxima]1.7e-10281.89Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRRFDDSTLRILEF S SKD    ++VKS ++ELLRFES S+IRETVEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALS LQQSDTAN TSHGS K  EV+EKIKRLKDHALKSA SHSVQALTSEYLKKKV E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY
         +K SS CTR KFTASTLFR+GIRNHNAKKLHEYQ L+G  SESYKIQ+ D SY
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY

XP_023520165.1 uncharacterized protein LOC111783465 isoform X2 [Cucurbita pepo subsp. pepo]5.2e-10481.89Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRRFDDSTLRILEF S SKD  SL++VKS ++ELLRFES S+IRETV+KTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIA+KAYEQALS LQQSDTAN TSHGS KC EV+EKIKRLKDH+LKSA SHSVQALTSEYLKKKV E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY
         +K SS CTR KFTASTLFR+GIRNHNAKKLHEYQ L+G  SESYKIQ+ D SY
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY

XP_038895344.1 protein DOUBLE-STRAND BREAK FORMATION isoform X1 [Benincasa hispida]2.3e-9979.13Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        MSCS AEQ+SLFR+RLRSRRFDDSTLRILEF   SKD  SL++VKS L+E LRFES S+IRET EKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FR LKS NQ WLQVSH EWLNFAEHSL AGFFSIAIKAYEQALS LQQ+DT N TSHGS K +EV+EKIKRLKDHAL+SA SHSVQALTSEYL KKV E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY
          K SS CTR K TASTLFR+G RNHNAKKLHEYQ L+G  SES+KIQ  D +Y
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY

TrEMBL top hitse value%identityAlignment
A0A6J1DKU7 uncharacterized protein LOC111022017 isoform X18.4e-10884.68Show/hide
Query:  AEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFRELK
        +E FSLFR+RLRSRR DDSTL+ILEFVSVSKDVKSLIE KSRL+ELLRFES S+IRETVEKTDDQKLLVLEFLVRAFALVGD ESCLALRYEAL FRE+K
Subjt:  AEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFRELK

Query:  SSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAESTKKNS
        SSNQKWLQVSHVEWLNFAEHS+H+GF SIAIKAYE ALS LQQSDT NCTSH   KCVEV+EKI RLKDHALKSAASHSVQALTSEYLKKKV E  +K+S
Subjt:  SSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAESTKKNS

Query:  SFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHS
        SFCTRT FTASTLFRSGIRNHNA+KL EYQGL  F SESY +Q GD S
Subjt:  SFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHS

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X26.0e-9881.82Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRR DDSTLRILEF S SKD  SL++VKS ++ELL FES S+IRETVEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALS LQQSDTAN TSHGS KC EV+EKIKRLKDHALKSA SHSVQALTSEYLKK+V E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFIS
         +K SS CTR KFTASTLFR+GIRNHNAK+LHEYQ L+G  S
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFIS

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X12.8e-10381.5Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRR DDSTLRILEF S SKD  SL++VKS ++ELL FES S+IRETVEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALS LQQSDTAN TSHGS KC EV+EKIKRLKDHALKSA SHSVQALTSEYLKK+V E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY
         +K SS CTR KFTASTLFR+GIRNHNAK+LHEYQ L+G  SESYKIQ+ D SY
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY

A0A6J1I136 uncharacterized protein LOC111469552 isoform X21.8e-9782.23Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRRFDDSTLRILEF S SKD    ++VKS ++ELLRFES S+IRETVEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALS LQQSDTAN TSHGS K  EV+EKIKRLKDHALKSA SHSVQALTSEYLKKKV E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFIS
         +K SS CTR KFTASTLFR+GIRNHNAKKLHEYQ L+G  S
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFIS

A0A6J1I645 uncharacterized protein LOC111469552 isoform X18.1e-10381.89Show/hide
Query:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY
        M CSVAEQ+SLF +RLRSRRFDDSTLRILEF S SKD    ++VKS ++ELLRFES S+IRETVEKTDDQKLLV+EFLVRAFALVGDIESCLALRYEAL 
Subjt:  MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALY

Query:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES
        FRELKS NQ  LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALS LQQSDTAN TSHGS K  EV+EKIKRLKDHALKSA SHSVQALTSEYLKKKV E 
Subjt:  FRELKSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAES

Query:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY
         +K SS CTR KFTASTLFR+GIRNHNAKKLHEYQ L+G  SESYKIQ+ D SY
Subjt:  TKKNSSFCTRTKFTASTLFRSGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSY

SwissProt top hitse value%identityAlignment
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION1.2e-3447.22Show/hide
Query:  VAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFREL
        +A+Q  LF  R++ RRFD+ +LRILE   V+ +VKS +EV+SRL++ +R ES  +  E   ++   KL VLEF  RAFAL+GD+ESCLA+RYEAL  R+L
Subjt:  VAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFREL

Query:  KSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHS
        KS +  WL VSH EW  FA  S+  GF SIA KA E AL  L++       S  +   ++  EK++RL+D A    +SHS
Subjt:  KSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHS

Arabidopsis top hitse value%identityAlignment
AT1G07060.1 unknown protein8.5e-3647.22Show/hide
Query:  VAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFREL
        +A+Q  LF  R++ RRFD+ +LRILE   V+ +VKS +EV+SRL++ +R ES  +  E   ++   KL VLEF  RAFAL+GD+ESCLA+RYEAL  R+L
Subjt:  VAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFREL

Query:  KSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHS
        KS +  WL VSH EW  FA  S+  GF SIA KA E AL  L++       S  +   ++  EK++RL+D A    +SHS
Subjt:  KSSNQKWLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTGTTCGGTTGCGGAGCAATTCTCTCTCTTTCGCGCACGGCTCAGGAGCCGAAGATTTGATGATTCTACTTTGCGAATTCTGGAATTTGTTTCCGTTTCCAAGGA
CGTGAAGTCGTTGATCGAAGTCAAATCCAGATTACAAGAGTTACTGAGATTTGAATCTCCATCTGTCATTCGAGAAACCGTCGAGAAAACTGATGATCAAAAGCTTCTAG
TCCTCGAATTTCTTGTTCGAGCTTTCGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATATGAGGCGTTGTATTTTCGGGAACTGAAGTCTTCTAATCAGAAA
TGGCTTCAAGTTTCACACGTGGAATGGTTAAACTTCGCTGAGCATTCATTGCATGCTGGCTTTTTTTCTATAGCCATAAAGGCATATGAGCAAGCACTGTCGCACCTTCA
GCAGAGTGATACTGCAAACTGCACATCACATGGTTCCTTTAAATGCGTGGAAGTTGTTGAAAAGATAAAGAGACTCAAAGATCATGCTCTGAAATCAGCTGCTTCCCATT
CTGTTCAGGCTCTCACATCTGAGTATTTGAAAAAGAAAGTAGCTGAAAGTACAAAAAAGAATTCTTCATTCTGCACAAGAACTAAGTTTACAGCAAGCACTCTATTCAGA
AGTGGTATCAGAAACCATAATGCAAAAAAGCTGCATGAATATCAGGGTTTGCAGGGGTTTATCAGTGAATCGTACAAAATTCAGCTCGGTGACCATTCCTACACATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCTGTTCGGTTGCGGAGCAATTCTCTCTCTTTCGCGCACGGCTCAGGAGCCGAAGATTTGATGATTCTACTTTGCGAATTCTGGAATTTGTTTCCGTTTCCAAGGA
CGTGAAGTCGTTGATCGAAGTCAAATCCAGATTACAAGAGTTACTGAGATTTGAATCTCCATCTGTCATTCGAGAAACCGTCGAGAAAACTGATGATCAAAAGCTTCTAG
TCCTCGAATTTCTTGTTCGAGCTTTCGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATATGAGGCGTTGTATTTTCGGGAACTGAAGTCTTCTAATCAGAAA
TGGCTTCAAGTTTCACACGTGGAATGGTTAAACTTCGCTGAGCATTCATTGCATGCTGGCTTTTTTTCTATAGCCATAAAGGCATATGAGCAAGCACTGTCGCACCTTCA
GCAGAGTGATACTGCAAACTGCACATCACATGGTTCCTTTAAATGCGTGGAAGTTGTTGAAAAGATAAAGAGACTCAAAGATCATGCTCTGAAATCAGCTGCTTCCCATT
CTGTTCAGGCTCTCACATCTGAGTATTTGAAAAAGAAAGTAGCTGAAAGTACAAAAAAGAATTCTTCATTCTGCACAAGAACTAAGTTTACAGCAAGCACTCTATTCAGA
AGTGGTATCAGAAACCATAATGCAAAAAAGCTGCATGAATATCAGGGTTTGCAGGGGTTTATCAGTGAATCGTACAAAATTCAGCTCGGTGACCATTCCTACACATAG
Protein sequenceShow/hide protein sequence
MSCSVAEQFSLFRARLRSRRFDDSTLRILEFVSVSKDVKSLIEVKSRLQELLRFESPSVIRETVEKTDDQKLLVLEFLVRAFALVGDIESCLALRYEALYFRELKSSNQK
WLQVSHVEWLNFAEHSLHAGFFSIAIKAYEQALSHLQQSDTANCTSHGSFKCVEVVEKIKRLKDHALKSAASHSVQALTSEYLKKKVAESTKKNSSFCTRTKFTASTLFR
SGIRNHNAKKLHEYQGLQGFISESYKIQLGDHSYT