; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G009600 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G009600
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDSBA domain-containing protein
Genome locationchr07:12351663..12355090
RNA-Seq ExpressionLsi07G009600
SyntenyLsi07G009600
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR001853 - DSBA-like thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649326.1 hypothetical protein Csa_014666 [Cucumis sativus]3.9e-13695.62Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        +LRIQLPLRFVSPNRLPIPS Q FQGECK+IRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFEL WHPFQLNP+APKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K EYYRSKFGIQSEQMEARMAEVFRGLGLDYD SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
        ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

XP_022955347.1 uncharacterized protein LOC111457403 [Cucurbita moschata]1.2e-12992.03Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        MLRIQLPL+FVSP  LPI SPQ FQG+CKYIR MAESVGSR+MDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQY FELKWHPFQLNPSAPKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K+E+YRSKFGIQSEQMEARM EVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELCLGYFTQGKYIGDRDFLLECA KA VEGAAEFLES
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
         D GVKEVKEELEKYSGKISGVPFYVINGKHKL+GAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

XP_022980705.1 uncharacterized protein LOC111479994 [Cucurbita maxima]1.9e-13092.43Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        MLRIQLPL+FVSP  LPI SPQ FQG+CKYIR MAESVGSR+MDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQY FELKWHPFQLNPSAPKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K+E+YRSKFGIQSEQMEARM EVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELCLGYFTQGKYIGDRDFLLECARKA VEGAAEFLES
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
         D GVKEVKEELEKYSGKISGVPFYVINGKHKL+GAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

XP_023526218.1 uncharacterized protein LOC111789767 [Cucurbita pepo subsp. pepo]1.9e-13092.43Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        MLRIQLPL+FVSP  LPI SPQ FQG+CKYIR MAESVGSR+MDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQY FELKWHPFQLNPSAPKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K+E+YRSKFGIQSEQMEARM EVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELCLGYFTQGKYIGDRDFLLECARKA VEGAAEFLES
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
         D GVKEVKEELEKYSGKISGVPFYVINGKHKL+GAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

XP_038900410.1 uncharacterized protein YwbO [Benincasa hispida]3.4e-13294.38Show/hide
Query:  RIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKR
        RIQLPLR VS   LPIPS Q FQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQY+FELKWHPFQLNPSAPKEG+VKR
Subjt:  RIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKR

Query:  EYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESAD
        EYYRSKFGIQS+QMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDR+FLLECARKAGVEGAAEFLES D
Subjt:  EYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESAD

Query:  NGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
        NGVKE+KEEL+KYSGKISGVPFYVINGKHKLSGAQPPEVF+RAFQVAGK
Subjt:  NGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

TrEMBL top hitse value%identityAlignment
A0A0A0KZG9 DSBA domain-containing protein1.9e-13695.62Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        +LRIQLPLRFVSPNRLPIPS Q FQGECK+IRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFEL WHPFQLNP+APKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K EYYRSKFGIQSEQMEARMAEVFRGLGLDYD SGLTGNTL+SHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
        ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

A0A5A7U4Z3 DSBA domain-containing protein4.8e-12496.9Show/hide
Query:  GECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFR
        GECK+IRIM ESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFEL WHPFQLNPSAPKEGVVK EYYRSKFGIQSEQMEARMAEVFR
Subjt:  GECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFR

Query:  GLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYSGKISGVPFY
        GLGLDYD SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLE+ADNGVKEVKEELEKYSGKISGVPFY
Subjt:  GLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYSGKISGVPFY

Query:  VINGKHKLSGAQPPEVFVRAFQVAGK
        VINGKHKLSGAQPPEVF+RAFQVAGK
Subjt:  VINGKHKLSGAQPPEVFVRAFQVAGK

A0A6J1DQG1 uncharacterized protein LOC1110221671.1e-12890.44Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        M RIQLPLRFVSPNR  I + ++FQGE KY RIMAESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQYDFELKWHPFQLNPSAPKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        KRE+YR+KFGIQSEQME+RMAEVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR+FLLECARKAGVEGAAEFL S
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
         DNGV +VKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

A0A6J1GTB1 uncharacterized protein LOC1114574035.9e-13092.03Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        MLRIQLPL+FVSP  LPI SPQ FQG+CKYIR MAESVGSR+MDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQY FELKWHPFQLNPSAPKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K+E+YRSKFGIQSEQMEARM EVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELCLGYFTQGKYIGDRDFLLECA KA VEGAAEFLES
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
         D GVKEVKEELEKYSGKISGVPFYVINGKHKL+GAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

A0A6J1IXA2 uncharacterized protein LOC1114799949.1e-13192.43Show/hide
Query:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV
        MLRIQLPL+FVSP  LPI SPQ FQG+CKYIR MAESVGSR+MDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQY FELKWHPFQLNPSAPKEGVV
Subjt:  MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVV

Query:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES
        K+E+YRSKFGIQSEQMEARM EVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELCLGYFTQGKYIGDRDFLLECARKA VEGAAEFLES
Subjt:  KREYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLES

Query:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK
         D GVKEVKEELEKYSGKISGVPFYVINGKHKL+GAQPPEVF+RAFQVAGK
Subjt:  ADNGVKEVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFVRAFQVAGK

SwissProt top hitse value%identityAlignment
P39598 Uncharacterized protein YwbO5.9e-1026.96Show/hide
Query:  IQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPS-APKEGVVK--REYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSH
        + I + SD VCP+CFVGK   ++AI       D E++W PF+L PS +P+   V    + Y  +  IQ       MAE    LG++ +   ++ +     
Subjt:  IQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPS-APKEGVVK--REYYRSKFGIQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSH

Query:  KLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYS---GKISGVPFYVINGKHKLSGAQPPEV
                +   K H+    +   +F + + IGD D L + A + G++GA+          ++V+ +  K++     I+ VP ++I G   + GA   +V
Subjt:  KLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYS---GKISGVPFYVINGKHKLSGAQPPEV

Query:  FVRA
        F +A
Subjt:  FVRA

Arabidopsis top hitse value%identityAlignment
AT5G38900.1 Thioredoxin superfamily protein1.0e-8167.13Show/hide
Query:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDM
        MAES  S    KKLIQID+SSD+VCPWCFVGKKNLDKAI AS+DQY+FE++W PF L+PSAPKEGV K+E+Y  K+G + + M ARM+EVF+GLGL++D 
Subjt:  MAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKREYYRSKFGIQSEQMEARMAEVFRGLGLDYDM

Query:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYSGKISGVPFYVINGKHKL
        +GLTGN+LDSH+LI+  G+Q   KQH LVEEL +GYFTQGK+IGDR+FL+E A K G+EGA EFL   +NGV EVKEEL KYS  I+GVP Y INGK KL
Subjt:  SGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFVRAFQVA
        SGAQPPE F  AF+ A
Subjt:  SGAQPPEVFVRAFQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCGCATTCAACTGCCACTGCGGTTCGTTTCTCCGAACAGACTCCCAATTCCATCCCCTCAACTTTTCCAGGGAGAGTGCAAATATATCAGGATCATGGCTGAGTC
AGTTGGGAGTAGGAACATGGATAAAAAGCTTATACAAATCGATATAAGCTCCGACACTGTTTGCCCGTGGTGCTTTGTTGGCAAAAAGAATCTTGACAAAGCAATATCTG
CTTCTCAGGATCAATATGATTTTGAGTTGAAGTGGCATCCGTTCCAGCTCAACCCTTCTGCCCCAAAGGAAGGTGTTGTTAAGAGGGAATATTACAGGAGTAAGTTTGGA
ATTCAATCAGAACAGATGGAAGCGCGGATGGCAGAGGTGTTTAGGGGTCTTGGACTGGATTATGACATGTCTGGGCTGACGGGAAATACTCTAGACAGCCACAAGCTTAT
ATATTTGGCAGGTCAACAAGGTTTAGGTAAACAGCATGATCTTGTGGAGGAGTTGTGCCTTGGATACTTCACTCAGGGAAAATACATTGGTGACCGGGACTTTCTTTTGG
AATGTGCTAGAAAGGCGGGGGTAGAAGGAGCAGCAGAGTTTCTCGAGTCAGCTGATAATGGAGTTAAGGAGGTCAAGGAGGAGCTTGAGAAGTACTCAGGAAAAATTTCA
GGAGTTCCCTTTTATGTTATCAATGGGAAGCACAAGTTGAGTGGTGCTCAACCCCCTGAGGTTTTTGTAAGAGCTTTTCAAGTGGCAGGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
TTGAAAATGGAGAAGCTAAAAATATCGCCATGCTCCGCATTCAACTGCCACTGCGGTTCGTTTCTCCGAACAGACTCCCAATTCCATCCCCTCAACTTTTCCAGGGAGAG
TGCAAATATATCAGGATCATGGCTGAGTCAGTTGGGAGTAGGAACATGGATAAAAAGCTTATACAAATCGATATAAGCTCCGACACTGTTTGCCCGTGGTGCTTTGTTGG
CAAAAAGAATCTTGACAAAGCAATATCTGCTTCTCAGGATCAATATGATTTTGAGTTGAAGTGGCATCCGTTCCAGCTCAACCCTTCTGCCCCAAAGGAAGGTGTTGTTA
AGAGGGAATATTACAGGAGTAAGTTTGGAATTCAATCAGAACAGATGGAAGCGCGGATGGCAGAGGTGTTTAGGGGTCTTGGACTGGATTATGACATGTCTGGGCTGACG
GGAAATACTCTAGACAGCCACAAGCTTATATATTTGGCAGGTCAACAAGGTTTAGGTAAACAGCATGATCTTGTGGAGGAGTTGTGCCTTGGATACTTCACTCAGGGAAA
ATACATTGGTGACCGGGACTTTCTTTTGGAATGTGCTAGAAAGGCGGGGGTAGAAGGAGCAGCAGAGTTTCTCGAGTCAGCTGATAATGGAGTTAAGGAGGTCAAGGAGG
AGCTTGAGAAGTACTCAGGAAAAATTTCAGGAGTTCCCTTTTATGTTATCAATGGGAAGCACAAGTTGAGTGGTGCTCAACCCCCTGAGGTTTTTGTAAGAGCTTTTCAA
GTGGCAGGGAAGTGATTGAAACTGAAATGAAGCTTTTGAATTTGACGATCAATAAGATAAGAATGAAACGTCCTTGCTAAAATCAGTATACATACTCTTATTCTTGTATT
TACTACCATAATGGAATGCATTTAGATCAAACTGCAAATTACAGCTAGAAATTTGTTTGCCAAGTAATAAAATACTTGGTTGTACACTATTAGGTATTAGCACATGCTTG
AGATTTAAATAGTTTCATCTTGATGACTCATTAGCAATTCTAATTTTGTTTTCAATTTTTGGTCCAAGCTAAAGTAGATTTGACTATTGAAATCTCCTTTTAACTGCAAT
TTATCTTTCTGAAAAGTAGGACCAAACTTCCAATGTTACCAAAAAATTACAAAACAATGATATAATTGATTCAGATT
Protein sequenceShow/hide protein sequence
MLRIQLPLRFVSPNRLPIPSPQLFQGECKYIRIMAESVGSRNMDKKLIQIDISSDTVCPWCFVGKKNLDKAISASQDQYDFELKWHPFQLNPSAPKEGVVKREYYRSKFG
IQSEQMEARMAEVFRGLGLDYDMSGLTGNTLDSHKLIYLAGQQGLGKQHDLVEELCLGYFTQGKYIGDRDFLLECARKAGVEGAAEFLESADNGVKEVKEELEKYSGKIS
GVPFYVINGKHKLSGAQPPEVFVRAFQVAGK