CuGenDBv2

Sed0016716 (gene) of Chayote v1 genome

Gene ID: Sed0016716
Organism: Sechium edule (Chayote v1)
Description: Protein of unknown function (DUF1068)
Genome location: LG04:43147798..43150398
RNA-Seq Expression: Sed0016716
Synteny: Sed0016716
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068
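
The genome location listed above follows a `sequence:start..end` pattern. A minimal sketch of parsing it and computing the span is given here; the helper names `parse_location` and `span_length` are hypothetical, and the code assumes both coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a string such as 'LG04:43147798..43150398' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


name, start, end = parse_location("LG04:43147798..43150398")
print(name, start, end, span_length(start, end))
```

Under that assumption the gene spans 2,601 bp on LG04.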


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572284.1 hypothetical protein SDJN03_29012, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.8e-87; identity: 86.08%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNP+L+K GL LIA+SI  YILGPPLYWH KE  AVVTR SSSSSSCPPCFCDCPS PL++IP+ELRN+TF DC+KHDPEVSQDTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+AAQKRLTAMWEQRARQRGWKEG  KSRTQKQ N+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

XP_004144504.1 uncharacterized protein LOC101202853 isoform X1 [Cucumis sativus] (e value: 9.5e-90; identity: 88.66%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNPSL+KLGL LIA++I  YILGPPLYWH KE  AVVT  SSSSSSCPPCFCDCPSHP++SIPEELRNSTFADCVKHDPEVS+DTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+ AQKRLTAMWEQRARQRGWKEG  KSRTQKQGN+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

XP_008455457.1 PREDICTED: uncharacterized protein LOC103495614 isoform X1 [Cucumis melo] (e value: 5.8e-87; identity: 86.08%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ +SCNPSL+KLGL LIA++I  YILGPPLYWH KE  AVV+  SS+SSSCPPCFCDCPS P++SIPEELRNSTFADCVKHDPEVS+DTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+ AQKRLTA WEQRARQRGWKEG  KSRTQKQGN+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

XP_022952351.1 uncharacterized protein LOC111455059 isoform X1 [Cucurbita moschata] (e value: 2.6e-87; identity: 86.6%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNPSL+K GL LIA+SI  YILGPPLYWH KE  AVVTR SSSSSSCPPCFCDCPS PL++IP+ELRN+TF DC+KHDPEVSQDTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+AAQKRLTAMWEQRARQRGWKEG  KSRTQKQ N+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

XP_038888598.1 uncharacterized protein LOC120078397 [Benincasa hispida] (e value: 5.2e-88; identity: 87.63%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNPSL+K GL LIA+SI  YILGPPLYWH KE  AVV  S SSSSSCPPCFCDCPS P++SIPEELRNSTFADCVKHDPE+SQDTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+ AQKRLTAMWEQRARQRGWKEG  KSRTQKQGN+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
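
The % identity values above can be related to the middle line of each alignment. A minimal sketch follows, assuming the usual BLAST convention for that line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The helper name `midline_stats` is hypothetical, and the example string is only the first 20 columns of the first GenBank alignment.

```python
def midline_stats(midline, length=None):
    """Summarise a BLAST-style midline.

    `length` is the number of aligned columns. It defaults to len(midline),
    but should be passed explicitly if trailing spaces may have been stripped.
    """
    if length is None:
        length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# First 20 columns of the KAG6572284.1 midline shown above.
stats = midline_stats("MAIQ SSCNP+L+K GL LI")
print(stats)
```

Applied to the full 194-column alignment, the reported 86.08 for KAG6572284.1 corresponds to 167 identical columns (167 / 194), assuming identity is computed over the whole alignment length.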

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K245 Uncharacterized protein (e value: 4.6e-90; identity: 88.66%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNPSL+KLGL LIA++I  YILGPPLYWH KE  AVVT  SSSSSSCPPCFCDCPSHP++SIPEELRNSTFADCVKHDPEVS+DTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+ AQKRLTAMWEQRARQRGWKEG  KSRTQKQGN+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

A0A1S3C135 uncharacterized protein LOC103495614 isoform X1 (e value: 2.8e-87; identity: 86.08%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ +SCNPSL+KLGL LIA++I  YILGPPLYWH KE  AVV+  SS+SSSCPPCFCDCPS P++SIPEELRNSTFADCVKHDPEVS+DTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+ AQKRLTA WEQRARQRGWKEG  KSRTQKQGN+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

A0A6J1ERF9 uncharacterized protein LOC111437126 isoform X1 (e value: 3.6e-87; identity: 87.11%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ  SC+PSL+KLGL LIA+SI  YILGPPLYWH KE FAVV+R SSSSSSCPPCFCDCPS P++SIPEELRNSTFADCVKHDPEVSQDTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKEAEA E+ RRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+AAQKRLTA WEQRARQRGWKEGA KSR QKQGN+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

A0A6J1GK01 uncharacterized protein LOC111455059 isoform X1 (e value: 1.3e-87; identity: 86.6%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNPSL+K GL LIA+SI  YILGPPLYWH KE  AVVTR SSSSSSCPPCFCDCPS PL++IP+ELRN+TF DC+KHDPEVSQDTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+AAQKRLTAMWEQRARQRGWKEG  KSRTQKQ N+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

A0A6J1HXB7 uncharacterized protein LOC111468321 (e value: 3.6e-87; identity: 86.08%)
Query:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL
        MAIQ SSCNPSL+K GL LIA+SI  YIL PPLYWH KE  AVVTR SSSSSSCPPCFCDCPS PL++IP+ELRN+TF DC+KHDPEVSQDTEKNFA+LL
Subjt:  MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELL

Query:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        LEELKLKE EALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAV+AAQKRLTAMWEQRARQRGWKEG  KSRTQKQ N+QTA
Subjt:  LEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05070.1 Protein of unknown function (DUF1068) (e value: 5.2e-62; identity: 63.74%)
Query:  VKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEAL
        +K+GL L+ +S+  YILGPPLYWH  EA A V     S+SSCP C C+C ++  V+IP+EL N++FADC KHDPEV++DTEKN+AELL EELKL+EAE+L
Subjt:  VKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEAL

Query:  ENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        E  +RAD+ LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  +A QK+LT+ WE+RARQ+GW+EG+ K   + + NVQ A
Subjt:  ENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

AT2G32580.1 Protein of unknown function (DUF1068) (e value: 3.3e-56; identity: 60.44%)
Query:  VKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEAL
        +K+GL L+A+S+  YILGPPLYWH  EA AV      S++SC  C CDC S PL++IP  L N +F DC K DPEV++DTEKN+AELL EELK +EA ++
Subjt:  VKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEAL

Query:  ENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA
        E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  +  QK+LT+MWEQRARQ+G+K+GA KS  + +   + A
Subjt:  ENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA

AT2G32580.2 Protein of unknown function (DUF1068) (e value: 8.4e-36; identity: 62.71%)
Query:  TFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGW
        +  +C K DPEV++DTEKN+AELL EELK +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  +  QK+LT+MWEQRARQ+G+
Subjt:  TFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGW

Query:  KEGAVKSRTQKQGNVQTA
        K+GA KS  + +   + A
Subjt:  KEGAVKSRTQKQGNVQTA

AT4G04360.1 Protein of unknown function (DUF1068) (e value: 2.0e-45; identity: 58.18%)
Query:  LIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEALENQRRA
        ++ + I AYI GP LYWH  E  A      S  SSCPPC CDC S PL+SIP+ L N +F DC++H+ E S+++E +F E++ EELKL+EA+A E++ RA
Subjt:  LIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEALENQRRA

Query:  DIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKS
        D  LL+AKK  SQYQKEADKC+ GMETCE AREKAEA +  Q+RL+ MWE RARQ GWKEG V S
Subjt:  DIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKS

AT4G30996.1 Protein of unknown function (DUF1068) (e value: 3.2e-35; identity: 47.53%)
Query:  LGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSH-PLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEALENQ
        L + AV     + GP LYW   + F   TR   ++S CPPC CDCP    L+ I   L N +  DC   DPE+ Q+ EK F +LL EELKL+EA A E+ 
Subjt:  LGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSH-PLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAEALENQ

Query:  RRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWK
        R  ++ L EAK++ SQYQKEA+KCN+  E CE ARE+AEA++  ++++T++WE+RARQ GW+
Subjt:  RRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWK


Sequences
CDS sequence
ATGGCGATTCAAGAGAGTTCTTGCAATCCTTCCCTCGTCAAGCTGGGATTGGGTTTGATCGCCGTCTCCATTGGCGCTTACATTCTGGGCCCTCCCCTCTATTGGCACTC
CAAGGAAGCATTTGCCGTCGTTACCCGCTCTTCATCGTCTTCTTCTTCTTGCCCGCCTTGTTTCTGCGACTGCCCTTCTCACCCACTCGTCTCCATTCCCGAAGAATTGA
GGAACTCTACCTTTGCAGATTGTGTTAAGCATGACCCAGAAGTGAGTCAAGACACTGAGAAGAACTTTGCAGAACTATTGTTAGAGGAGCTGAAGTTAAAGGAAGCCGAA
GCCTTGGAAAATCAGCGTCGTGCCGATATAGCTCTGCTCGAGGCAAAGAAGATGACATCTCAGTATCAAAAAGAAGCAGACAAATGCAATTCTGGGATGGAAACATGCGA
AGAAGCAAGGGAGAAAGCCGAAGCGGTAATAGCTGCACAGAAGAGGCTAACAGCAATGTGGGAGCAAAGGGCTCGTCAAAGGGGATGGAAAGAAGGGGCTGTCAAGTCTC
GCACTCAAAAACAAGGAAATGTTCAAACTGCATAA
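
The CDS above can be checked against the protein sequence listed for this gene by translating it. A minimal sketch follows, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; stop codons are shown as '*'."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 nucleotides of the CDS shown above.
print(translate("ATGGCGATTCAAGAGAGTTCTTGCAATCCT"))
print(translate("GTTCAAACTGCATAA"))
```

The two fragments translate to `MAIQESSCNP` and `VQTA*`, matching the start and end of the protein sequence given for this gene, with the final `TAA` as the stop codon.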
mRNA sequence
GCGAAATATATATTTTCAAAATGAAATTCGTAACATGTTAAAATAGATGACCATCGACCCATTCGAGTATCATAATTATCTCAAAAAACTATGCTTTGTCCCAGAAAAAA
AGGGGGTTAGAGATTGGGATAGCCAAAAACTCTGATTCTCGATTTACGAGATTCGGGCCCAATCCGGAGTTGTGTTCATTTCATTCAACTCTGCAAATTTCTTTGCAGCA
TCTCCAATGGCGATTCAAGAGAGTTCTTGCAATCCTTCCCTCGTCAAGCTGGGATTGGGTTTGATCGCCGTCTCCATTGGCGCTTACATTCTGGGCCCTCCCCTCTATTG
GCACTCCAAGGAAGCATTTGCCGTCGTTACCCGCTCTTCATCGTCTTCTTCTTCTTGCCCGCCTTGTTTCTGCGACTGCCCTTCTCACCCACTCGTCTCCATTCCCGAAG
AATTGAGGAACTCTACCTTTGCAGATTGTGTTAAGCATGACCCAGAAGTGAGTCAAGACACTGAGAAGAACTTTGCAGAACTATTGTTAGAGGAGCTGAAGTTAAAGGAA
GCCGAAGCCTTGGAAAATCAGCGTCGTGCCGATATAGCTCTGCTCGAGGCAAAGAAGATGACATCTCAGTATCAAAAAGAAGCAGACAAATGCAATTCTGGGATGGAAAC
ATGCGAAGAAGCAAGGGAGAAAGCCGAAGCGGTAATAGCTGCACAGAAGAGGCTAACAGCAATGTGGGAGCAAAGGGCTCGTCAAAGGGGATGGAAAGAAGGGGCTGTCA
AGTCTCGCACTCAAAAACAAGGAAATGTTCAAACTGCATAAAGGCCATCCAAACAGTAACACTTCCTCTCATTCTCACTTTTTCTTACATTTCATATGTATTTATATATT
TGGGTAGTAAATTAAATTTTACTAGACTTTTTCAAATAGATCGAAAGTATTATGCCAGGTATATCTTTAGCAAATCGAAATGTCAATTACCATTGTTTAGTTAAACATAA
GAAATTGTTCATTAGTGTTGAAAGCAACTAATGAGTGCAAGAATTTTTTTAAGCTAGTTCAAGAAGTTATGTCACTCTCATTTGTCATCATTTTTCTTATATCCGTGATT
GCCCCGAGCACCTTGAGTTGAGTGCATCTTGACTAATATCACAAGACATACTGTCTGAC
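
The CDS appears as one contiguous stretch inside the mRNA sequence above, so the untranslated regions can be recovered by locating it. A minimal sketch follows; the helper name `split_transcript` is hypothetical, and the example uses short fragments copied from the two sequences rather than the full strings.

```python
def split_transcript(mrna, cds):
    """Return (five_prime_utr, three_prime_utr) around the first exact CDS match."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[end:]


# Fragments copied from the sequences above: the mRNA fragment starts six
# nucleotides upstream of the ATG start codon.
mrna_fragment = "TCTCCAATGGCGATTCAAGAGAGTTCTTGC"
cds_fragment = "ATGGCGATTCAAGAGAGTTCTTGC"
five_prime, three_prime = split_transcript(mrna_fragment, cds_fragment)
print(len(five_prime), len(three_prime))
```

With the full mRNA and CDS strings in place of the fragments, the same call would return the complete 5' and 3' UTRs.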
Protein sequence
MAIQESSCNPSLVKLGLGLIAVSIGAYILGPPLYWHSKEAFAVVTRSSSSSSSCPPCFCDCPSHPLVSIPEELRNSTFADCVKHDPEVSQDTEKNFAELLLEELKLKEAE
ALENQRRADIALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVIAAQKRLTAMWEQRARQRGWKEGAVKSRTQKQGNVQTA