; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006505 (gene) of Chayote v1 genome

Gene IDSed0006505
OrganismSechium edule (Chayote v1)
DescriptionDUF4408 domain-containing protein
Genome locationLG04:42262409..42263715
RNA-Seq ExpressionSed0006505
SyntenySed0006505
Gene Ontology termsNA
InterPro domainsIPR025520 - Domain of unknown function DUF4408


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591991.1 hypothetical protein SDJN03_14337, partial [Cucurbita argyrosperma subsp. sororia]5.0e-4860.71Show/hide
Query:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSP
        ML +MTSN+  ILSLKL+L+SSA+ S A +LKF VP+V  ILVS VPA+WSSI  WLRPPYLYLL NFII+SILASS L H+ LV P      KL   S 
Subjt:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSP

Query:  VDSTVHRAVGDVLDNSCNRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGAT
         D TV RAVGDVL+ S  R+A ED  ADVSE++ K      S  S+REN    E DR  L+K+DS+E LLQK++KKPPIS KI NR+ AKAAT T AG T
Subjt:  VDSTVHRAVGDVLDNSCNRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGAT

Query:  WRWSNVNGHNTL-DAWKAITEGRS
         RWS +N H+TL DAW AITEG S
Subjt:  WRWSNVNGHNTL-DAWKAITEGRS

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]8.0e-3854.23Show/hide
Query:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----------SK
        M+SV++SNNH ILSLKL+L+SSA+ S A +LKFSVPVVA ILVSDVPA+WSSI  WLRPPYLYLL+NFIII+ILASS L H+DLV P           +K
Subjt:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----------SK

Query:  LSAHSPVDSTVHRAVGDVLDNSCN---RNASEDLEADVSENDAKLAKSMASERENE-----ADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAAT
        LS  S  D  V  AV DV   S +    N + D+  +  ++D +L  S  S+REN+      DR GL+K+DS+EI  QK+EKKPPIS K+G+R+  K   
Subjt:  LSAHSPVDSTVHRAVGDVLDNSCN---RNASEDLEADVSENDAKLAKSMASERENE-----ADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAAT

Query:  P
        P
Subjt:  P

XP_022936351.1 uncharacterized protein LOC111442999 [Cucurbita moschata]4.5e-4160.41Show/hide
Query:  AAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSPVDSTVHRAVGDVLDNSCNRNASEDLEA
        A +LKF VP+V  ILVS VPA+WSSI  WLRPPYLYLL NFII+SILASS L H+ LV P      KL   S  D TV RAVGDVL+ S  R+A ED  A
Subjt:  AAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSPVDSTVHRAVGDVLDNSCNRNASEDLEA

Query:  DVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAWKAITEGRS
        DVSE++ K      S  S+REN    E DR  L+K+DS+E LLQK++KKPPIS KI NR+ AKAAT T AG T RWS +N H+TL DAW AITEG S
Subjt:  DVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAWKAITEGRS

XP_022975735.1 uncharacterized protein LOC111475963 [Cucurbita maxima]1.6e-3844.16Show/hide
Query:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLH--------------------
        MLSVMTSN+  IL L+L+ +SSA+ S   +LKF VP+V  ILVSDVPA+WSSI  WLRPPYLYLL NFII+SILASS LH                    
Subjt:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLH--------------------

Query:  -----------------------------------------------------------HEDLV---------GPSKLSAHSPVDSTVHRAVGDVLDNSC
                                                                   +ED V          P KL A S  D TV RAVGDVL+ S 
Subjt:  -----------------------------------------------------------HEDLV---------GPSKLSAHSPVDSTVHRAVGDVLDNSC

Query:  NRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSM-EILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAW
        +R+A ED  ADVSE++ K      S  S+REN    E  R  L+K+D + E LLQK++KKPPIS KIGNR+ AKAAT T AG T +WS +NGH+TL DAW
Subjt:  NRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSM-EILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAW

Query:  KAITEGRS
         AITEG S
Subjt:  KAITEGRS

XP_023535113.1 uncharacterized protein LOC111796630 [Cucurbita pepo subsp. pepo]3.5e-4961.61Show/hide
Query:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSP
        MLSVMTSN+  IL LKL+L+SS + S A +LKF VP+V  ILVSDVPA+WSSI  WLRPPYLYLL NFII+SILASS L H+ LV P     SKL A S 
Subjt:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSP

Query:  VDSTVHRAVGDVLDNSCNRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGAT
         D TV RAV DVL+ S  R+A ED  ADVSE++ K      S  S+REN    E DR  L+K+DS+E LLQK++KKPPIS KI NR+ AKAAT T AG T
Subjt:  VDSTVHRAVGDVLDNSCNRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGAT

Query:  WRWSNVNGHNTL-DAWKAITEGRS
         RWS +N H+TL DAW AITEG S
Subjt:  WRWSNVNGHNTL-DAWKAITEGRS

TrEMBL top hitse value%identityAlignment
A0A6J1CG82 uncharacterized protein LOC1110105213.9e-3854.23Show/hide
Query:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----------SK
        M+SV++SNNH ILSLKL+L+SSA+ S A +LKFSVPVVA ILVSDVPA+WSSI  WLRPPYLYLL+NFIII+ILASS L H+DLV P           +K
Subjt:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----------SK

Query:  LSAHSPVDSTVHRAVGDVLDNSCN---RNASEDLEADVSENDAKLAKSMASERENE-----ADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAAT
        LS  S  D  V  AV DV   S +    N + D+  +  ++D +L  S  S+REN+      DR GL+K+DS+EI  QK+EKKPPIS K+G+R+  K   
Subjt:  LSAHSPVDSTVHRAVGDVLDNSCN---RNASEDLEADVSENDAKLAKSMASERENE-----ADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAAT

Query:  P
        P
Subjt:  P

A0A6J1F789 uncharacterized protein LOC1114429992.2e-4160.41Show/hide
Query:  AAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSPVDSTVHRAVGDVLDNSCNRNASEDLEA
        A +LKF VP+V  ILVS VPA+WSSI  WLRPPYLYLL NFII+SILASS L H+ LV P      KL   S  D TV RAVGDVL+ S  R+A ED  A
Subjt:  AAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGP-----SKLSAHSPVDSTVHRAVGDVLDNSCNRNASEDLEA

Query:  DVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAWKAITEGRS
        DVSE++ K      S  S+REN    E DR  L+K+DS+E LLQK++KKPPIS KI NR+ AKAAT T AG T RWS +N H+TL DAW AITEG S
Subjt:  DVSENDAKL---AKSMASEREN----EADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAWKAITEGRS

A0A6J1FQE8 uncharacterized protein LOC1114475142.1e-2035.74Show/hide
Query:  SNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHE-----------DLVGPSKLSAHSP
        SN   ILSLK+ LIS+ V S A  LKFSVP+VA  LVS++P+IW+    WLRPPYLYL++N IIISI+ASS L  +           + + P  L+  S 
Subjt:  SNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHE-----------DLVGPSKLSAHSP

Query:  VDSTVHRAVGDVLDNSCNRNASE-------DLEADVS--------------------ENDAKLAKSMASERENEADRI-GLRKDDSMEILLQKHEKKPPI
            V     D+L+  C  NA++       DL  D S                    END+ +  + A E    +     L + DS+ +L    ++KPP+
Subjt:  VDSTVHRAVGDVLDNSCNRNASE-------DLEADVS--------------------ENDAKLAKSMASERENEADRI-GLRKDDSMEILLQKHEKKPPI

Query:  SWKIGNRRVAKAATPTGAGATWRWSNVNGHNTLDA-WKAITEGRSRDRWKIRHSPKSQPSESE
        S +IG R++ KA+     G     S     +TL++ W+ ITEGRS      RH  KS   ES+
Subjt:  SWKIGNRRVAKAATPTGAGATWRWSNVNGHNTLDA-WKAITEGRSRDRWKIRHSPKSQPSESE

A0A6J1IHJ9 uncharacterized protein LOC1114759637.8e-3944.16Show/hide
Query:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLH--------------------
        MLSVMTSN+  IL L+L+ +SSA+ S   +LKF VP+V  ILVSDVPA+WSSI  WLRPPYLYLL NFII+SILASS LH                    
Subjt:  MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLH--------------------

Query:  -----------------------------------------------------------HEDLV---------GPSKLSAHSPVDSTVHRAVGDVLDNSC
                                                                   +ED V          P KL A S  D TV RAVGDVL+ S 
Subjt:  -----------------------------------------------------------HEDLV---------GPSKLSAHSPVDSTVHRAVGDVLDNSC

Query:  NRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSM-EILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAW
        +R+A ED  ADVSE++ K      S  S+REN    E  R  L+K+D + E LLQK++KKPPIS KIGNR+ AKAAT T AG T +WS +NGH+TL DAW
Subjt:  NRNASEDLEADVSENDAKL---AKSMASEREN----EADRIGLRKDDSM-EILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTL-DAW

Query:  KAITEGRS
         AITEG S
Subjt:  KAITEGRS

A0A6J1ITI6 uncharacterized protein LOC1114785099.6e-2136.47Show/hide
Query:  SNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHE-----------DLVGPSKLSAHSP
        SN   ILSLK+ LIS+ V S A  LK SVP+VA  LVS++P+IW+    WLRPPYLYL++N IIISI+ASS L  +           + + P  L+  S 
Subjt:  SNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHE-----------DLVGPSKLSAHSP

Query:  VDSTVHRAVGDVLDNSCNRNASE-------DLEADVS------------ENDAKLAKSMASERENEADRIG-LRKDDSMEILLQKHEKKPPISWKIGNRR
            V     D+L+  C  NA++       DL+ D S            END+ +  + A E    +     L + DS+ +L    ++KPP+S +IG R+
Subjt:  VDSTVHRAVGDVLDNSCNRNASE-------DLEADVS------------ENDAKLAKSMASERENEADRIG-LRKDDSMEILLQKHEKKPPISWKIGNRR

Query:  VAKAATPTGAGATWRWSNVNGHNTLDA-WKAITEGRSRDRWKIRHSPKSQPSESE
        + KA+     G     S     +TL++ W+ ITEGRS      RH  KS   ES+
Subjt:  VAKAATPTGAGATWRWSNVNGHNTLDA-WKAITEGRSRDRWKIRHSPKSQPSESE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11210.1 Protein of unknown function (DUF761)4.7e-0428.1Show/hide
Query:  LKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIII----SILASSMLHHEDLVGPSKLSAHSPVDSTVHRAVGDVLDN
        +K +LIS+ V +TA  LK  VPV      S  P I SS   WL+PPYLY++ N III    S   +++  H D  G    +++S  D+        ++  
Subjt:  LKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIII----SILASSMLHHEDLVGPSKLSAHSPVDSTVHRAVGDVLDN

Query:  SCNRNASEDLEADVSENDAKLAKSMASE---RENEADRIGLRKDDSMEILLQKHEKKPP-----ISWKIGNRRVAKAATPTGAGA--TWRWSNVNGHNTL
        +  R  +E  +AD       L      E    E E       ++    I++ K E +PP     ++ +IG ++     TP    +    R +    + TL
Subjt:  SCNRNASEDLEADVSENDAKLAKSMASE---RENEADRIGLRKDDSMEILLQKHEKKPP-----ISWKIGNRRVAKAATPTGAGA--TWRWSNVNGHNTL

Query:  D-AWKAITEG
        +  WK I EG
Subjt:  D-AWKAITEG

AT1G11220.1 Protein of unknown function (DUF761)4.6e-0726.03Show/hide
Query:  ILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHH----------EDLVGPSKLSAHSPVDSTVH
        ++S+K  LI++ + + +  LK SVP+     VS  P  WSS   WL+PPYL++ +N II  I+ASS  +           E L+G      +    +   
Subjt:  ILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHH----------EDLVGPSKLSAHSPVDSTVH

Query:  RAVGDVLDNSCNRNASEDL---EADVSENDAKLAKSMASERENEADRIGLRKDDSMEILLQKHEKKPP-----ISWKIGNRRVAKAATPTGAGATWRWSN
        R V    D          +   E ++ E   +  +   S + N  D   + + +  + ++++ E  PP     +S + G+R+  KA++  G     +   
Subjt:  RAVGDVLDNSCNRNASEDL---EADVSENDAKLAKSMASERENEADRIGLRKDDSMEILLQKHEKKPP-----ISWKIGNRRVAKAATPTGAGATWRWSN

Query:  V---NGHNTLD-AWKAITE
        V   N H TL+  W  ITE
Subjt:  V---NGHNTLD-AWKAITE

AT1G61260.1 Protein of unknown function (DUF761)8.0e-1230.36Show/hide
Query:  ILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSML---HH----EDLV-----GPSKLSAHSPVDST
        +++ K +LISS VA+ A +LK SVPV     VS  P +WSS+  WL+PPYLY++ N III+I+ASS     HH    ED +     G  K+    P+ + 
Subjt:  ILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSML---HH----EDLV-----GPSKLSAHSPVDST

Query:  VHRAVGDVLD-NSCNRNAS-----EDLEADVSENDAKLAKSMASERENE--ADRIGLRKDDSME-----ILLQKHE---------------------KKP
         H+A   +L+    +  A       +LEA+  E++A  A     E E +   D     +D+  E     I+++  +                     +KP
Subjt:  VHRAVGDVLD-NSCNRNAS-----EDLEADVSENDAKLAKSMASERENE--ADRIGLRKDDSME-----ILLQKHE---------------------KKP

Query:  PISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTLD-AWKAITEGRS
         ++ + G+R++ KA+     G   R +    + TL+  WK ITEG+S
Subjt:  PISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTLD-AWKAITEGRS

AT5G54300.1 Protein of unknown function (DUF761)3.2e-0828.85Show/hide
Query:  LISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSML-HHEDLVGPSKLS-----------AHSPVD------STV
        ++ + V+S A  +  +VP V+  +VS  P I+ +  F L+PPYLYL++N II+ I+A+S L H    V  S++S            H P D      + V
Subjt:  LISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSML-HHEDLVGPSKLS-----------AHSPVD------STV

Query:  HRAVGDV------LDNSCNRNASEDLEADVSENDAKLAKSMASERENEADRIGLRKDDSMEILLQKHE-KKPPISWKIGNRRVAKAAT-----PTGAGAT
        H  V D       +D+       E +       +A+ +K  +   E E ++  L K+DS EI + KH  +KPP   +   ++  K+ +      T  G T
Subjt:  HRAVGDV------LDNSCNRNASEDLEADVSENDAKLAKSMASERENEADRIGLRKDDSMEILLQKHE-KKPPISWKIGNRRVAKAAT-----PTGAGAT

Query:  WRWSNVNGHNTLD-AWKAITEGRS---------RDRWKIRHSPKSQPSESEEI
                 +TL+  WK ITEGRS          D W+ R   +S P   E++
Subjt:  WRWSNVNGHNTLD-AWKAITEGRS---------RDRWKIRHSPKSQPSESEEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCTGTAATGACTTCAAACAACCATCGGATTCTATCCCTGAAGCTCATACTCATCTCCTCCGCCGTTGCATCCACGGCGGCGATGCTCAAATTCTCCGTCCCGGT
GGTTGCCGGCATCCTCGTCTCCGACGTTCCCGCCATCTGGAGCTCCATCGCTTTCTGGCTAAGGCCTCCTTACCTTTACTTGCTCCTTAATTTCATCATTATCAGTATCC
TCGCCTCTTCCATGCTCCATCACGAGGACCTAGTCGGTCCTTCAAAGCTATCCGCTCATTCACCCGTAGATTCCACTGTTCACAGAGCCGTCGGGGATGTTCTTGATAAT
AGTTGCAACCGCAACGCGAGCGAAGACCTCGAGGCGGATGTTTCGGAAAATGATGCGAAATTGGCGAAATCCATGGCGTCCGAGAGAGAAAACGAGGCAGATCGGATCGG
ACTGCGAAAGGACGATTCGATGGAGATTTTGTTGCAGAAACACGAAAAGAAGCCGCCGATTTCATGGAAGATCGGGAATCGGAGAGTTGCAAAAGCTGCGACTCCTACTG
GCGCAGGGGCGACGTGGAGGTGGTCCAATGTGAACGGTCACAACACGTTAGACGCGTGGAAGGCGATCACGGAGGGCCGTTCTCGAGACCGTTGGAAAATCCGCCATAGT
CCGAAATCGCAGCCCTCTGAAAGTGAAGAAATCGGAAACTACCTGCAACGGCGGCGAGGGTGA
mRNA sequenceShow/hide mRNA sequence
CAAAGCCCCTCTCGTCTTCACTGAACCGCCGATCAGACAAACAGAACATGCTCTCTGTAATGACTTCAAACAACCATCGGATTCTATCCCTGAAGCTCATACTCATCTCC
TCCGCCGTTGCATCCACGGCGGCGATGCTCAAATTCTCCGTCCCGGTGGTTGCCGGCATCCTCGTCTCCGACGTTCCCGCCATCTGGAGCTCCATCGCTTTCTGGCTAAG
GCCTCCTTACCTTTACTTGCTCCTTAATTTCATCATTATCAGTATCCTCGCCTCTTCCATGCTCCATCACGAGGACCTAGTCGGTCCTTCAAAGCTATCCGCTCATTCAC
CCGTAGATTCCACTGTTCACAGAGCCGTCGGGGATGTTCTTGATAATAGTTGCAACCGCAACGCGAGCGAAGACCTCGAGGCGGATGTTTCGGAAAATGATGCGAAATTG
GCGAAATCCATGGCGTCCGAGAGAGAAAACGAGGCAGATCGGATCGGACTGCGAAAGGACGATTCGATGGAGATTTTGTTGCAGAAACACGAAAAGAAGCCGCCGATTTC
ATGGAAGATCGGGAATCGGAGAGTTGCAAAAGCTGCGACTCCTACTGGCGCAGGGGCGACGTGGAGGTGGTCCAATGTGAACGGTCACAACACGTTAGACGCGTGGAAGG
CGATCACGGAGGGCCGTTCTCGAGACCGTTGGAAAATCCGCCATAGTCCGAAATCGCAGCCCTCTGAAAGTGAAGAAATCGGAAACTACCTGCAACGGCGGCGAGGGTGA
GAAAAAAAGGCGTCGGTGACGGAGGAGGAACTAAACCGGCAGGTGGAGGCGTTTATAGAGAAGTTTAAGGAGGAAATGAGGCTGCAAAGGGAGGAATCACTGAAGAAGTT
TGAGAGAATGATAAACCAGGGAGGAGGATATTATTGAATTCGCTGCATCAATTTGATCTGAATGGGTAAGTTGATTTCATGTTTGATTATGCATTTAGATTCAGAGTTTA
ATTTGGTTTTTTGTTTGAATTTTGGGTGGAGGCTGATTCTGTTGATATAGAGATTATGAAGTTGAAGTGGAGTAGTACTGGATCATGAGGTTCATGCGGCAAGGTTTGTT
TTGGATAACGCTTTCAAGATTAAGAATGGGATTATATAATAATCGATTAAGTTGGACAATAATGGGAGTTTATGAAATTTATGTTTT
Protein sequenceShow/hide protein sequence
MLSVMTSNNHRILSLKLILISSAVASTAAMLKFSVPVVAGILVSDVPAIWSSIAFWLRPPYLYLLLNFIIISILASSMLHHEDLVGPSKLSAHSPVDSTVHRAVGDVLDN
SCNRNASEDLEADVSENDAKLAKSMASERENEADRIGLRKDDSMEILLQKHEKKPPISWKIGNRRVAKAATPTGAGATWRWSNVNGHNTLDAWKAITEGRSRDRWKIRHS
PKSQPSESEEIGNYLQRRRG