; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024549 (gene) of Chayote v1 genome

Gene IDSed0024549
OrganismSechium edule (Chayote v1)
DescriptionCUE domain-containing protein
Genome locationLG03:46758251..46760781
RNA-Seq ExpressionSed0024549
SyntenySed0024549
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603664.1 hypothetical protein SDJN03_04273, partial [Cucurbita argyrosperma subsp. sororia]5.8e-8675.97Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE
        MSAG+CGKRVGFEEIFGSSSPT CS  KRSRWS FGSPTRSEF                GV A+  SFDDF+AR+DSA IGNCSTVP+ERTATCSQMSHE
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE

Query:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE
         V EA D  SA+ EG  MHGSKWVDMFVQEM  AVD+ DARIR+ARILEAFEHN+T +SRESEELKHASLK H QSLVNDNQILKRAVAIQHERNLEQEE
Subjt:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE

Query:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        KT+EVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

XP_008447682.1 PREDICTED: uncharacterized protein LOC103490091 isoform X1 [Cucumis melo]5.8e-8677.35Show/hide
Query:  MSAGICGKRVGFEEIFG-SSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSH
        MSAG+CGKRVGFEEIFG SSSPT CS  KRSRWS FGSPTRS+F                GV A+  SFDDF+AR DSA IGNCSTV +ERTATCSQMSH
Subjt:  MSAGICGKRVGFEEIFG-SSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSH

Query:  ETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQE
        E ++EAKDV SAVAEGN MHGSKWVDMFVQEM GAVD+ DARIR+ARILEAFEHN+T  SRESEELKHASLKEH QSLVNDNQILKRAVAIQHERNLEQE
Subjt:  ETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQE

Query:  EKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        EKT+EVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  EKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

XP_022143875.1 uncharacterized protein LOC111013683 isoform X1 [Momordica charantia]4.0e-8771.54Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEFGVAAQGS------------------------------------SFDDFNARIDSAAI
        MSAG+CGKRVGFEEIFGSSSPT CS  KRSRWS FGSPTRSEFG   + S                                    SFDDFNAR+DSA I
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEFGVAAQGS------------------------------------SFDDFNARIDSAAI

Query:  GNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVND
        GNCSTVP+ER ATCSQMSHE V++ KDV SAVAEGN MHGSKWVDMFVQEMT A+DLDDAR R+ RILEAFEHN+TA+SRESEELKHASLKEH Q+LVND
Subjt:  GNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVND

Query:  NQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        NQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  NQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

XP_022143877.1 uncharacterized protein LOC111013683 isoform X2 [Momordica charantia]6.7e-9078.11Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE
        MSAG+CGKRVGFEEIFGSSSPT CS  KRSRWS FGSPTRSEF                GV A+  SFDDFNAR+DSA IGNCSTVP+ER ATCSQMSHE
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE

Query:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE
         V++ KDV SAVAEGN MHGSKWVDMFVQEMT A+DLDDAR R+ RILEAFEHN+TA+SRESEELKHASLKEH Q+LVNDNQILKRAVAIQHERNLEQEE
Subjt:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE

Query:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        KTKEVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

XP_038882907.1 uncharacterized protein LOC120074014 isoform X1 [Benincasa hispida]2.6e-8674.68Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE
        MSAG+CGKRVGFEEIFGSSSPT CS  KR+RWS FGSPTRS+F                GV A+ SSFDDF+AR++SA IGNCSTVP+ERTATCSQMSHE
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE

Query:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE
         ++EAKDV SA+ EGN M+GSKWVDMFVQEM GAVD+ DARIR+ARILEAFEHN+T +SRESEELKHASL+EH Q+LVNDNQILKRA+AIQHERNLEQEE
Subjt:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE

Query:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        K +EVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

TrEMBL top hitse value%identityAlignment
A0A1S3BHY9 uncharacterized protein LOC103490091 isoform X12.8e-8677.35Show/hide
Query:  MSAGICGKRVGFEEIFG-SSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSH
        MSAG+CGKRVGFEEIFG SSSPT CS  KRSRWS FGSPTRS+F                GV A+  SFDDF+AR DSA IGNCSTV +ERTATCSQMSH
Subjt:  MSAGICGKRVGFEEIFG-SSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSH

Query:  ETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQE
        E ++EAKDV SAVAEGN MHGSKWVDMFVQEM GAVD+ DARIR+ARILEAFEHN+T  SRESEELKHASLKEH QSLVNDNQILKRAVAIQHERNLEQE
Subjt:  ETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQE

Query:  EKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        EKT+EVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  EKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

A0A5D3DIN3 CUE domain-containing protein2.8e-8677.35Show/hide
Query:  MSAGICGKRVGFEEIFG-SSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSH
        MSAG+CGKRVGFEEIFG SSSPT CS  KRSRWS FGSPTRS+F                GV A+  SFDDF+AR DSA IGNCSTV +ERTATCSQMSH
Subjt:  MSAGICGKRVGFEEIFG-SSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSH

Query:  ETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQE
        E ++EAKDV SAVAEGN MHGSKWVDMFVQEM GAVD+ DARIR+ARILEAFEHN+T  SRESEELKHASLKEH QSLVNDNQILKRAVAIQHERNLEQE
Subjt:  ETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQE

Query:  EKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        EKT+EVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  EKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

A0A6J1CQ09 uncharacterized protein LOC111013683 isoform X12.0e-8771.54Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEFGVAAQGS------------------------------------SFDDFNARIDSAAI
        MSAG+CGKRVGFEEIFGSSSPT CS  KRSRWS FGSPTRSEFG   + S                                    SFDDFNAR+DSA I
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEFGVAAQGS------------------------------------SFDDFNARIDSAAI

Query:  GNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVND
        GNCSTVP+ER ATCSQMSHE V++ KDV SAVAEGN MHGSKWVDMFVQEMT A+DLDDAR R+ RILEAFEHN+TA+SRESEELKHASLKEH Q+LVND
Subjt:  GNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVND

Query:  NQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        NQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  NQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

A0A6J1CS47 uncharacterized protein LOC111013683 isoform X23.2e-9078.11Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE
        MSAG+CGKRVGFEEIFGSSSPT CS  KRSRWS FGSPTRSEF                GV A+  SFDDFNAR+DSA IGNCSTVP+ER ATCSQMSHE
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE

Query:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE
         V++ KDV SAVAEGN MHGSKWVDMFVQEMT A+DLDDAR R+ RILEAFEHN+TA+SRESEELKHASLKEH Q+LVNDNQILKRAVAIQHERNLEQEE
Subjt:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE

Query:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        KTKEVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

A0A6J1INZ8 uncharacterized protein LOC1114781493.7e-8675.97Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE
        MSAG+CGKRVGFEEIFGSSSPT CS  KRSRWS FGSPTRSEF                GV A+  SFDDF+AR+DSA IGNCSTVP+ERTATCSQMSHE
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCS--KRSRWSGFGSPTRSEF----------------GVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHE

Query:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE
         V+EA D  SA+ EG  MHGSKWVDMFVQEM  AVD+ DARIR+ARILEAFEHN+T +SRESEELKHASLK H QSLVNDNQILKRAVAIQHERNLEQEE
Subjt:  TVKEAKDVKSAVAEGNRMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEE

Query:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        KT+EVHQLKHVLCQYQEQIQSLEV      L L
Subjt:  KTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80040.1 FUNCTIONS IN: molecular_function unknown5.2e-1631.15Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSSFD-----------------------DFNARIDSA-AIGNCSTVPEERTATC
        MSA  CG +  +   F  +S  P SKR R     SP+ S    +   SS D                       DFNA + S  +  +      E  A  
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSSFD-----------------------DFNARIDSA-AIGNCSTVPEERTATC

Query:  SQMSHETVKEAKDVKSAVAEGN-RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESE----ELKHASLKEHFQSLVNDNQILKRAVA
           + ET         AV  GN    G  WV++ V+E+  +   DDA++R+AR+LEA E  ++A +RE      + +  ++++  ++LV DN +LKRAVA
Subjt:  SQMSHETVKEAKDVKSAVAEGN-RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESE----ELKHASLKEHFQSLVNDNQILKRAVA

Query:  IQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        IQHER    E+   ++  LK ++ QYQE++++LEV   A ++ L
Subjt:  IQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

AT1G80040.3 FUNCTIONS IN: molecular_function unknown4.4e-1533.52Show/hide
Query:  VAAQGSSFDDFNARIDSA-AIGNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGN-RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAY
        V A   +  DFNA + S  +  +      E  A     + ET         AV  GN    G  WV++ V+E+  +   DDA++R+AR+LEA E  ++A 
Subjt:  VAAQGSSFDDFNARIDSA-AIGNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGN-RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAY

Query:  SRESE----ELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        +RE      + +  ++++  ++LV DN +LKRAVAIQHER    E+   ++  LK ++ QYQE++++LEV   A ++ L
Subjt:  SRESE----ELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

AT5G02510.1 BEST Arabidopsis thaliana protein match is: Ubiquitin system component Cue protein (TAIR:AT5G32440.1)6.8e-2453.51Show/hide
Query:  GSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQI
        G+KWVD  V EMT A+++DD R R A ILEA E  I   +  S++L++AS+KE  QSL+NDNQILKR +A QH+R+ E EEK K+V  L+ V+ QYQEQ+
Subjt:  GSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQI

Query:  QSLEVLRLAYKLSL
          LE+   A KL L
Subjt:  QSLEVLRLAYKLSL

AT5G32440.1 Ubiquitin system component Cue protein1.7e-1931.6Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSS----------FDDFNARIDSAAIGNCSTVPEERTATCSQMSHETVKEAKDV
        MSA +CGKR  FE++  +S P   SK+ R   F S + S F      SS          F D + +I   AI  C    +      +Q+  E+  +  D 
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSS----------FDDFNARIDSAAIGNCSTVPEERTATCSQMSHETVKEAKDV

Query:  K-------------------SAVAEGN--RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAY----SRESEELKHASLKEHFQSLVNDNQI
                            SA  E N   + G++WV++FV+EM  A D+ DA+ R+AR LEA E +I A     + ++ + ++  LK+  +++V +N +
Subjt:  K-------------------SAVAEGN--RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAY----SRESEELKHASLKEHFQSLVNDNQI

Query:  LKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        LKRAV  Q +R  E E++++E+  L+ ++ QYQEQ+++LEV   A  L L
Subjt:  LKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL

AT5G32440.3 Ubiquitin system component Cue protein1.0e-1931.47Show/hide
Query:  MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSS----------FDDFNARIDSAAIGNCSTVPEERTATCSQMSHETVKEAKDV
        MSA +CGKR  FE++  +S P   SK+ R   F S + S F      SS          F D + +I   AI  C    +      +Q+  E+  +  D 
Subjt:  MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSS----------FDDFNARIDSAAIGNCSTVPEERTATCSQMSHETVKEAKDV

Query:  --------------------KSAVAEGN--RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAY----SRESEELKHASLKEHFQSLVNDNQ
                            +SA  E N   + G++WV++FV+EM  A D+ DA+ R+AR LEA E +I A     + ++ + ++  LK+  +++V +N 
Subjt:  --------------------KSAVAEGN--RMHGSKWVDMFVQEMTGAVDLDDARIRSARILEAFEHNITAY----SRESEELKHASLKEHFQSLVNDNQ

Query:  ILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL
        +LKRAV  Q +R  E E++++E+  L+ ++ QYQEQ+++LEV   A  L L
Subjt:  ILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCTGGAATCTGTGGGAAACGGGTCGGGTTTGAAGAGATCTTCGGATCTTCTTCTCCGACGCCCTGTTCTAAGAGGTCCCGTTGGTCCGGTTTCGGATCCCCGAC
CCGATCGGAGTTCGGAGTCGCTGCCCAGGGGTCTTCTTTTGATGATTTTAATGCCCGTATCGATTCTGCAGCAATCGGAAATTGTTCTACTGTTCCTGAAGAGCGAACGG
CTACATGCAGTCAAATGTCACACGAGACGGTAAAAGAAGCCAAAGACGTTAAATCGGCCGTTGCAGAAGGAAATAGGATGCATGGGTCGAAGTGGGTGGACATGTTTGTG
CAAGAGATGACGGGCGCGGTAGATCTTGATGATGCTAGAATTCGATCTGCAAGAATTTTAGAAGCTTTTGAACATAACATAACTGCATATTCAAGAGAATCAGAGGAGTT
AAAGCATGCTTCTTTGAAGGAGCATTTCCAGAGCTTGGTAAATGACAACCAAATTTTAAAGAGAGCAGTAGCCATCCAGCACGAGCGAAATCTCGAGCAAGAAGAGAAGA
CGAAAGAAGTCCATCAGTTAAAGCATGTATTATGCCAGTATCAAGAACAAATTCAAAGTTTAGAGGTACTCAGACTTGCCTATAAGTTATCACTTTATTGCAAACATATT
ATCTTTCCAAACTTGGCTATCCCTAATATATTGCTGTAA
mRNA sequenceShow/hide mRNA sequence
CCTTCACCTTTTTCAACCTGCTCCTTCATCAAGCCCTATTCATTCCCCCATTTCCTTACGTTTTTCTCATCCGTTTCCCCTTTTTCCAACACTTTCATTCATTCCGACCA
AAACCCTTTTCTAATTTCTGATTTCCGATTCCTGAATTCTGAATTCTTGTGGCAATTTCGATTTTGGGTTTTTGAAAACGAGCCATGTCGGCTGGAATCTGTGGGAAACG
GGTCGGGTTTGAAGAGATCTTCGGATCTTCTTCTCCGACGCCCTGTTCTAAGAGGTCCCGTTGGTCCGGTTTCGGATCCCCGACCCGATCGGAGTTCGGAGTCGCTGCCC
AGGGGTCTTCTTTTGATGATTTTAATGCCCGTATCGATTCTGCAGCAATCGGAAATTGTTCTACTGTTCCTGAAGAGCGAACGGCTACATGCAGTCAAATGTCACACGAG
ACGGTAAAAGAAGCCAAAGACGTTAAATCGGCCGTTGCAGAAGGAAATAGGATGCATGGGTCGAAGTGGGTGGACATGTTTGTGCAAGAGATGACGGGCGCGGTAGATCT
TGATGATGCTAGAATTCGATCTGCAAGAATTTTAGAAGCTTTTGAACATAACATAACTGCATATTCAAGAGAATCAGAGGAGTTAAAGCATGCTTCTTTGAAGGAGCATT
TCCAGAGCTTGGTAAATGACAACCAAATTTTAAAGAGAGCAGTAGCCATCCAGCACGAGCGAAATCTCGAGCAAGAAGAGAAGACGAAAGAAGTCCATCAGTTAAAGCAT
GTATTATGCCAGTATCAAGAACAAATTCAAAGTTTAGAGGTACTCAGACTTGCCTATAAGTTATCACTTTATTGCAAACATATTATCTTTCCAAACTTGGCTATCCCTAA
TATATTGCTGTAACTCTAGAGAGCTTTGCATTTTTTACATTCTTTTTTATGTTATGGAGGAAAGTAATGATTTTAGACACAATTTTAACTTGTCTGTTCGAATTTGTAAC
TTGGGTGTCATTCACATTAGAATTGAAGAACGTAAAAATATATCGAATTAAACGACATGGCTCAAGGGTCATGTTCTTTTGTCTATAGTCGATGTCATCATTCGTCGGTT
GTTACTTGATGCAAGATTAAGAGATCAAGCTATGCTGAACTTTTGTTTTTGTGGTAAAGGTTATACATTCTCATGTTGCTATGAATTGCTTCTGTTTTTTTCTTTTTTTT
TTGTGCAGGTAAGAAATTACACATTAAATCTCCATTTGCAGAGGGCACAATCTGTTTCAGGACACTTCCACCAAGACAGATTTTAATGTTGTTTTTGTTTCTCTATACAG
TAACTTCTTTTCTTTTACCAACATTGAGTTTATTCCAATATATTATAGCGTTATTTTGGGTGTCCTATTCAAGCCCAAGCTAAGCCAAAGCCAAAGCCTTTTTGGTGTAA
TGAAAGGTTAAAGGCTAGGCTAGGTTAGGCTCTACTTTGTTTGTAAGGGTTTTTTATGCAACTCAAGATGAAAAGCTATGTCTTGTCACTCCTAATTGATGTATTCATAT
TT
Protein sequenceShow/hide protein sequence
MSAGICGKRVGFEEIFGSSSPTPCSKRSRWSGFGSPTRSEFGVAAQGSSFDDFNARIDSAAIGNCSTVPEERTATCSQMSHETVKEAKDVKSAVAEGNRMHGSKWVDMFV
QEMTGAVDLDDARIRSARILEAFEHNITAYSRESEELKHASLKEHFQSLVNDNQILKRAVAIQHERNLEQEEKTKEVHQLKHVLCQYQEQIQSLEVLRLAYKLSLYCKHI
IFPNLAIPNILL