; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015921 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015921
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
Genome locationChr03:1471118..1472970
RNA-Seq ExpressionHG10015921
SyntenyHG10015921
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026464.1 hypothetical protein SDJN02_10464, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-8665.13Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M++ SL FARD F +M+NFL++FKIQTKYGT  G  ASSVIISGIGL+L+Y ++ +KKEK  QRV TRSMS+GALHGG++AMKR+LQYHK+RA  + +  
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
        YLEKL+  +NTD P+F  +Q+++AK+EM GQEDKAIEILK+A KEA++ SL + EYEYQMLLVEALIYKG+I EA    CLN+++ SDVRR LYK II+L
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN Q+A EEWE+FR MR QF LPPD+KDS FY L+  FE FK VVDLLK+DI +K+K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

XP_022146501.1 uncharacterized protein LOC111015701 [Momordica charantia]1.7e-9774.71Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M H SLD ARDF LIMRNFL++FKIQTKYGT    AASSVIISGIGLVL+YV++ +K+EK D+RV  RSMS+GALHGGKLAMKRLLQY KMRAT KN+D+
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
         LEKL+KMI    P+F KLQSIVAKLEMRGQEDKAIEILK+AAKEA++NSL H EYEYQ+LLVE LIYKGNI EAE   CLN E+TSDVRRSLYKAIIQ+
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN +KA+E+WEEF+ MR++FLLPPDVKDS FY L+ EFE FK VVDLL KDI E+ K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

XP_022926668.1 uncharacterized protein LOC111433729 [Cucurbita moschata]9.1e-8865.9Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M++ SL FARD F++M+NFL++FKIQTKYGT  G  ASSVIISGIGL+L+Y ++ +KKEK  QRV TRSMS+GALHGG++AMKR+LQYHK+RA  + +  
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
        YLEKL+ M NTD P+F  +Q ++AK+EM GQEDKAIEILK+A KEA++ SL + EYEYQMLLVEALIYKG+I EA    CLN+++ SDVRR LYK II+L
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN QKA EEWE+FR MR QF LPPD+KDS FY L+  FE FK VVDLLK+DI +K+K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

XP_023518384.1 uncharacterized protein LOC111781887 [Cucurbita pepo subsp. pepo]2.9e-8665.13Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M+  SL FARD  ++M+NFL++FKIQTKYGT  G  ASSVIISGIGL+L+Y ++ +KKEK  QRV TRSMS+GALHGG++AMKR+LQYHKMRA  + +  
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
        YLE L+ M NTD P+F  +Q+++AK+EM GQEDKAIEILK+A KEA + SL + EYEYQMLLVEALIYKG+I EA    CLN+++ SDVRR LYK II+L
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN Q+A EEWE+FR MR QF LPPD+KDS FY L+  FE FK VVDLLK+DI +K+K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

XP_038882985.1 uncharacterized protein LOC120074072 [Benincasa hispida]2.4e-10478.71Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        MSHSSLDFARDFFL M NF  NFK+QTKYGT  G AAS  IISGIGLVL+YVF+   KEK DQRV  RS+SMGALH GKLAMKRLLQYHKMRATPK K+ 
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMIN--TDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAII
        YLEK +K+IN  TDRPNF KLQSI+AKLEM GQEDKAIEILKRAA+EA++NS P+ EYEYQMLLVEALIYKGN   AE VPCLNN D SDVRRSLYKAII
Subjt:  YLEKLKKMIN--TDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAII

Query:  QLLLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        QLLLNNTQKA EEWEEF+NMR+ FLLPPDVKDS F+TLLA+F+ FK VV++LKKDI EKQKAK
Subjt:  QLLLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

TrEMBL top hitse value%identityAlignment
A0A0A0KLZ3 Uncharacterized protein9.8e-8060.67Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYG--TGTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        MSH S D   DFF  + NF + F +QTKYG   G  AS+ I+SG+GLVL+Y  +   K+K  QRV TRS+S+GALHGGK+AMKRLLQ+ KMRA P+NKD+
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYG--TGTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTD------RPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLY
        +++KL   I  D       PNF K+Q+IV KLEM GQEDKAIE LK AA+EA+K SLP  E+EYQMLLVE  IYKG++ +AE +PCL N+ TSDVRR LY
Subjt:  YLEKLKKMINTD------RPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLY

Query:  KAIIQLLLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        KAII++L N TQ+A +EWEEFR MR+ FLLPPDVKDSHFY LLA+F+ FK VV +L++DI +K +AK
Subjt:  KAIIQLLLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

A0A5D3CMZ4 Uncharacterized protein1.3e-5063.74Show/hide
Query:  MRATPKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVR
        MRA P+ KD+YL+KL   I +D P+F+KLQ+IVAKLEM GQEDK IE LK AA++A+K S P  EYEYQMLLVE  IYKG   +AE +PCLNN D SDVR
Subjt:  MRATPKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVR

Query:  RSLYKAIIQLLLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        R L+KAII++LLN TQ+A +EWEEFR +R+ +LLPPDVKDS FYTLLA+F+ F+ VV +L++DI +K +AK
Subjt:  RSLYKAIIQLLLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

A0A6J1CZJ5 uncharacterized protein LOC1110157018.0e-9874.71Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M H SLD ARDF LIMRNFL++FKIQTKYGT    AASSVIISGIGLVL+YV++ +K+EK D+RV  RSMS+GALHGGKLAMKRLLQY KMRAT KN+D+
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
         LEKL+KMI    P+F KLQSIVAKLEMRGQEDKAIEILK+AAKEA++NSL H EYEYQ+LLVE LIYKGNI EAE   CLN E+TSDVRRSLYKAIIQ+
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN +KA+E+WEEF+ MR++FLLPPDVKDS FY L+ EFE FK VVDLL KDI E+ K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

A0A6J1EIU2 uncharacterized protein LOC1114337294.4e-8865.9Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M++ SL FARD F++M+NFL++FKIQTKYGT  G  ASSVIISGIGL+L+Y ++ +KKEK  QRV TRSMS+GALHGG++AMKR+LQYHK+RA  + +  
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
        YLEKL+ M NTD P+F  +Q ++AK+EM GQEDKAIEILK+A KEA++ SL + EYEYQMLLVEALIYKG+I EA    CLN+++ SDVRR LYK II+L
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN QKA EEWE+FR MR QF LPPD+KDS FY L+  FE FK VVDLLK+DI +K+K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

A0A6J1KTB7 uncharacterized protein LOC1114974391.6e-8564.75Show/hide
Query:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR
        M   SL FARD FL+M+NFL++FKIQTKYGT  G  ASSVIISGIGL+L+Y ++ +KKEK  QRV TRSMS+GALHGG++AMKR+LQY KMRA  + +  
Subjt:  MSHSSLDFARDFFLIMRNFLRNFKIQTKYGT--GTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDR

Query:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL
        YLEKL+   +TD P+F  +Q+++ K+EMRGQEDKAIEILK+A KEA++ SL + EYEYQMLLVEALIYKG+I EA    CLN+++ SDVRR LYK II+L
Subjt:  YLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQL

Query:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK
        LLNN QKA EEWE+FR MR  F LPPD++DS FY L+  FE FK VVDLLK+DI +K+K K
Subjt:  LLNNTQKAREEWEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34530.1 unknown protein4.6e-2937.31Show/hide
Query:  DQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQML
        D+R L++S+SMGA+ GGKLA++RLL  H  R    +      + + +++ ++P+F  LQ  + K+EM G+E K  E+LK+A ++A+K    H+ YE +ML
Subjt:  DQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQML

Query:  LVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQLLLNNTQKAREE-WEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKA
        LVE LIY GN+ EA    CL +E  +D RR LY+ II  L  +  K  EE +  FR ++     P   ++     +   F++FK V++ LK +I +  K 
Subjt:  LVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQLLLNNTQKAREE-WEEFRNMRNQFLLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKA

Query:  K
        K
Subjt:  K

AT2G34530.2 unknown protein1.5e-2442.96Show/hide
Query:  DQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQML
        D+R L++S+SMGA+ GGKLA++RLL  H  R    +      + + +++ ++P+F  LQ  + K+EM G+E K  E+LK+A ++A+K    H+ YE +ML
Subjt:  DQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQML

Query:  LVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKA
        LVE LIY GN+ EA    CL +E  +D RR LY+A
Subjt:  LVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKA

AT2G34540.2 unknown protein2.4e-0932.64Show/hide
Query:  KLAMKRLLQYHKMRAT----PKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIRE
        K A++ L +   M A+    P  K   L KL  + + D  + +K++++    E  G+ ++A+++L+ A    +  + P   +  QM LVE LI     +E
Subjt:  KLAMKRLLQYHKMRAT----PKNKDRYLEKLKKMINTDRPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIRE

Query:  AEMVPCLNNEDT--SDVRRSLYKAIIQLLLNNTQKAREEWEEFR
        A    CLN+E+   SDVR  LYKAII  +L+   +A++ W+EFR
Subjt:  AEMVPCLNNEDT--SDVRRSLYKAIIQLLLNNTQKAREEWEEFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACATTCCTCATTAGACTTTGCAAGGGACTTTTTTCTGATAATGAGAAACTTCCTCAGGAATTTTAAAATCCAAACCAAATATGGAACAGGGACCGCAGCGTCGAG
TGTCATCATCTCGGGCATTGGTCTTGTGTTGCTTTATGTCTTTAGTCTGAAGAAAAAGGAGAAATTTGACCAGCGTGTTCTTACAAGGTCAATGTCCATGGGAGCTCTAC
ATGGTGGCAAACTAGCCATGAAAAGATTGCTTCAATACCACAAAATGAGAGCAACCCCAAAAAACAAAGATAGATATCTGGAGAAGTTAAAGAAAATGATCAACACAGAC
CGTCCTAATTTCATGAAGCTCCAGAGCATTGTTGCAAAGCTGGAAATGAGAGGACAAGAAGATAAAGCTATTGAAATATTAAAAAGAGCAGCAAAAGAAGCTCAGAAGAA
TTCACTTCCACACCAAGAATATGAATATCAGATGCTCCTCGTGGAAGCACTCATTTACAAGGGAAATATTAGGGAGGCGGAAATGGTTCCATGCTTGAATAATGAGGACA
CATCGGATGTTCGACGCTCATTATATAAGGCTATAATTCAACTGTTGCTGAATAACACCCAGAAAGCAAGAGAAGAATGGGAAGAGTTTAGGAATATGAGAAACCAATTC
CTCTTACCACCTGACGTTAAAGACTCTCACTTTTACACTCTCCTGGCCGAATTCGAGCAGTTTAAACTAGTCGTCGACCTGCTCAAAAAAGACATTCATGAGAAGCAGAA
AGCAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCACATTCCTCATTAGACTTTGCAAGGGACTTTTTTCTGATAATGAGAAACTTCCTCAGGAATTTTAAAATCCAAACCAAATATGGAACAGGGACCGCAGCGTCGAG
TGTCATCATCTCGGGCATTGGTCTTGTGTTGCTTTATGTCTTTAGTCTGAAGAAAAAGGAGAAATTTGACCAGCGTGTTCTTACAAGGTCAATGTCCATGGGAGCTCTAC
ATGGTGGCAAACTAGCCATGAAAAGATTGCTTCAATACCACAAAATGAGAGCAACCCCAAAAAACAAAGATAGATATCTGGAGAAGTTAAAGAAAATGATCAACACAGAC
CGTCCTAATTTCATGAAGCTCCAGAGCATTGTTGCAAAGCTGGAAATGAGAGGACAAGAAGATAAAGCTATTGAAATATTAAAAAGAGCAGCAAAAGAAGCTCAGAAGAA
TTCACTTCCACACCAAGAATATGAATATCAGATGCTCCTCGTGGAAGCACTCATTTACAAGGGAAATATTAGGGAGGCGGAAATGGTTCCATGCTTGAATAATGAGGACA
CATCGGATGTTCGACGCTCATTATATAAGGCTATAATTCAACTGTTGCTGAATAACACCCAGAAAGCAAGAGAAGAATGGGAAGAGTTTAGGAATATGAGAAACCAATTC
CTCTTACCACCTGACGTTAAAGACTCTCACTTTTACACTCTCCTGGCCGAATTCGAGCAGTTTAAACTAGTCGTCGACCTGCTCAAAAAAGACATTCATGAGAAGCAGAA
AGCAAAATAG
Protein sequenceShow/hide protein sequence
MSHSSLDFARDFFLIMRNFLRNFKIQTKYGTGTAASSVIISGIGLVLLYVFSLKKKEKFDQRVLTRSMSMGALHGGKLAMKRLLQYHKMRATPKNKDRYLEKLKKMINTD
RPNFMKLQSIVAKLEMRGQEDKAIEILKRAAKEAQKNSLPHQEYEYQMLLVEALIYKGNIREAEMVPCLNNEDTSDVRRSLYKAIIQLLLNNTQKAREEWEEFRNMRNQF
LLPPDVKDSHFYTLLAEFEQFKLVVDLLKKDIHEKQKAK