; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011691 (gene) of Snake gourd v1 genome

Gene IDTan0011691
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG07:7374304..7376336
RNA-Seq ExpressionTan0011691
SyntenyTan0011691
Gene Ontology termsGO:0006464 - cellular protein modification process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
GO:0140096 - catalytic activity, acting on a protein (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145897.1 uncharacterized protein LOC101214743 [Cucumis sativus]3.8e-10782.14Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G
        MSGGVGP C DISLP+EQ  E +HKE+ DPKQG G  +G+R   KAAAFLS RQLNALAVV+I SASGMVCAEDL FV FSI+YMY +SRVAFP +GG G
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G

Query:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED
        +  VF P+NRVLRLYV  AA+VGLFLP+AYILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA ND FSTPIRVFVPVFYNSRRIF+L EWLR+EFAKED
Subjt:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED

Query:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        KEYSGSVRRL+ GRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
Subjt:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

XP_008437515.1 PREDICTED: uncharacterized protein LOC103482907 [Cucumis melo]6.3e-11084.52Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G
        MSGGVGPTCSDISLP+EQ  E +HKE+ DPKQG    +G+R   KAAAFLS RQLNALAVV+I SASGMVCAEDLAFV FSI+YMY ISRVAFP MGG G
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G

Query:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED
        +  VF P+NRVLRLYV  AA++GLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIF+L EWLRDEFAKED
Subjt:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED

Query:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        KEYSGSVRRL+ GRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
Subjt:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

XP_022156297.1 uncharacterized protein LOC111023219 [Momordica charantia]5.1e-10481.89Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGGGE
        MSGGVGPTCSDISLP EQ  E +HKE CD K G  +  G R    AAAFLSFRQLNALAVVVI SASGMVCAEDLAFV FS+ YMY ISRVAFP  GGGE
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGGGE

Query:  P-TVFSP-KNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKE
           VFSP +++VLRLYV +AA +GLFLPI YILEGFFEDDKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIF++AEWLRDEFAKE
Subjt:  P-TVFSP-KNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKE

Query:  DKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY-SLYKSKD
        DKE+SGSVRRLL GRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  DKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY-SLYKSKD

XP_022973006.1 uncharacterized protein LOC111471524 [Cucurbita maxima]1.3e-10276.63Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQ-----GMK----LGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRV
        MSGGVGPTCSDISLP EQ  E++HKESCDPKQ Q     G+K    +G++    AAAFLSF+QLNALAVVVI SASGMVCAEDLAFV FS++YMY ISRV
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQ-----GMK----LGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRV

Query:  AFPTM-GGGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEW
        AFP + G GEPTVFS +NR+LRLY F   VVG FLPIAYILEGFFE+DKEGIKAASPHVFLLASQ FMEGVA NDRFSTPIRVFVPV YN+RR+F+L EW
Subjt:  AFPTM-GGGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEW

Query:  LRDEFAKEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        LRDEFAKEDKEYSGSVRR + GR LAV NMA+WSFNLFG LLP+Y+P+AFKRYYS+ KSKD
Subjt:  LRDEFAKEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

XP_038906981.1 uncharacterized protein LOC120092829 [Benincasa hispida]2.0e-11185.49Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPK-QGQGMKL--GQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMG
        MSGGVGPTCSDISLP+EQ  E +HKE+ D K Q QG+KL  G     KAAAFLSFRQLNALAVV+I SASGMVCAEDLAFV FS++YMY ISRVAFPT+G
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPK-QGQGMKL--GQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMG

Query:  -GGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFA
          GEPTVFSP+N++LRLYVF AAVVGLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA  DRFSTPIRVFVPVFYNSRRIF+LAEWLRDEF 
Subjt:  -GGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFA

Query:  KEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        KEDKEYSGSVRRLL GRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
Subjt:  KEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

TrEMBL top hitse value%identityAlignment
A0A0A0KK26 Uncharacterized protein1.8e-10782.14Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G
        MSGGVGP C DISLP+EQ  E +HKE+ DPKQG G  +G+R   KAAAFLS RQLNALAVV+I SASGMVCAEDL FV FSI+YMY +SRVAFP +GG G
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G

Query:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED
        +  VF P+NRVLRLYV  AA+VGLFLP+AYILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA ND FSTPIRVFVPVFYNSRRIF+L EWLR+EFAKED
Subjt:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED

Query:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        KEYSGSVRRL+ GRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
Subjt:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

A0A1S3AUS6 uncharacterized protein LOC1034829073.0e-11084.52Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G
        MSGGVGPTCSDISLP+EQ  E +HKE+ DPKQG    +G+R   KAAAFLS RQLNALAVV+I SASGMVCAEDLAFV FSI+YMY ISRVAFP MGG G
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G

Query:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED
        +  VF P+NRVLRLYV  AA++GLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIF+L EWLRDEFAKED
Subjt:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED

Query:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        KEYSGSVRRL+ GRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
Subjt:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

A0A5D3C472 Uncharacterized protein3.0e-11084.52Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G
        MSGGVGPTCSDISLP+EQ  E +HKE+ DPKQG    +G+R   KAAAFLS RQLNALAVV+I SASGMVCAEDLAFV FSI+YMY ISRVAFP MGG G
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGG-G

Query:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED
        +  VF P+NRVLRLYV  AA++GLFLPIAYILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIF+L EWLRDEFAKED
Subjt:  EPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKED

Query:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        KEYSGSVRRL+ GRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
Subjt:  KEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

A0A6J1DRN6 uncharacterized protein LOC1110232192.5e-10481.89Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGGGE
        MSGGVGPTCSDISLP EQ  E +HKE CD K G  +  G R    AAAFLSFRQLNALAVVVI SASGMVCAEDLAFV FS+ YMY ISRVAFP  GGGE
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGGGE

Query:  P-TVFSP-KNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKE
           VFSP +++VLRLYV +AA +GLFLPI YILEGFFEDDKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIF++AEWLRDEFAKE
Subjt:  P-TVFSP-KNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKE

Query:  DKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY-SLYKSKD
        DKE+SGSVRRLL GRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  DKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY-SLYKSKD

A0A6J1I7I0 uncharacterized protein LOC1114715246.1e-10376.63Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQ-----GMK----LGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRV
        MSGGVGPTCSDISLP EQ  E++HKESCDPKQ Q     G+K    +G++    AAAFLSF+QLNALAVVVI SASGMVCAEDLAFV FS++YMY ISRV
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQ-----GMK----LGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRV

Query:  AFPTM-GGGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEW
        AFP + G GEPTVFS +NR+LRLY F   VVG FLPIAYILEGFFE+DKEGIKAASPHVFLLASQ FMEGVA NDRFSTPIRVFVPV YN+RR+F+L EW
Subjt:  AFPTM-GGGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEW

Query:  LRDEFAKEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD
        LRDEFAKEDKEYSGSVRR + GR LAV NMA+WSFNLFG LLP+Y+P+AFKRYYS+ KSKD
Subjt:  LRDEFAKEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27990.1 unknown protein9.5e-3238.58Show/hide
Query:  QLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMG--GGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFL
        +L  +A +++ SASG+V   D+ F AF+ IY+  +SR+AFP+ G     P VF   +++ RLYV     +GLFLP+AY+L GF   D   +++A+PH+FL
Subjt:  QLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMG--GGEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFL

Query:  LASQVFMEGVARN-DRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKEDKEYSGS---VRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY
        L+ Q+  E V      FS P+R  VP+ Y   RIF +  W +D +  +    + +   V     GR LA+AN+  +  NL  FL+P +LPRAF++Y+
Subjt:  LASQVFMEGVARN-DRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKEDKEYSGS---VRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY

AT5G23920.1 unknown protein4.7e-3942.05Show/hide
Query:  RQLNALAVVVILSASGMVCAEDLAFVAFSIIYMY-LISRVAFPTMGGGEPTVFS-PKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVF
        RQL  L+ +++L+A G+V   ++AFV    IY+Y  +SR AFP     +    S PKN++ + Y    A++GL  P+ YI +G +  D  G  AA+PH+F
Subjt:  RQLNALAVVVILSASGMVCAEDLAFVAFSIIYMY-LISRVAFPTMGGGEPTVFS-PKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVF

Query:  LLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYS
        LL+ Q F E +  +D++S PI +  PVFYN+RRIF+L +W++ EF+  D +  G   RL  GR +A  N  +W +NLFG LLPV+LPR+ + Y+S
Subjt:  LLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKEDKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYYS

AT5G52420.1 unknown protein8.6e-6550Show/hide
Query:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMG--G
        MSGGVGPT +DI+LP+E+E E    +S       G         K A F SFRQLN LA++++LSASG+V  +D  F   ++IY + +S++ FP      
Subjt:  MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMG--G

Query:  GEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKE
         +  + S  N++ R+YV  A +VGL +PI YI EG  EDDK G+ AA+PHVFLLASQ+FMEG+A    FS P R+ VP+ YN+RR+ +L EW+  EF++E
Subjt:  GEPTVFSPKNRVLRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKE

Query:  DKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY
        D   + S RR+ AG+ LA AN+ +WSFNLFG L+PVYLPRAFKRYY
Subjt:  DKEYSGSVRRLLAGRALAVANMALWSFNLFGFLLPVYLPRAFKRYY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGGCGGGGTGGGGCCTACGTGCAGCGACATAAGCCTCCCAAGGGAGCAAGAACTGGAAGTCGTCCACAAAGAATCGTGCGATCCAAAGCAGGGGCAGGGGATGAA
ATTAGGGCAGAGGAATAATGACAAGGCGGCGGCGTTTCTGAGTTTCCGGCAACTGAACGCGCTGGCGGTGGTGGTGATCTTGTCGGCGAGCGGAATGGTGTGCGCGGAGG
ATTTGGCGTTTGTGGCGTTCTCAATTATTTACATGTACTTGATCTCCAGAGTGGCATTTCCGACGATGGGCGGCGGGGAGCCGACGGTGTTCAGCCCGAAGAACAGGGTG
CTCCGGTTGTACGTGTTCGTCGCCGCCGTGGTGGGCCTTTTCCTCCCCATAGCCTACATTCTTGAGGGGTTCTTTGAAGATGATAAAGAAGGCATCAAAGCCGCTTCTCC
CCATGTCTTCCTTCTCGCCAGTCAGGTTTTCATGGAAGGCGTGGCGAGGAACGACAGATTCTCGACGCCAATTCGTGTGTTCGTGCCAGTTTTCTACAACTCGAGGAGGA
TCTTCAGCCTGGCCGAGTGGCTGCGGGACGAGTTCGCCAAGGAAGACAAAGAATACAGCGGCTCTGTGCGGCGGCTGTTGGCCGGAAGAGCACTTGCTGTGGCCAACATG
GCGCTTTGGAGCTTCAATCTGTTTGGGTTCTTGTTGCCTGTGTATCTCCCAAGAGCTTTCAAGAGATATTATTCTCTTTACAAATCCAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
AAGTTCTCTCCTCTTTTCTCTTATCTGAAAAAACAGAATCAAGAAAAAAACAGAGAAAAAATGTCCGGCGGGGTGGGGCCTACGTGCAGCGACATAAGCCTCCCAAGGGA
GCAAGAACTGGAAGTCGTCCACAAAGAATCGTGCGATCCAAAGCAGGGGCAGGGGATGAAATTAGGGCAGAGGAATAATGACAAGGCGGCGGCGTTTCTGAGTTTCCGGC
AACTGAACGCGCTGGCGGTGGTGGTGATCTTGTCGGCGAGCGGAATGGTGTGCGCGGAGGATTTGGCGTTTGTGGCGTTCTCAATTATTTACATGTACTTGATCTCCAGA
GTGGCATTTCCGACGATGGGCGGCGGGGAGCCGACGGTGTTCAGCCCGAAGAACAGGGTGCTCCGGTTGTACGTGTTCGTCGCCGCCGTGGTGGGCCTTTTCCTCCCCAT
AGCCTACATTCTTGAGGGGTTCTTTGAAGATGATAAAGAAGGCATCAAAGCCGCTTCTCCCCATGTCTTCCTTCTCGCCAGTCAGGTTTTCATGGAAGGCGTGGCGAGGA
ACGACAGATTCTCGACGCCAATTCGTGTGTTCGTGCCAGTTTTCTACAACTCGAGGAGGATCTTCAGCCTGGCCGAGTGGCTGCGGGACGAGTTCGCCAAGGAAGACAAA
GAATACAGCGGCTCTGTGCGGCGGCTGTTGGCCGGAAGAGCACTTGCTGTGGCCAACATGGCGCTTTGGAGCTTCAATCTGTTTGGGTTCTTGTTGCCTGTGTATCTCCC
AAGAGCTTTCAAGAGATATTATTCTCTTTACAAATCCAAAGATTGAAATTACTTTTTTGGATTCTCTGTGGTGAAGAAGAAGAAATAATCTTCTTTTTTTCTTTTTTCTT
TTTTCTTTATAAGATAGCCAAAAATATTTTGGATTAGTTTTGGAGTAATTATTCGATTGTACTGTAATGGAAGTTTTCAAATGTATTGATCTGGCTCTGCATAGTGAGAT
GATCTTGTCTTGTCCATGTAATATAGTGGATCATCTACCAAAAAATATATATATTTTTCAAGGATGAGGTGGATCAGATTTTGAACCATACACTTTTTAAATGGATAACG
TCTTAATT
Protein sequenceShow/hide protein sequence
MSGGVGPTCSDISLPREQELEVVHKESCDPKQGQGMKLGQRNNDKAAAFLSFRQLNALAVVVILSASGMVCAEDLAFVAFSIIYMYLISRVAFPTMGGGEPTVFSPKNRV
LRLYVFVAAVVGLFLPIAYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVARNDRFSTPIRVFVPVFYNSRRIFSLAEWLRDEFAKEDKEYSGSVRRLLAGRALAVANM
ALWSFNLFGFLLPVYLPRAFKRYYSLYKSKD