CuGenDBv2

Tan0016704 (gene) of Snake gourd v1 genome

Gene ID: Tan0016704
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF4228 domain-containing protein
Genome location: LG10:21567819..21568879
RNA-Seq Expression: Tan0016704
Synteny: Tan0016704
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits | e value | % identity
KAG6581636.1 hypothetical protein SDJN03_21638, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-87 | 68.33
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MGMK LSVQNMVSWSCFHL K         S    NIKLI S G+VR+YHRPI VS++LLEFPKHLVCRSD+FYIGQKIP LSEHD LQLGHKYFLLPKP
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVT
        FFHSVLSFV+IASFFS  N  N     RF  N +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQGNPK      SP LP GKICTTPQLAK+YTQLV 
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVT

Query:  RRQWKPKLETIRETKKK-GITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK
         R+WKPKLETI ETKKK GI+PFGL K G PF          PSPPP        RS+Y + HHLR+I   YKPKIRIKS+
Subjt:  RRQWKPKLETIRETKKK-GITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK

XP_008465162.1 PREDICTED: uncharacterized protein LOC103502830 [Cucumis melo] | 8.1e-98 | 71.99
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MG+K+LSVQNMVSWSCFHL+KNA+P D   +  N NIKLI SDG+VRIYHRPI+VSEVLLEFPKH VCRSDSFYIGQKIPPLS+ D LQLGHKYFLLP P
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV
        FFHSVLSFVTIASFFSSSN+NN N   RFINN +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQG NPK    Q +PSLPLGKICTTPQLAKDYTQLV
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV

Query:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA
          R+WKPKLETI+ET+KK ++ FGL KA  PF+S         +P          RSAYP+  HLRSI FAYK KIRIKS++
Subjt:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA

XP_011652332.1 uncharacterized protein LOC105435015 [Cucumis sativus] | 3.6e-98 | 72.34
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MG+K+LSVQNMVSWSCFHL+KNAIP     S  + NIKLIHSDG+VRIYHRPI+VSEVLLEFPKHLVCRSDSFYIGQKIPPLS+ D LQLGHKYFLLP  
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV
        FFHSVLSFVTIASFFSSSNNNN N   RFINN +AS   PFD+QRTPSGCL+IRVSD+FIS+LLEQG NPK    Q SPSLPLGKICTTPQLAKDYTQLV
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV

Query:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA
          R+WKPKLETI+ET+KK I+ FGL KA            P PS P         RS YP+ HHLRSI FAYK KIRIK+++
Subjt:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA

XP_022935139.1 uncharacterized protein LOC111442099 [Cucurbita moschata] | 1.2e-88 | 69.04
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MGMK LSVQNMVSWSCFHL K     D     A  NIKLI S G+VR+YHRPI VSE+LLEFPKHLVCRSD+FYIGQKIP LSEHD LQLGHKYFLLPKP
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVT
        FFHSVLSFV+IASFFS  N  N     RF  N +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQGNPK      SP LP GKICTTPQLAK+YTQLV 
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVT

Query:  RRQWKPKLETIRETKKK-GITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK
         R+WKPKLETI ETKKK GI+PFGL K G PF          PSPPP        RS+Y + HHLR+I   YKPKIRIKS+
Subjt:  RRQWKPKLETIRETKKK-GITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK

XP_038905326.1 uncharacterized protein LOC120091392 [Benincasa hispida] | 1.0e-100 | 73.76
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MG+K+LSVQNMVSWSCFHL+KNAIP       A  NIKLI+SDG+VRIYH+PI+VSEVLLEFPKHLVCRSDSFYIGQKIPPLS+ D LQLGHKYFLLPKP
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPK-IQHQQDSPSLPLGKICTTPQLAKDYTQL
        FFHSVLSFVTIASFFS+S +N+NN N RF+NN +A+C RPFD+QRTPSGCLRIRVSDEFIS+LLEQG NPK + HQQ + SLPLGKICTTPQLAKDYTQL
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPK-IQHQQDSPSLPLGKICTTPQLAKDYTQL

Query:  VTRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK
        V  RQWKPKLETI ETKKK I+ FGL KA  PF S           PP        RS+YP+ HHLRSI  AYKPKIRIKS+
Subjt:  VTRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK

TrEMBL top hits | e value | % identity
A0A0A0LCN9 Uncharacterized protein | 1.8e-98 | 72.34
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MG+K+LSVQNMVSWSCFHL+KNAIP     S  + NIKLIHSDG+VRIYHRPI+VSEVLLEFPKHLVCRSDSFYIGQKIPPLS+ D LQLGHKYFLLP  
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV
        FFHSVLSFVTIASFFSSSNNNN N   RFINN +AS   PFD+QRTPSGCL+IRVSD+FIS+LLEQG NPK    Q SPSLPLGKICTTPQLAKDYTQLV
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV

Query:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA
          R+WKPKLETI+ET+KK I+ FGL KA            P PS P         RS YP+ HHLRSI FAYK KIRIK+++
Subjt:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA

A0A1S3CNA0 uncharacterized protein LOC103502830 | 3.9e-98 | 71.99
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MG+K+LSVQNMVSWSCFHL+KNA+P D   +  N NIKLI SDG+VRIYHRPI+VSEVLLEFPKH VCRSDSFYIGQKIPPLS+ D LQLGHKYFLLP P
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV
        FFHSVLSFVTIASFFSSSN+NN N   RFINN +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQG NPK    Q +PSLPLGKICTTPQLAKDYTQLV
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV

Query:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA
          R+WKPKLETI+ET+KK ++ FGL KA  PF+S         +P          RSAYP+  HLRSI FAYK KIRIKS++
Subjt:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA

A0A5A7T3Y8 DUF4228 domain-containing protein | 4.0e-87 | 70.72
Query:  VKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSN
        +KNA+P D   +  N NIKLI SDG+VRIYHRPI+VSEVLLEFPKH VCRSDSFYIGQKIPPLS+ D LQLGHKYFLLP PFFHSVLSFVTIASFFSSSN
Subjt:  VKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSN

Query:  NNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQWKPKLETIRETKKKG
        +NN N   RFINN +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQG NPK    Q +PSLPLGKICTTPQLAKDYTQLV  R+WKPKLETI+ET+KK 
Subjt:  NNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQWKPKLETIRETKKKG

Query:  ITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA
        ++ FGL KA  PF+S         +P          RSAYP+  HLRSI FAYK KIRIKS++
Subjt:  ITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA

A0A5D3CH36 DUF4228 domain-containing protein | 3.9e-98 | 71.99
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MG+K+LSVQNMVSWSCFHL+KNA+P D   +  N NIKLI SDG+VRIYHRPI+VSEVLLEFPKH VCRSDSFYIGQKIPPLS+ D LQLGHKYFLLP P
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV
        FFHSVLSFVTIASFFSSSN+NN N   RFINN +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQG NPK    Q +PSLPLGKICTTPQLAKDYTQLV
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQG-NPKIQHQQDSPSLPLGKICTTPQLAKDYTQLV

Query:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA
          R+WKPKLETI+ET+KK ++ FGL KA  PF+S         +P          RSAYP+  HLRSI FAYK KIRIKS++
Subjt:  TRRQWKPKLETIRETKKKGITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA

A0A6J1F4J4 uncharacterized protein LOC111442099 | 5.6e-89 | 69.04
Query:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP
        MGMK LSVQNMVSWSCFHL K     D     A  NIKLI S G+VR+YHRPI VSE+LLEFPKHLVCRSD+FYIGQKIP LSEHD LQLGHKYFLLPKP
Subjt:  MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKP

Query:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVT
        FFHSVLSFV+IASFFS  N  N     RF  N +A+C  PFD+QRTPSGCLRIRVSDEFIS+LLEQGNPK      SP LP GKICTTPQLAK+YTQLV 
Subjt:  FFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVT

Query:  RRQWKPKLETIRETKKK-GITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK
         R+WKPKLETI ETKKK GI+PFGL K G PF          PSPPP        RS+Y + HHLR+I   YKPKIRIKS+
Subjt:  RRQWKPKLETIRETKKK-GITPFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSK

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G28190.1 unknown protein | 1.0e-37 | 45.19
Query:  SWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVTIA
        SW C H      P  D  S     IKL+ SDG +++Y RP+ VSE+  +FPKH +CRSD  YIGQK P LSE + L+LG  YFLLP  FF + LSF+TIA
Subjt:  SWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVTIA

Query:  SFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSG-CLRIRVSDEFISRLLEQGNP----KIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQWKPK
        +           N    +  T    Q PF IQ+   G  LRIRVS+EF+S L+ +G      + + +++      G++CTT +L KDY QLV  R+WKPK
Subjt:  SFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSG-CLRIRVSDEFISRLLEQGNP----KIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQWKPK

Query:  LETIRETK
        LETI ETK
Subjt:  LETIRETK

AT5G12340.1 unknown protein | 1.3e-21 | 38.55
Query:  HVS-EVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLR
        HV+ E++ EFP  +VC +DSFYIG++IP L+  D L  G  YF+LP   F   +   +  S F +SN  N  +N   +N T+ S   PF+  + P+G + 
Subjt:  HVS-EVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLR

Query:  IRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQ--WKPKLETIRETKKKGITP---FGLNK
        I+VS EFI  L+ + N  I+ ++ +       IC +P+L K Y QLV  R+  W P L+TI E K + ++P   FGLN+
Subjt:  IRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQ--WKPKLETIRETKKKGITP---FGLNK


Sequences
CDS sequence
ATGGGGATGAAGCTTCTAAGTGTTCAGAATATGGTTTCTTGGTCTTGTTTCCATTTGGTGAAGAACGCTATTCCCGCTGACGATCATCACTCCATCGCCAACAATAACAT
CAAACTCATTCATTCCGACGGCCTTGTTAGAATCTACCATCGTCCCATTCACGTTTCTGAGGTCTTGCTTGAATTTCCTAAGCATTTGGTTTGCCGATCTGATTCTTTTT
ACATCGGCCAGAAAATCCCTCCCCTTTCCGAACACGACCAGCTCCAACTCGGCCATAAGTACTTCCTTCTTCCTAAGCCTTTTTTCCACTCTGTTCTCTCTTTTGTCACC
ATCGCTTCTTTTTTCTCCTCTTCTAATAACAACAACAACAACAACAACGCTCGATTCATCAACAACACCTCCGCCAGCTGCCAACGCCCTTTCGACATCCAACGAACCCC
CTCCGGCTGCCTTCGAATCCGTGTCTCCGATGAGTTCATTTCTCGATTACTCGAACAGGGCAATCCCAAAATCCAACACCAACAGGATTCGCCTTCGTTGCCACTTGGTA
AAATCTGCACCACGCCCCAATTGGCCAAAGACTACACCCAGCTCGTCACCAGGCGCCAATGGAAGCCAAAGCTCGAGACCATTCGTGAGACTAAAAAGAAAGGGATTACG
CCCTTTGGCTTGAACAAGGCCGGGAAGCCTTTTCGTTCTCATTTAAAGCTGCTTCCGCCTCCCCCATCGCCGCCGCCCAAGCCCGGCGGATCTGTATGGACCAGATCGGC
GTACCCAATTCATCACCATTTGCGTTCCATTCAATTTGCTTATAAGCCTAAGATTCGGATCAAATCCAAGGCTTAA
mRNA sequence
CTGTCTTCTCCCTTCTTCTTCATCCTTCACTTTCAAATTCAATTTTCTTATATTTACATCACCCATGGGGATGAAGCTTCTAAGTGTTCAGAATATGGTTTCTTGGTCTT
GTTTCCATTTGGTGAAGAACGCTATTCCCGCTGACGATCATCACTCCATCGCCAACAATAACATCAAACTCATTCATTCCGACGGCCTTGTTAGAATCTACCATCGTCCC
ATTCACGTTTCTGAGGTCTTGCTTGAATTTCCTAAGCATTTGGTTTGCCGATCTGATTCTTTTTACATCGGCCAGAAAATCCCTCCCCTTTCCGAACACGACCAGCTCCA
ACTCGGCCATAAGTACTTCCTTCTTCCTAAGCCTTTTTTCCACTCTGTTCTCTCTTTTGTCACCATCGCTTCTTTTTTCTCCTCTTCTAATAACAACAACAACAACAACA
ACGCTCGATTCATCAACAACACCTCCGCCAGCTGCCAACGCCCTTTCGACATCCAACGAACCCCCTCCGGCTGCCTTCGAATCCGTGTCTCCGATGAGTTCATTTCTCGA
TTACTCGAACAGGGCAATCCCAAAATCCAACACCAACAGGATTCGCCTTCGTTGCCACTTGGTAAAATCTGCACCACGCCCCAATTGGCCAAAGACTACACCCAGCTCGT
CACCAGGCGCCAATGGAAGCCAAAGCTCGAGACCATTCGTGAGACTAAAAAGAAAGGGATTACGCCCTTTGGCTTGAACAAGGCCGGGAAGCCTTTTCGTTCTCATTTAA
AGCTGCTTCCGCCTCCCCCATCGCCGCCGCCCAAGCCCGGCGGATCTGTATGGACCAGATCGGCGTACCCAATTCATCACCATTTGCGTTCCATTCAATTTGCTTATAAG
CCTAAGATTCGGATCAAATCCAAGGCTTAATTATTTGTTTCCATTCCATAAATCATTGTTCATGTAGTTTAGTTTAGTTTAGTTTAGTTTAGTTTTGGATACATTAATTC
TATATATATATATGTATTATTAGAGATGTGTATAAAAATATAACACAGCAAATTGAGAACCAATTTTACTT
Protein sequence
MGMKLLSVQNMVSWSCFHLVKNAIPADDHHSIANNNIKLIHSDGLVRIYHRPIHVSEVLLEFPKHLVCRSDSFYIGQKIPPLSEHDQLQLGHKYFLLPKPFFHSVLSFVT
IASFFSSSNNNNNNNNARFINNTSASCQRPFDIQRTPSGCLRIRVSDEFISRLLEQGNPKIQHQQDSPSLPLGKICTTPQLAKDYTQLVTRRQWKPKLETIRETKKKGIT
PFGLNKAGKPFRSHLKLLPPPPSPPPKPGGSVWTRSAYPIHHHLRSIQFAYKPKIRIKSKA