; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015358 (gene) of Snake gourd v1 genome

Gene IDTan0015358
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAM domain-containing protein
Genome locationLG09:66969034..66970130
RNA-Seq ExpressionTan0015358
SyntenyTan0015358
Gene Ontology termsNA
InterPro domainsIPR013761 - Sterile alpha motif/pointed domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589808.1 hypothetical protein SDJN03_15231, partial [Cucurbita argyrosperma subsp. sororia]5.5e-11479.47Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGL+PEDIP FNHDFL KIG+SIAKHRLEILKLAK +REEAT KKLLLSAF KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA +SPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPMFPRSPRSSGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVSGPLDGRDGSPRICCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

KAG7023481.1 hypothetical protein SDJN02_14506, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-11479.09Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGL+PEDIP FNHDFL KIG+SIAKHRLEILKLAK +REEAT KKLLLSAF KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA +SPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPMFPRSPRSSGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVSGPLDGRDGSPR+CCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

XP_022960905.1 uncharacterized protein LOC111461567 [Cucurbita moschata]9.4e-11479.47Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGL+PEDIP FNHDFL KIG+SIAKHRLEILKLAK  REEAT KKLLLSAF KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA +SPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPMFPRSPRSSGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVSGPLDGRDGSPRICCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

XP_022987839.1 uncharacterized protein LOC111485262 [Cucurbita maxima]9.7e-11178.33Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGLKPEDIP FNHDFL +IG+SIAKHRLEILKLAK  REEAT KKLLLSA  KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA ISPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPM PRSPR SGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVS PLDGRDGSPRICCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

XP_023516525.1 uncharacterized protein LOC111780375 [Cucurbita pepo subsp. pepo]1.6e-11379.47Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGL+PEDIP FNHDFL KIG+SIAKHRLEILKLAK  REEAT KKLLLSAF KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA ISPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPMFPRSPRSSGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVSGPLDGRDGSP+ICCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

TrEMBL top hitse value%identityAlignment
A0A5A7U992 Sterile alpha motif, type 29.5e-9670.18Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFH-REEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPD
        MDWFSWLSRTGLDPL+TYEYGLLFARN LKPEDIP FNH FL KIGISIAKHRLEILKLAK H   +      L+SAF KTK CLRNCLR+LI  +  PD
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFH-REEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPD

Query:  KPLFRLDAAEISPEPPTHG---DVTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDR-----------PMFPRSPRSSGPLDG-
        KP+       ISPEPP          VKEV KPPRRRSK+VS SGPLD RTHEK +M+SKSLKLSGPL+RK+R           PMFPRSPR+SGPLDG 
Subjt:  KPLFRLDAAEISPEPPTHG---DVTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDR-----------PMFPRSPRSSGPLDG-

Query:  --------KSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
                KSPK NG  QG+MMR IPPSRSPRVSGPLDGRDGSPRICCRCN+ER+E++DDYHSLWVSLFYDMKPT
Subjt:  --------KSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

A0A6J1C2N7 uncharacterized protein LOC1110068857.5e-10170.96Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKL-----LLSAFTKTKNCLRNCLRKLILTN
        MDWFSWLSRTGLDPL+TYEYGLLFARNG++PEDIP FNHDFLHKIG+S+AKHRLEILKLAK   EE   KK      L+SAF KTK CLRNC+RKL+ +N
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKL-----LLSAFTKTKNCLRNCLRKLILTN

Query:  SRPDKPLFRLDAAEISPEPPTH----GDVTVVKEV--AKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG-----
         +P++ +FR   A ISPE  ++    G    V+EV   KP  RR KNVS SGPLDGR HEK M N KSLKLSGPL+RK+RPMF RSPR+SGPLDG     
Subjt:  SRPDKPLFRLDAAEISPEPPTH----GDVTVVKEV--AKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG-----

Query:  -----KSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
             KSPKFNG  QGKMMR IP SRSPRVSGPLDGRDGSPRICCRCN+ERIETDDDYHSLWVSLFYD+KPT
Subjt:  -----KSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

A0A6J1HAF4 uncharacterized protein LOC1114615674.5e-11479.47Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGL+PEDIP FNHDFL KIG+SIAKHRLEILKLAK  REEAT KKLLLSAF KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA +SPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPMFPRSPRSSGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVSGPLDGRDGSPRICCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

A0A6J1JJY8 uncharacterized protein LOC1114852624.7e-11178.33Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK
        MDWFSWLSRTGLDPL+TYEYGL+FARNGLKPEDIP FNHDFL +IG+SIAKHRLEILKLAK  REEAT KKLLLSA  KTKNCLRNCLRKLI TN++ +K
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDK

Query:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK
         +FR DAA ISPEP T+ +       VKEV KPP+RRSK+VS SGPLDGRTHEKLM NSKSLKLSGPL+RK+RPM PRSPR SGPLDG        +SPK
Subjt:  PLFRLDAAEISPEPPTHGD----VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------KSPK

Query:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
         NG  QG+MMR IPPSRSPRVS PLDGRDGSPRICCRCN+ER+ETDDDYHSLWVSLFYDMKPT
Subjt:  FNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT

A0A6J1L2N4 uncharacterized protein LOC1114992831.2e-9870.68Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFH-REEATHKKLLLSAFTKTKNCLRNCLRKLILT-NSRP
        MDWFSWLSRTGLDPL+TYEYGLLF +NGLKPEDI +FNH FLHKIGIS+A  RLEI+KLAKFH +++ THKK LLSAFTKTKNCLRNCLR L LT NS+P
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFH-REEATHKKLLLSAFTKTKNCLRNCLRKLILT-NSRP

Query:  DKPLFRLDAAEISPEPPTHGD------VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------
        +K +FR +AAEI+P  PTHG+         V EV KP +RRSKNVSHSGPLDG+THEKL+MN KSLKLSGPL RK+RPM PRSP S GPL+G        
Subjt:  DKPLFRLDAAEISPEPPTHGD------VTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDG--------

Query:  KSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKP
        KSPK N + +G+MMR IPPSRSPR SGP+   +GSP ICCRC +ERIETDDDYHSLW SLF+D+KP
Subjt:  KSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15760.1 Sterile alpha motif (SAM) domain-containing protein2.7e-1850.53Show/hide
Query:  WFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEA---THKKL--LLSAFTKTKNCLRNCLRKLI
        WFSWLSRT L+P   +EYGL F++N L+ EDI  F+H+FL  +GISIAKHRLEILKLA+  R+ +   T + +  +++A  KT+ CL + +R  I
Subjt:  WFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEA---THKKL--LLSAFTKTKNCLRNCLRKLI

AT1G80520.1 Sterile alpha motif (SAM) domain-containing protein1.6e-1853.12Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEA--THKKL--LLSAFTKTKNCLRNCLRKLI
        MDWFSWLSRT L+    YEYGL F+ N L+ EDI  FNH+FL  +GISIAKHRLEILKLA+  R+ +  T + +  +L A  KT  C    +R  I
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEA--THKKL--LLSAFTKTKNCLRNCLRKLI

AT2G12462.1 BEST Arabidopsis thaliana protein match is: Sterile alpha motif (SAM) domain-containing protein (TAIR:AT1G15760.1)2.4e-3034.43Show/hide
Query:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEAT---HKKL---LLSAFTKTKNCLRNCLRKLI--
        MDWFSWLS+T LDP  +YEYGL+FA+  L+ EDI  FNH+FL ++G+++ KHR+EILKL+K   +  +   H+ +   L+S   K    + N L K +  
Subjt:  MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEAT---HKKL---LLSAFTKTKNCLRNCLRKLI--

Query:  --------LTNSRPDKPLFRLDAAEISPEPPTHGDVTVVKEVAKPPRRRSKNVSHSGPLD---GRTHEKLMMNSKSLKLSGPLERKDRP---MFPRSPRS
                L   +   P +R  A         + +  VV +V + P  + K +  SGPLD   G   +  +++++S+ LSGPL+R  +    +  RSP  
Subjt:  --------LTNSRPDKPLFRLDAAEISPEPPTHGDVTVVKEVAKPPRRRSKNVSHSGPLD---GRTHEKLMMNSKSLKLSGPLERKDRP---MFPRSPRS

Query:  SGPLDGKSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT
        SG LDG                   +   R+SGPL GR  SP +    NK     DDD  + W ++F+++KPT
Subjt:  SGPLDGKSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSPRICCRCNKERIETDDDYHSLWVSLFYDMKPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGTTCTCTTGGCTTTCAAGAACCGGTCTCGATCCTCTTTACACCTACGAATACGGCCTTCTTTTCGCCCGAAATGGCCTCAAACCCGAAGACATTCCCACTTT
CAACCACGATTTTCTTCACAAAATCGGAATCTCCATCGCCAAACACAGGCTCGAGATTCTTAAACTCGCTAAATTCCATAGAGAAGAAGCCACCCACAAGAAATTGCTCC
TTTCCGCTTTCACCAAAACCAAGAACTGCCTCAGAAACTGCCTCCGTAAGCTTATTCTCACCAACTCCAGGCCGGATAAGCCCCTTTTCCGCCTAGATGCGGCGGAGATC
TCGCCGGAGCCGCCGACCCACGGCGACGTTACGGTGGTCAAGGAGGTTGCGAAGCCGCCGAGACGACGGAGTAAAAACGTGTCACATTCGGGGCCGTTGGATGGGAGAAC
GCACGAGAAGCTAATGATGAACAGTAAGAGCTTGAAGTTATCTGGGCCGTTGGAGAGAAAAGACCGGCCCATGTTTCCAAGAAGCCCAAGATCATCTGGGCCTCTGGATG
GTAAAAGCCCAAAGTTTAATGGGCTGGCGCAGGGGAAAATGATGAGGTGGATACCGCCGAGCCGGAGCCCAAGAGTATCTGGGCCGCTGGATGGACGAGATGGAAGCCCA
AGAATTTGCTGTCGTTGTAATAAGGAGAGGATAGAAACGGACGATGATTATCATTCGTTGTGGGTTTCTTTGTTTTATGACATGAAACCAACTTGA
mRNA sequenceShow/hide mRNA sequence
CCTTTTTCTGTGTTTGTTTATATTCGTGAATTGTCAATCTCTTTAGCACTGCCAAAAGATACAAACACATCAACAACCCATTCTTCAAAAGCTTCATCCGCCACTCCTCC
TTCAAATTTCAATGGATTGGTTCTCTTGGCTTTCAAGAACCGGTCTCGATCCTCTTTACACCTACGAATACGGCCTTCTTTTCGCCCGAAATGGCCTCAAACCCGAAGAC
ATTCCCACTTTCAACCACGATTTTCTTCACAAAATCGGAATCTCCATCGCCAAACACAGGCTCGAGATTCTTAAACTCGCTAAATTCCATAGAGAAGAAGCCACCCACAA
GAAATTGCTCCTTTCCGCTTTCACCAAAACCAAGAACTGCCTCAGAAACTGCCTCCGTAAGCTTATTCTCACCAACTCCAGGCCGGATAAGCCCCTTTTCCGCCTAGATG
CGGCGGAGATCTCGCCGGAGCCGCCGACCCACGGCGACGTTACGGTGGTCAAGGAGGTTGCGAAGCCGCCGAGACGACGGAGTAAAAACGTGTCACATTCGGGGCCGTTG
GATGGGAGAACGCACGAGAAGCTAATGATGAACAGTAAGAGCTTGAAGTTATCTGGGCCGTTGGAGAGAAAAGACCGGCCCATGTTTCCAAGAAGCCCAAGATCATCTGG
GCCTCTGGATGGTAAAAGCCCAAAGTTTAATGGGCTGGCGCAGGGGAAAATGATGAGGTGGATACCGCCGAGCCGGAGCCCAAGAGTATCTGGGCCGCTGGATGGACGAG
ATGGAAGCCCAAGAATTTGCTGTCGTTGTAATAAGGAGAGGATAGAAACGGACGATGATTATCATTCGTTGTGGGTTTCTTTGTTTTATGACATGAAACCAACTTGAATC
TTCTTTTCTTTTTTCTTTTTTTTTTTTCCTTCTTCTTTGGAGGGTTTTGTTTTATATGATTATTATGTGTGCATTTTATTTCAATGCCAATGCAACTAAAAGAAGGTTGT
CTACTCAAATTTTGTATGAGTTTATGGCTTAGATATGCTATGGTAAATTTTTATGTGTGTTTTTGTTGTCAATTTAGTTGATATGTTAAGAGATGTGTTGATACTTA
Protein sequenceShow/hide protein sequence
MDWFSWLSRTGLDPLYTYEYGLLFARNGLKPEDIPTFNHDFLHKIGISIAKHRLEILKLAKFHREEATHKKLLLSAFTKTKNCLRNCLRKLILTNSRPDKPLFRLDAAEI
SPEPPTHGDVTVVKEVAKPPRRRSKNVSHSGPLDGRTHEKLMMNSKSLKLSGPLERKDRPMFPRSPRSSGPLDGKSPKFNGLAQGKMMRWIPPSRSPRVSGPLDGRDGSP
RICCRCNKERIETDDDYHSLWVSLFYDMKPT