; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G03640 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G03640
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLate embryogenesis abundant protein, LEA-14
Genome locationClcChr06:3861164..3862127
RNA-Seq ExpressionClc06G03640
SyntenyClc06G03640
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-8376.26Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        M DKDQA P A  TH R SSD+ + +L+LKRIQRRRFIK   FI  LLII ++I+I+ILMFTLFQ+KDPIIQMN+ISITKLELIN VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPN+ASFKYSNTTTTL+INETVIGEARGPPG+AKARRTV+MN++I+IV DR+L NL+ D+S GK+RL+SFSR+PGRVKLLH++ RN+VVKMNC+  
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIFN+SIEDQ CKRKVKI
Subjt:  INIFNRSIEDQECKRKVKI

TYK14031.1 Late embryogenesis abundant protein, LEA-14 [Cucumis melo var. makuwa]1.7e-8987.75Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        MV KDQA P   AT  R SSDNGET+L+LKRIQR+RFIKCCSFIA LLIIPTI+IIIILMFTLFQIKDPII+MNR+SITKLELIN+VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPNMASFKYSNTTTTLFINETVIGE RGPPGKAKAR+TVRMNV+IDIVADRVLSNL++DVSLGKVRL+SFSRIPG+VKLLHLIGRNVVVKMNC+F+
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIF
        INIF
Subjt:  INIF

XP_008458164.1 PREDICTED: uncharacterized protein LOC103497685 [Cucumis melo]2.4e-9686.3Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        MV KDQA P   AT  R SSDNGET+L+LKRIQR+RFIKCCSFIA LLIIPTI+IIIILMFTLFQIKDPII+MNR+SITKLELIN+VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPNMASFKYSNTTTTLFINETVIGE RGPPGKAKAR+TVRMNV+IDIVADRVLSNL++DVSLGKVRL+SFSRIPG+VKLLHLIGRNVVVKMNC+F+
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIF++SIEDQ+CKRK+K+
Subjt:  INIFNRSIEDQECKRKVKI

XP_011656360.1 uncharacterized protein LOC105435724 [Cucumis sativus]9.3e-9685.39Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        MVDKDQA P   AT +R SSDNGET+L+LKRIQR+RFIKCCSFI  LL+IPTI+IIIILMFTLFQIKDPIIQMNR+SITKLELIN+VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPNMASFKYSNTTTTLFINETVIGE RGP GKAKAR+TVRMNV+IDIVADRVLSNL++DVSLGKVRL+SFSRIPG+VKLLH IGRNVVVKMNC+F+
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIF++SIEDQ+CKRK+K+
Subjt:  INIFNRSIEDQECKRKVKI

XP_038875202.1 uncharacterized protein LOC120067718 [Benincasa hispida]6.2e-10886.12Show/hide
Query:  PYHCLINSSTQNPTTQDFSSLPYAITMVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMN
        P HCLINSSTQ     +    P  ITMVDKDQA P A ATHHRSSSDNGET L+LKRIQRRRFIKCC FI   LIIPTI+IIIILMFTLFQIKDP+I+MN
Subjt:  PYHCLINSSTQNPTTQDFSSLPYAITMVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMN

Query:  RISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSF
        R+SITKLELIN  IPKPGSN+SLTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNV+IDIVADRVLSNLDDDVSLGKVRL+SF
Subjt:  RISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSF

Query:  SRIPGRVKLLHLIGRNVVVKMNCSFMINIFNRSIEDQECKRKVKI
        SRIPGRVKLLHLIGRNVVVKMNC+F+INIFNRSIEDQECKRKVK+
Subjt:  SRIPGRVKLLHLIGRNVVVKMNCSFMINIFNRSIEDQECKRKVKI

TrEMBL top hitse value%identityAlignment
A0A0A0KD33 LEA_2 domain-containing protein4.5e-9685.39Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        MVDKDQA P   AT +R SSDNGET+L+LKRIQR+RFIKCCSFI  LL+IPTI+IIIILMFTLFQIKDPIIQMNR+SITKLELIN+VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPNMASFKYSNTTTTLFINETVIGE RGP GKAKAR+TVRMNV+IDIVADRVLSNL++DVSLGKVRL+SFSRIPG+VKLLH IGRNVVVKMNC+F+
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIF++SIEDQ+CKRK+K+
Subjt:  INIFNRSIEDQECKRKVKI

A0A1S3C8G8 uncharacterized protein LOC1034976851.2e-9686.3Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        MV KDQA P   AT  R SSDNGET+L+LKRIQR+RFIKCCSFIA LLIIPTI+IIIILMFTLFQIKDPII+MNR+SITKLELIN+VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPNMASFKYSNTTTTLFINETVIGE RGPPGKAKAR+TVRMNV+IDIVADRVLSNL++DVSLGKVRL+SFSRIPG+VKLLHLIGRNVVVKMNC+F+
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIF++SIEDQ+CKRK+K+
Subjt:  INIFNRSIEDQECKRKVKI

A0A5D3CQG2 Late embryogenesis abundant protein, LEA-148.2e-9087.75Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        MV KDQA P   AT  R SSDNGET+L+LKRIQR+RFIKCCSFIA LLIIPTI+IIIILMFTLFQIKDPII+MNR+SITKLELIN+VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPNMASFKYSNTTTTLFINETVIGE RGPPGKAKAR+TVRMNV+IDIVADRVLSNL++DVSLGKVRL+SFSRIPG+VKLLHLIGRNVVVKMNC+F+
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIF
        INIF
Subjt:  INIF

A0A6J1H4K3 uncharacterized protein LOC1114603391.7e-8275.8Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        M DKDQA P A AT  R SSD+ + KL+LKRIQRRRFIK   FI  LLII ++ +I+IL+FTLFQ+KDPIIQMN ISITKLELIN VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPN+ASFKYSNTTTTL+INETVIGEARGPPG+AKARRTVRMN++I+IV DR+L NL+ D+S GK+RL+SFSR+PGRVK+LH++ RN+VVKMNC+  
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIFN+SIEDQ+CKRKVKI
Subjt:  INIFNRSIEDQECKRKVKI

A0A6J1L0R6 uncharacterized protein LOC1114993183.3e-8375.8Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD
        M DKDQA P A AT  R SSD+ + KL+LK+IQR RFIK   FI  LL+I ++++I+ILMFTLFQ+KDPIIQMN+ISITKLELIN VIPKPGSNVSLTAD
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTAD

Query:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM
        VSVKNPN+ASFKYSNTTTTL+INETVIGEARGPPG+AKARRTVRMN++I+IV DR+L NL++D+S GK+RL+SFSR+PGRVKLLH+I RN+VVKMNC+  
Subjt:  VSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFM

Query:  INIFNRSIEDQECKRKVKI
        INIFN+SIEDQ+CKRKVKI
Subjt:  INIFNRSIEDQECKRKVKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64450.1 Glycine-rich protein family4.0e-0424.59Show/hide
Query:  KRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGE
        +R   R  +  C+ +AT+ ++  +++++++ FT+F+ KDP I +N + +    + N+      +N S +  V+V+NPN A F + +++  L  +   +G 
Subjt:  KRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGE

Query:  ARGPPGKAKARRTVRMNVSIDI
           P GK  + R   M  +  +
Subjt:  ARGPPGKAKARRTVRMNVSIDI

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.2e-3942.04Show/hide
Query:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQR-RRFIKCCSFI-ATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELI--NDVIPKPGSNVS
        M D +   P A AT    S ++      +K   R R  IKC   + AT LI+ T  I++ L+FT+F++KDPII+MN + +  L+ +   + +   G+N+S
Subjt:  MVDKDQAHPFASATHHRSSSDNGETKLYLKRIQR-RRFIKCCSFI-ATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELI--NDVIPKPGSNVS

Query:  LTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSN--LDDDVS-LGKVRLQSFSRIPGRVKLLHLIGRNVVV
        +  DVSVKNPN ASFKYSNTTT ++   T++GEA G PGKA+  RT RMNV++DI+ DR+LS+  L  ++S  G V + S++R+ G+VK++ ++ ++V V
Subjt:  LTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSN--LDDDVS-LGKVRLQSFSRIPGRVKLLHLIGRNVVV

Query:  KMNCSFMINIFNRSIEDQECKRKVKI
        KMNC+  +NI  ++I+D +CK+K+ +
Subjt:  KMNCSFMINIFNRSIEDQECKRKVKI

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.4e-1122.95Show/hide
Query:  CCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKA
        CC     + ++  I +  +++  +F+ K PI+Q    ++  +     +  +   N +LT ++ +KNPN+A F+Y      ++  +T++G    P     A
Subjt:  CCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKA

Query:  RRTVRMNVSIDIVADRVLSNLDD---DVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFMINIFNRSIEDQECKRKVKI
        + +V +   + +  D+ ++NL D   DV  GK+ +++ +++PG++ LL +    +    +C+ ++   +  +EDQ C  K K+
Subjt:  RRTVRMNVSIDIVADRVLSNLDD---DVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFMINIFNRSIEDQECKRKVKI

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.7e-2432.71Show/hide
Query:  PFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLEL-INDVIPKPGSNVSLTADVSVKNPN
        P  +A+   + S N  T    K+++R+R  K C     LLI+   I+I+IL FTLF+ K P   ++ +++ +L+  +N ++ K   N++L  D+S+KNPN
Subjt:  PFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISITKLEL-INDVIPKPGSNVSLTADVSVKNPN

Query:  MASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLS--NLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFMINIFN
           F Y +++  L     VIGEA  P  +  AR+TV +N+++ ++ADR+LS   L  DV  G + L +F ++ G+V +L +    V    +C   I++ +
Subjt:  MASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLS--NLDDDVSLGKVRLQSFSRIPGRVKLLHLIGRNVVVKMNCSFMINIFN

Query:  RSIEDQECKRKVKI
        R++  Q CK   K+
Subjt:  RSIEDQECKRKVKI

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.2e-1227.14Show/hide
Query:  AITMVDKDQAHPFASA-THHRSSSDNGETKLYLKRIQ----RRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISIT-KLELINDVIPKP
        A++ +++DQA P A      RS   + E + +  R +    + + I CC FIA+L ++   +  I+L  T+F +  P + ++ IS   + + +N  +   
Subjt:  AITMVDKDQAHPFASA-THHRSSSDNGETKLYLKRIQ----RRRFIKCCSFIATLLIIPTIIIIIILMFTLFQIKDPIIQMNRISIT-KLELINDVIPKP

Query:  GSNVSLTADVSVKNPNMASFKYSNTTTTLFINE-TVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLD---DDVSLGKVRLQSFSRIPGRVKLLHLI
          N +++ ++S+ NPN A F   N   + +  E  V+GE+        A+RTV+MN++ +IV  ++L++L    +D++   V L+S   + GRVK + + 
Subjt:  GSNVSLTADVSVKNPNMASFKYSNTTTTLFINE-TVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLD---DDVSLGKVRLQSFSRIPGRVKLLHLI

Query:  GRNVVVKMNC
         + V ++ +C
Subjt:  GRNVVVKMNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTGCTTACCTTCAATTCAACATTGTTCAATGGCTTCAAATTTTGGGCATTTCCCTACCATTGCCTTATAAACTCAAGCACTCAAAATCCCACAACCCAAGATTT
TTCTTCTCTCCCTTATGCCATAACCATGGTGGACAAGGACCAAGCTCATCCATTTGCCTCAGCTACCCACCATCGTTCGAGCAGCGACAACGGCGAAACAAAATTATATC
TAAAGAGAATCCAACGAAGAAGATTCATAAAATGTTGCAGTTTCATAGCCACCCTTCTCATAATACCAACAATAATCATCATCATCATCTTGATGTTCACTCTATTTCAA
ATCAAGGATCCCATAATTCAAATGAACAGAATTTCAATCACAAAGCTCGAGTTGATCAACGATGTCATACCAAAGCCAGGATCCAACGTGTCACTAACTGCTGACGTGTC
AGTGAAAAATCCCAACATGGCATCGTTCAAGTATAGTAACACGACCACTACTTTGTTCATTAATGAGACAGTGATAGGGGAGGCACGAGGGCCGCCAGGGAAAGCCAAGG
CACGACGAACGGTGCGAATGAACGTCTCCATCGACATCGTTGCTGATCGAGTCTTGTCGAACCTCGACGATGACGTGAGTTTGGGGAAGGTGAGATTGCAAAGCTTTTCG
AGGATTCCGGGGAGGGTAAAGTTGCTGCATCTTATAGGAAGAAATGTTGTTGTCAAAATGAATTGTTCTTTCATGATCAATATCTTCAACAGGTCAATTGAGGATCAGGA
ATGCAAAAGGAAGGTGAAAATTTAG
mRNA sequenceShow/hide mRNA sequence
CTAAAAGTCATGCATGTCTTTGCTTACCTTCAATTCAACATTGTTCAATGGCTTCAAATTTTGGGCATTTCCCTACCATTGCCTTATAAACTCAAGCACTCAAAATCCCA
CAACCCAAGATTTTTCTTCTCTCCCTTATGCCATAACCATGGTGGACAAGGACCAAGCTCATCCATTTGCCTCAGCTACCCACCATCGTTCGAGCAGCGACAACGGCGAA
ACAAAATTATATCTAAAGAGAATCCAACGAAGAAGATTCATAAAATGTTGCAGTTTCATAGCCACCCTTCTCATAATACCAACAATAATCATCATCATCATCTTGATGTT
CACTCTATTTCAAATCAAGGATCCCATAATTCAAATGAACAGAATTTCAATCACAAAGCTCGAGTTGATCAACGATGTCATACCAAAGCCAGGATCCAACGTGTCACTAA
CTGCTGACGTGTCAGTGAAAAATCCCAACATGGCATCGTTCAAGTATAGTAACACGACCACTACTTTGTTCATTAATGAGACAGTGATAGGGGAGGCACGAGGGCCGCCA
GGGAAAGCCAAGGCACGACGAACGGTGCGAATGAACGTCTCCATCGACATCGTTGCTGATCGAGTCTTGTCGAACCTCGACGATGACGTGAGTTTGGGGAAGGTGAGATT
GCAAAGCTTTTCGAGGATTCCGGGGAGGGTAAAGTTGCTGCATCTTATAGGAAGAAATGTTGTTGTCAAAATGAATTGTTCTTTCATGATCAATATCTTCAACAGGTCAA
TTGAGGATCAGGAATGCAAAAGGAAGGTGAAAATTTAGACTTTAATATTATTTTTTTTTCCCTTCATTTGAAGTTTTTAACCTTTTTATATGCTCAATTTTTGGTTCTGC
AATGTGTGTGTTGGTGTTAGGACAATCGTTTCCTTGATTTCTCATTTAGTAAAATATGAAAATCTAATATAGCTTATTAATATT
Protein sequenceShow/hide protein sequence
MSLLTFNSTLFNGFKFWAFPYHCLINSSTQNPTTQDFSSLPYAITMVDKDQAHPFASATHHRSSSDNGETKLYLKRIQRRRFIKCCSFIATLLIIPTIIIIIILMFTLFQ
IKDPIIQMNRISITKLELINDVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGEARGPPGKAKARRTVRMNVSIDIVADRVLSNLDDDVSLGKVRLQSFS
RIPGRVKLLHLIGRNVVVKMNCSFMINIFNRSIEDQECKRKVKI