CuGenDBv2

CmoCh17G004350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh17G004350
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: LEA_2 domain-containing protein
Genome location: Cmo_Chr17:2920511..2921170
RNA-Seq Expression: CmoCh17G004350
Synteny: CmoCh17G004350
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
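
The genome location above can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in this record); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(loc: str):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumes 1-based, inclusive coordinates (not confirmed by the record).
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive interval."""
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr17:2920511..2921170")
print(chrom, start, end, span_length(start, end))  # Cmo_Chr17 2920511 2921170 660
```

Under that assumption the gene spans 660 bases on Cmo_Chr17.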


Homology
GenBank top hits | e value | % identity | Alignment
KAG6575200.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia] | 8.2e-106 | 95.89
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAP   CRPSSDDYQE+LHLKRIQRRRFIKLFCFIIGLLIILSV VILIL+FTLFQVKDPIIQMN ISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVK+LHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQDCKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 6.3e-106 | 95.89
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAP T CRPSSDDYQE+LHLKRIQRRRFIKLFCFIIGLLIILSV VILIL+FTLFQVKDPIIQMN ISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVK+LHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

XP_022959336.1 uncharacterized protein LOC111460339 [Cucurbita moschata] | 2.9e-111 | 100
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQDCKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

XP_023006660.1 uncharacterized protein LOC111499318 [Cucurbita maxima] | 3.4e-104 | 94.52
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLA ATDCRPSSDDYQEKLHLK+IQR RFIK FCFII LL+ILSV VILIL+FTLFQVKDPIIQMN ISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLN+DMSSGKLRLRSFSRVPGRVK+LHI+RRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQDCKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

XP_023548342.1 uncharacterized protein LOC111807010 [Cucurbita pepo subsp. pepo] | 1.5e-104 | 94.06
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAPATDCRPS+DDYQEKLHLKR  +RRFIKLFCFIIGLL+ILSV VILILIFTLFQVKDPIIQMN ISITKLELING+IPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVD+LLLNLNSDMSSGKLRLRSFSRVPGRVK+LHI+RRNI+VKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQDCKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KD33 LEA_2 domain-containing protein | 2.6e-81 | 73.97
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        M DKDQA+PL PAT  R SSD+ + +LHLKRIQR+RFIK   FI+ LL+I ++ +I+IL+FTLFQ+KDPIIQMN +SITKLELIN VIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTL+INETVIGE RGP G+AKAR+TVRMN+TI+IV DR+L NLN+D+S GK+RLRSFSR+PG+VK+LH + RN+VVKMNCT  
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIF+KSIEDQ CKRK+K+
Subjt:  INIFNKSIEDQDCKRKVKI

A0A1S3C8G8 uncharacterized protein LOC103497685 | 5.8e-81 | 73.97
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        M  KDQA+PL PAT  R SSD+ + +LHLKRIQR+RFIK   FI  LLII ++ +I+IL+FTLFQ+KDPII+MN +SITKLELIN VIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTL+INETVIGE RGPPG+AKAR+TVRMN+TI+IV DR+L NLN+D+S GK+RLRSFSR+PG+VK+LH++ RN+VVKMNCT  
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIF+KSIEDQ CKRK+K+
Subjt:  INIFNKSIEDQDCKRKVKI

A0A5D3CQG2 Late embryogenesis abundant protein, LEA-14 | 4.0e-74 | 74.02
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        M  KDQA+PL PAT  R SSD+ + +LHLKRIQR+RFIK   FI  LLII ++ +I+IL+FTLFQ+KDPII+MN +SITKLELIN VIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTL+INETVIGE RGPPG+AKAR+TVRMN+TI+IV DR+L NLN+D+S GK+RLRSFSR+PG+VK+LH++ RN+VVKMNCT  
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIF
        INIF
Subjt:  INIF

A0A6J1H4K3 uncharacterized protein LOC111460339 | 1.4e-111 | 100
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQDCKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

A0A6J1L0R6 uncharacterized protein LOC111499318 | 1.7e-104 | 94.52
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLA ATDCRPSSDDYQEKLHLK+IQR RFIK FCFII LL+ILSV VILIL+FTLFQVKDPIIQMN ISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLN+DMSSGKLRLRSFSRVPGRVK+LHI+RRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQDCKRKVKI
        INIFNKSIEDQDCKRKVKI
Subjt:  INIFNKSIEDQDCKRKVKI

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 6.2e-43 | 43.75
Query:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGV--IPKPGSNVSLT
        MAD +  RPLAPAT   P SD+    +      R R     C     LI+ +  ++L L+FT+F+VKDPII+MN + +  L+ + G   +   G+N+S+ 
Subjt:  MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGV--IPKPGSNVSLT

Query:  ADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLN--LNSDMS-SGKLRLRSFSRVPGRVKVLHILRRNIVVKM
         DVSVKNPN ASFKYSNTTT +Y   T++GEA G PG+A+  RT RMN+T++I++DR+L +  L  ++S SG + + S++RV G+VK++ I+++++ VKM
Subjt:  ADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLN--LNSDMS-SGKLRLRSFSRVPGRVKVLHILRRNIVVKM

Query:  NCTSTINIFNKSIEDQDCKRKVKI
        NCT  +NI  ++I+D DCK+K+ +
Subjt:  NCTSTINIFNKSIEDQDCKRKVKI

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.1e-10 | 24.32
Query:  KLFCFIIGLLIILSVGVILILIFT-LFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQA
        ++ C + G++ +L V  +  LI   +F+ K PI+Q  + ++  +     +  +   N +LT ++ +KNPNVA F+Y      +Y  +T++G    P    
Subjt:  KLFCFIIGLLIILSVGVILILIFT-LFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQA

Query:  KARRTVRMNLTINIVVDRLLLNLN---SDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI
         A+ +V +   + + +D+ + NL     D+  GK+ + + +++PG++ +L I +  +    +C   +   +  +EDQ C  K K+
Subjt:  KARRTVRMNLTINIVVDRLLLNLN---SDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 5.1e-05 | 28.93
Query:  KDPIIQMNNISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLN---SD
        KDP   + +I +T L+L       P  +  L   V V NPN+A+  YS+T  T+  + TV+G A    G   AR    + L   +    L  +     SD
Subjt:  KDPIIQMNNISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLN---SD

Query:  MSSGKLRLRSFSRVPGRVKVL
        +++ +++L +   + G  KVL
Subjt:  MSSGKLRLRSFSRVPGRVKVL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.2e-22 | 35.57
Query:  KRIQRRRFIKL-FCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLEL-INGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVI
        K+++R+R  K+  CF I LLI+L   VI+IL FTLF+ K P   ++++++ +L+  +N ++ K   N++L  D+S+KNPN   F Y +++  L     VI
Subjt:  KRIQRRRFIKL-FCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLEL-INGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVI

Query:  GEARGPPGQAKARRTVRMNLTINIVVDRLL--LNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI
        GEA  P  +  AR+TV +N+T+ ++ DRLL    L SD+ +G + L +F +V G+V VL I +  +    +C  +I++ ++++  Q CK   K+
Subjt:  GEARGPPGQAKARRTVRMNLTINIVVDRLL--LNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 6.0e-14 | 29.41
Query:  DKDQARPLAP----ATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISIT-KLELINGVIPKPGSNVSL
        ++DQA+PLAP        +P  +D       K +  +  + L C  I  L +L     ++L  T+F +  P + +++IS   + + +NG +     N ++
Subjt:  DKDQARPLAP----ATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISIT-KLELINGVIPKPGSNVSL

Query:  TADVSVKNPNVASFKYSNTTTTLYINE-TVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLN---LNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVV
        + ++S+ NPN A F   N   + Y  E  V+GE+        A+RTV+MNLT  IV  +LL +   L  D++   + L+S   V GRVK + I R+ + +
Subjt:  TADVSVKNPNVASFKYSNTTTTLYINE-TVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLN---LNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVV

Query:  KMNC
        + +C
Subjt:  KMNC
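
The middle line of each alignment block marks how the two sequences compare at each position. The following is a minimal sketch of counting identities from such a line, assuming the usual BLAST-style convention (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap); the function names are hypothetical, and the example uses only the last segment of the KAG7013763.1 alignment, not the whole hit.

```python
def count_identities(midline: str) -> int:
    """Count identical positions: letters in the midline (assumed BLAST-style)."""
    return sum(1 for ch in midline if ch.isalpha())


def percent_identity(query_segment: str, midline: str) -> float:
    """Percent identity over one aligned segment of equal-length strings."""
    if len(query_segment) != len(midline):
        raise ValueError("query segment and midline must have the same length")
    return 100.0 * count_identities(midline) / len(query_segment)


# Last segment of the KAG7013763.1 alignment shown above.
query = "INIFNKSIEDQDCKRKVKI"
mid   = "INIFNKSIEDQ+CKRKVKI"
print(count_identities(mid), len(query))          # 18 19
print(round(percent_identity(query, mid), 2))     # 94.74
```

The reported % identity for a hit is computed over the full alignment, so a single segment like this one will generally differ from the value in the hit line.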


Sequences
CDS sequence
ATGGCGGATAAGGACCAAGCTCGACCTCTCGCCCCAGCTACTGACTGCCGTCCGAGTAGCGATGACTACCAAGAAAAACTGCATCTAAAGAGAATCCAGCGAAGAAGGTT
CATAAAACTCTTTTGTTTCATAATTGGCCTTCTCATAATTCTATCAGTAGGAGTCATCCTCATCTTGATATTCACTCTATTTCAAGTCAAGGATCCCATTATTCAAATGA
ACAACATTTCAATCACAAAACTCGAGTTGATCAACGGTGTCATACCAAAGCCAGGATCCAACGTGTCATTGACAGCTGACGTGTCAGTGAAAAATCCTAACGTGGCGTCG
TTCAAGTACAGTAACACGACGACAACTCTGTACATTAACGAGACAGTGATAGGGGAGGCTCGAGGGCCGCCGGGGCAAGCCAAGGCACGACGAACGGTGCGAATGAACCT
TACTATCAACATCGTCGTTGACCGACTCTTGTTGAACCTTAATAGCGACATGAGCTCAGGGAAGCTGAGACTGAGAAGCTTTTCGAGAGTTCCGGGGAGAGTGAAAGTAT
TACATATTCTAAGGAGAAATATTGTTGTCAAAATGAACTGTACGTCGACGATCAATATCTTCAACAAATCGATTGAAGATCAAGATTGCAAGAGGAAGGTGAAGATATAG
mRNA sequence
ATGGCGGATAAGGACCAAGCTCGACCTCTCGCCCCAGCTACTGACTGCCGTCCGAGTAGCGATGACTACCAAGAAAAACTGCATCTAAAGAGAATCCAGCGAAGAAGGTT
CATAAAACTCTTTTGTTTCATAATTGGCCTTCTCATAATTCTATCAGTAGGAGTCATCCTCATCTTGATATTCACTCTATTTCAAGTCAAGGATCCCATTATTCAAATGA
ACAACATTTCAATCACAAAACTCGAGTTGATCAACGGTGTCATACCAAAGCCAGGATCCAACGTGTCATTGACAGCTGACGTGTCAGTGAAAAATCCTAACGTGGCGTCG
TTCAAGTACAGTAACACGACGACAACTCTGTACATTAACGAGACAGTGATAGGGGAGGCTCGAGGGCCGCCGGGGCAAGCCAAGGCACGACGAACGGTGCGAATGAACCT
TACTATCAACATCGTCGTTGACCGACTCTTGTTGAACCTTAATAGCGACATGAGCTCAGGGAAGCTGAGACTGAGAAGCTTTTCGAGAGTTCCGGGGAGAGTGAAAGTAT
TACATATTCTAAGGAGAAATATTGTTGTCAAAATGAACTGTACGTCGACGATCAATATCTTCAACAAATCGATTGAAGATCAAGATTGCAAGAGGAAGGTGAAGATATAG
Protein sequence
MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILIFTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTADVSVKNPNVAS
FKYSNTTTTLYINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKVLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI
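
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the `translate` helper is hypothetical and only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGGATAAGGACCAAGCTCGACCTCTC"))  # MADKDQARPL
print(translate("AAGGTGAAGATATAG"))                 # KVKI
```

The two outputs correspond to the first ten and last four residues of the protein sequence listed above; passing the full CDS to `translate` would allow the same comparison over the whole sequence.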