; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg26025 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg26025
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionLEA_2 domain-containing protein
Genome locationCarg_Chr17:2806594..2807253
RNA-Seq ExpressionCarg26025
SyntenyCarg26025
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575200.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia]2.1e-10999.09Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAPT HCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-110100Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQNCKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

XP_022959336.1 uncharacterized protein LOC111460339 [Cucurbita moschata]3.1e-10595.89Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAP T CRPSSDDYQE+LHLKRIQRRRFIKLFCFIIGLLIILSV VILIL+FTLFQVKDPIIQMN ISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVK+LHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

XP_023006660.1 uncharacterized protein LOC111499318 [Cucurbita maxima]8.5e-10393.61Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLA  T CRPSSDDYQE+LHLK+IQR RFIK FCFII LL+ILSV+VILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVDRLLLNLN+DMSSGKLRLRSFSRVPGRVKLLHI+RRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

XP_023548342.1 uncharacterized protein LOC111807010 [Cucurbita pepo subsp. pepo]1.9e-10292.24Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAP T CRPS+DDYQE+LHLKR  +RRFIKLFCFIIGLL+ILSV+VILIL+FTLFQVKDPIIQMNKISITKLELING+IPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVD+LLLNLNSDMSSGKLRLRSFSRVPGRVKLLHI+RRNI+VKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

TrEMBL top hitse value%identityAlignment
A0A0A0KD33 LEA_2 domain-containing protein1.5e-8173.97Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        M DKDQA+PL P T  R SSD+ + +LHLKRIQR+RFIK   FI+ LL+I ++++I+ILMFTLFQ+KDPIIQMN++SITKLELIN VIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTL+INETVIGE RGP G+AKAR+TV+MN+TI+IV DR+L NLN+D+S GK+RLRSFSR+PG+VKLLH + RN+VVKMNCT  
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIF+KSIEDQ CKRK+K+
Subjt:  INIFNKSIEDQNCKRKVKI

A0A1S3C8G8 uncharacterized protein LOC1034976852.0e-8173.97Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        M  KDQA+PL P T  R SSD+ + +LHLKRIQR+RFIK   FI  LLII ++++I+ILMFTLFQ+KDPII+MN++SITKLELIN VIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTL+INETVIGE RGPPG+AKAR+TV+MN+TI+IV DR+L NLN+D+S GK+RLRSFSR+PG+VKLLH++ RN+VVKMNCT  
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIF+KSIEDQ CKRK+K+
Subjt:  INIFNKSIEDQNCKRKVKI

A0A5D3CQG2 Late embryogenesis abundant protein, LEA-141.4e-7474.02Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        M  KDQA+PL P T  R SSD+ + +LHLKRIQR+RFIK   FI  LLII ++++I+ILMFTLFQ+KDPII+MN++SITKLELIN VIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPN+ASFKYSNTTTTL+INETVIGE RGPPG+AKAR+TV+MN+TI+IV DR+L NLN+D+S GK+RLRSFSR+PG+VKLLH++ RN+VVKMNCT  
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIF
        INIF
Subjt:  INIF

A0A6J1H4K3 uncharacterized protein LOC1114603391.5e-10595.89Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLAP T CRPSSDDYQE+LHLKRIQRRRFIKLFCFIIGLLIILSV VILIL+FTLFQVKDPIIQMN ISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVK+LHILRRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

A0A6J1L0R6 uncharacterized protein LOC1114993184.1e-10393.61Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
        MADKDQARPLA  T CRPSSDDYQE+LHLK+IQR RFIK FCFII LL+ILSV+VILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTAD

Query:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST
        VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTV+MNLTINIVVDRLLLNLN+DMSSGKLRLRSFSRVPGRVKLLHI+RRNIVVKMNCTST
Subjt:  VSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTST

Query:  INIFNKSIEDQNCKRKVKI
        INIFNKSIEDQ+CKRKVKI
Subjt:  INIFNKSIEDQNCKRKVKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64450.1 Glycine-rich protein family9.6e-0426.36Show/hide
Query:  CFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARR
        C +  + +++ ++V+L++ FT+F+ KDP     KIS+  ++L +  +    +N S +  V+V+NPN A F + +++  L  +   +G    P G+  + R
Subjt:  CFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARR

Query:  TVQMNLTINI
           M  T  +
Subjt:  TVQMNLTINI

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.3e-4042.41Show/hide
Query:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGV--IPKPGSNVSLT
        MAD +  RPLAP T   P SD+    +      R R     C     LI+ +  ++L L+FT+F+VKDPII+MN + +  L+ + G   +   G+N+S+ 
Subjt:  MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGV--IPKPGSNVSLT

Query:  ADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLN--LNSDMS-SGKLRLRSFSRVPGRVKLLHILRRNIVVKM
         DVSVKNPN ASFKYSNTTT +Y   T++GEA G PG+A+  RT +MN+T++I++DR+L +  L  ++S SG + + S++RV G+VK++ I+++++ VKM
Subjt:  ADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLN--LNSDMS-SGKLRLRSFSRVPGRVKLLHILRRNIVVKM

Query:  NCTSTINIFNKSIEDQNCKRKVKI
        NCT  +NI  ++I+D +CK+K+ +
Subjt:  NCTSTINIFNKSIEDQNCKRKVKI

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.1e-1124.86Show/hide
Query:  KLFCFIIGLLIILSVIVILILMFT-LFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQA
        ++ C + G++ +L VI +  L+   +F+ K PI+Q    ++  +     +  +   N +LT ++ +KNPNVA F+Y      +Y  +T++G    P    
Subjt:  KLFCFIIGLLIILSVIVILILMFT-LFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVIGEARGPPGQA

Query:  KARRTVQMNLTINIVVDRLLLNLN---SDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTSTINIFNKSIEDQNCKRKVKI
         A+ +V +   + + +D+ + NL     D+  GK+ + + +++PG++ LL I +  +    +C   +   +  +EDQ C  K K+
Subjt:  KARRTVQMNLTINIVVDRLLLNLN---SDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTSTINIFNKSIEDQNCKRKVKI

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.2e-2335.57Show/hide
Query:  KRIQRRRFIKL-FCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLEL-INGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVI
        K+++R+R  K+  CF I LLI+L  IVI+IL FTLF+ K P   ++ +++ +L+  +N ++ K   N++L  D+S+KNPN   F Y +++  L     VI
Subjt:  KRIQRRRFIKL-FCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLEL-INGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTLYINETVI

Query:  GEARGPPGQAKARRTVQMNLTINIVVDRLL--LNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTSTINIFNKSIEDQNCKRKVKI
        GEA  P  +  AR+TV +N+T+ ++ DRLL    L SD+ +G + L +F +V G+V +L I +  +    +C  +I++ ++++  Q+CK   K+
Subjt:  GEARGPPGQAKARRTVQMNLTINIVVDRLL--LNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTSTINIFNKSIEDQNCKRKVKI

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.6e-1429.9Show/hide
Query:  DKDQARPLAP----TTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISIT-KLELINGVIPKPGSNVSL
        ++DQA+PLAP    T   +P  +D       K +  +  + L C  I  L +L  +  ++L  T+F +  P + ++ IS   + + +NG +     N ++
Subjt:  DKDQARPLAP----TTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISIT-KLELINGVIPKPGSNVSL

Query:  TADVSVKNPNVASFKYSNTTTTLYINE-TVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLN---LNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVV
        + ++S+ NPN A F   N   + Y  E  V+GE+        A+RTV+MNLT  IV  +LL +   L  D++   + L+S   V GRVK + I R+ + +
Subjt:  TADVSVKNPNVASFKYSNTTTTLYINE-TVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLN---LNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVV

Query:  KMNC
        + +C
Subjt:  KMNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATAAGGACCAAGCTCGACCTCTCGCCCCAACTACTCACTGCCGTCCGAGTAGCGACGACTACCAAGAACAATTGCATCTAAAGAGAATCCAGCGAAGAAGGTT
CATAAAACTCTTCTGTTTCATAATTGGCCTTCTCATAATACTATCAGTAATAGTCATCCTTATCTTGATGTTCACTCTATTCCAAGTCAAGGATCCCATAATCCAAATGA
ACAAGATTTCAATCACAAAACTCGAGTTGATCAACGGTGTCATACCAAAGCCAGGATCCAACGTGTCATTGACAGCTGACGTGTCAGTGAAAAATCCTAACGTGGCGTCG
TTCAAGTACAGTAACACGACGACAACTCTGTACATTAACGAGACAGTGATAGGGGAGGCTCGAGGGCCGCCGGGGCAAGCCAAGGCACGACGAACGGTGCAAATGAACCT
TACTATCAACATCGTCGTTGACCGACTCTTGTTGAACCTTAACAGCGACATGAGCTCAGGGAAGCTGAGACTGAGAAGCTTTTCGAGAGTTCCAGGGAGAGTGAAACTAT
TACATATTCTAAGGAGAAATATTGTTGTCAAAATGAACTGTACGTCGACGATCAATATCTTCAACAAATCGATTGAAGATCAAAATTGCAAGAGGAAGGTGAAGATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGATAAGGACCAAGCTCGACCTCTCGCCCCAACTACTCACTGCCGTCCGAGTAGCGACGACTACCAAGAACAATTGCATCTAAAGAGAATCCAGCGAAGAAGGTT
CATAAAACTCTTCTGTTTCATAATTGGCCTTCTCATAATACTATCAGTAATAGTCATCCTTATCTTGATGTTCACTCTATTCCAAGTCAAGGATCCCATAATCCAAATGA
ACAAGATTTCAATCACAAAACTCGAGTTGATCAACGGTGTCATACCAAAGCCAGGATCCAACGTGTCATTGACAGCTGACGTGTCAGTGAAAAATCCTAACGTGGCGTCG
TTCAAGTACAGTAACACGACGACAACTCTGTACATTAACGAGACAGTGATAGGGGAGGCTCGAGGGCCGCCGGGGCAAGCCAAGGCACGACGAACGGTGCAAATGAACCT
TACTATCAACATCGTCGTTGACCGACTCTTGTTGAACCTTAACAGCGACATGAGCTCAGGGAAGCTGAGACTGAGAAGCTTTTCGAGAGTTCCAGGGAGAGTGAAACTAT
TACATATTCTAAGGAGAAATATTGTTGTCAAAATGAACTGTACGTCGACGATCAATATCTTCAACAAATCGATTGAAGATCAAAATTGCAAGAGGAAGGTGAAGATATAG
Protein sequenceShow/hide protein sequence
MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILMFTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVAS
FKYSNTTTTLYINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGRVKLLHILRRNIVVKMNCTSTINIFNKSIEDQNCKRKVKI