; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G007730 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G007730
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionLEA_2 domain-containing protein
Genome locationCmo_Chr01:4009949..4010671
RNA-Seq ExpressionCmoCh01G007730
SyntenyCmoCh01G007730
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013783 - Immunoglobulin-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607382.1 hypothetical protein SDJN03_00724, partial [Cucurbita argyrosperma subsp. sororia]4.7e-12399.17Show/hide
Query:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
        MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFS L AFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
Subjt:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST

Query:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
        NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
Subjt:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF

Query:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
        HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
Subjt:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR

KAG7037051.1 hypothetical protein SDJN02_00672, partial [Cucurbita argyrosperma subsp. argyrosperma]7.1e-10387.45Show/hide
Query:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
        MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFS L AFCAWICLAVFGIAITLLI             P   +K+         NST
Subjt:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST

Query:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
        NQN AVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
Subjt:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF

Query:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPT
        HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPT
Subjt:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPT

XP_008457557.1 PREDICTED: uncharacterized protein LOC103497223 [Cucumis melo]1.1e-7162.4Show/hide
Query:  NNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS-----TSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQN
        NN  A +LDR+PS +KG+RRVAFSDSLPKHR+      S R  K     L A CAWIC+ +FGI + +LILGVIFVSFLQSGLPEITV+ML+LS  +I+N
Subjt:  NNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS-----TSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQN

Query:  STNQ--NVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVG
        STNQ  N A+LN K+ M+I+++NKNEK+ELSYS + + LVSE+++LGR+VIPSFS  PGNTT LNVT+NV+R S D+D++S LEDDRKK Q+ V++ M  
Subjt:  STNQ--NVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVG

Query:  SVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFP
         VGFH+GIF L  VPIHV C+FQQ LL+YR+ EPPC+I MFP
Subjt:  SVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFP

XP_022949169.1 uncharacterized protein LOC111452600 [Cucurbita moschata]1.9e-124100Show/hide
Query:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
        MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
Subjt:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST

Query:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
        NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
Subjt:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF

Query:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
        HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
Subjt:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR

XP_022998792.1 uncharacterized protein LOC111493353 [Cucurbita maxima]1.2e-12197.92Show/hide
Query:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
        MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS SLRATKFVFSHL AFCAWICLAVFGIAITLLILGVIFVSFLQS LPEITVKMLDLSKIQIQNST
Subjt:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST

Query:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
        NQNVAVLNTKVRMAIDI+NKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTL VDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
Subjt:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF

Query:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
        HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
Subjt:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR

TrEMBL top hitse value%identityAlignment
A0A1S3C5S1 uncharacterized protein LOC1034972235.4e-7262.4Show/hide
Query:  NNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS-----TSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQN
        NN  A +LDR+PS +KG+RRVAFSDSLPKHR+      S R  K     L A CAWIC+ +FGI + +LILGVIFVSFLQSGLPEITV+ML+LS  +I+N
Subjt:  NNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS-----TSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQN

Query:  STNQ--NVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVG
        STNQ  N A+LN K+ M+I+++NKNEK+ELSYS + + LVSE+++LGR+VIPSFS  PGNTT LNVT+NV+R S D+D++S LEDDRKK Q+ V++ M  
Subjt:  STNQ--NVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVG

Query:  SVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFP
         VGFH+GIF L  VPIHV C+FQQ LL+YR+ EPPC+I MFP
Subjt:  SVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFP

A0A5A7V2C7 Putative transmembrane protein5.4e-7262.4Show/hide
Query:  NNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS-----TSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQN
        NN  A +LDR+PS +KG+RRVAFSDSLPKHR+      S R  K     L A CAWIC+ +FGI + +LILGVIFVSFLQSGLPEITV+ML+LS  +I+N
Subjt:  NNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS-----TSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQN

Query:  STNQ--NVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVG
        STNQ  N A+LN K+ M+I+++NKNEK+ELSYS + + LVSE+++LGR+VIPSFS  PGNTT LNVT+NV+R S D+D++S LEDDRKK Q+ V++ M  
Subjt:  STNQ--NVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVG

Query:  SVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFP
         VGFH+GIF L  VPIHV C+FQQ LL+YR+ EPPC+I MFP
Subjt:  SVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFP

A0A6J1E0W1 uncharacterized protein LOC1110249234.7e-5250.88Show/hide
Query:  DKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNSTNQ--NVAVLNTKVRMAI
        DKG RRV FS+SLP HR+TS   TK     L A+C  IC+  FGI + LLI+ VIF+SFLQSGLPEI++K L LSK +I +STNQ  N AVL+ +V +++
Subjt:  DKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNSTNQ--NVAVLNTKVRMAI

Query:  DIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVT
         ++NKN+K+ELSY D+ + + S++++LG++VI  FS  PGNTT LNVT NV  D +DR++   +++++K+ ++V ++ M   +GFH GIF + KVPIHV 
Subjt:  DIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVT

Query:  C-NFQQYLLLYRVKEPPCSITMFPTR
        C + QQ+LL+ R+KE  C+I MFP R
Subjt:  C-NFQQYLLLYRVKEPPCSITMFPTR

A0A6J1GC15 uncharacterized protein LOC1114526009.3e-125100Show/hide
Query:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
        MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
Subjt:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST

Query:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
        NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
Subjt:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF

Query:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
        HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
Subjt:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR

A0A6J1K8Y2 uncharacterized protein LOC1114933535.6e-12297.92Show/hide
Query:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST
        MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRS SLRATKFVFSHL AFCAWICLAVFGIAITLLILGVIFVSFLQS LPEITVKMLDLSKIQIQNST
Subjt:  MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNST

Query:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
        NQNVAVLNTKVRMAIDI+NKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTL VDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF
Subjt:  NQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGF

Query:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
        HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR
Subjt:  HLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMFPTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G30505.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.6e-1426.88Show/hide
Query:  CAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNSTNQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFS
        CA  C+ V  + I +L++G+   S ++S LP++ V  L  S++ I  S+     ++N  +   + + N N+K  L YS +   + SENI LG+  +  F 
Subjt:  CAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNSTNQNVAVLNTKVRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFS

Query:  QEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMF
        Q+PGN TSL +   + +  +     +LL +  K  + +V + + G +      FK++ +PI + C   +   +    +P C + +F
Subjt:  QEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTCNFQQYLLLYRVKEPPCSITMF

AT4G01110.1 unknown protein7.0e-0826.29Show/hide
Query:  PKHR--STSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGV-IFVSFLQSGLPEITVKMLDLSKIQIQ-NSTNQNVAVLNTKVRMAIDIKNKNEKLEL
        PKH   +      K  +S    FC  +C+ V  I I LLIL V +F  +    LP + +    +S            ++ L  +    +D +N N KL  
Subjt:  PKHR--STSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGV-IFVSFLQSGLPEITVKMLDLSKIQIQ-NSTNQNVAVLNTKVRMAIDIKNKNEKLEL

Query:  SYSDLNMKL-VSEN---IELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTC
         Y ++++ + V E+     LG   +  F ++PGN T + V + V +  +D  ++  L  D K  ++VVK+     VG  +G  K+  V + ++C
Subjt:  SYSDLNMKL-VSEN---IELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGATAACAATCATGTCGCCATTCAACTCGATCGAGTTCCAAGCACTGACAAAGGAGCCCGTCGCGTCGCCTTCTCCGATTCCCTCCCCAAACACCGCTCCACATC
CCTCCGCGCCACCAAATTCGTCTTCTCCCATCTGCTCGCCTTCTGCGCTTGGATTTGCCTCGCCGTGTTCGGAATCGCCATCACTCTTCTCATCCTCGGCGTAATCTTCG
TGTCATTCCTCCAATCCGGTTTGCCTGAAATCACCGTCAAAATGCTCGATCTCTCCAAAATCCAGATTCAAAACTCCACAAATCAAAATGTCGCTGTCCTAAACACAAAG
GTACGTATGGCGATCGATATAAAGAACAAGAACGAGAAATTGGAGTTGAGTTATAGCGATCTTAATATGAAATTAGTATCAGAAAACATCGAATTAGGCAGGAATGTGAT
ACCTAGTTTCTCTCAAGAACCTGGAAACACCACATCGCTAAATGTAACGCTGAATGTGGATCGAGATTCAATAGATCGAGACAGTATATCGCTGCTTGAAGATGACAGAA
AAAAGGCTCAAGTGGTTGTGAAGATCACGATGGTTGGTTCGGTTGGATTTCATCTTGGGATATTCAAGCTCAACAAGGTGCCGATCCATGTGACCTGTAATTTTCAGCAA
TATCTTCTTCTTTATCGCGTCAAGGAGCCGCCGTGTAGTATTACAATGTTTCCTACCAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGATAACAATCATGTCGCCATTCAACTCGATCGAGTTCCAAGCACTGACAAAGGAGCCCGTCGCGTCGCCTTCTCCGATTCCCTCCCCAAACACCGCTCCACATC
CCTCCGCGCCACCAAATTCGTCTTCTCCCATCTGCTCGCCTTCTGCGCTTGGATTTGCCTCGCCGTGTTCGGAATCGCCATCACTCTTCTCATCCTCGGCGTAATCTTCG
TGTCATTCCTCCAATCCGGTTTGCCTGAAATCACCGTCAAAATGCTCGATCTCTCCAAAATCCAGATTCAAAACTCCACAAATCAAAATGTCGCTGTCCTAAACACAAAG
GTACGTATGGCGATCGATATAAAGAACAAGAACGAGAAATTGGAGTTGAGTTATAGCGATCTTAATATGAAATTAGTATCAGAAAACATCGAATTAGGCAGGAATGTGAT
ACCTAGTTTCTCTCAAGAACCTGGAAACACCACATCGCTAAATGTAACGCTGAATGTGGATCGAGATTCAATAGATCGAGACAGTATATCGCTGCTTGAAGATGACAGAA
AAAAGGCTCAAGTGGTTGTGAAGATCACGATGGTTGGTTCGGTTGGATTTCATCTTGGGATATTCAAGCTCAACAAGGTGCCGATCCATGTGACCTGTAATTTTCAGCAA
TATCTTCTTCTTTATCGCGTCAAGGAGCCGCCGTGTAGTATTACAATGTTTCCTACCAGGTGA
Protein sequenceShow/hide protein sequence
MTDNNHVAIQLDRVPSTDKGARRVAFSDSLPKHRSTSLRATKFVFSHLLAFCAWICLAVFGIAITLLILGVIFVSFLQSGLPEITVKMLDLSKIQIQNSTNQNVAVLNTK
VRMAIDIKNKNEKLELSYSDLNMKLVSENIELGRNVIPSFSQEPGNTTSLNVTLNVDRDSIDRDSISLLEDDRKKAQVVVKITMVGSVGFHLGIFKLNKVPIHVTCNFQQ
YLLLYRVKEPPCSITMFPTR