CuGenDBv2

Tan0013842 (gene) of Snake gourd v1 genome

Gene ID: Tan0013842
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LEA_2 domain-containing protein
Genome location: LG09:65190142..65192206
RNA-Seq Expression: Tan0013842
Synteny: Tan0013842
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
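
The genome location string above can be split into its linkage group and coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDBv2.

```python
import re

def parse_location(loc):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("LG09:65190142..65192206")
print(seqid, span_length(start, end))
```

Under that assumption the gene spans 2065 bp on LG09.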


Homology

GenBank top hits (e value, % identity, alignment)
KAG7021841.1 hypothetical protein SDJN02_15569, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.9e-94; % identity: 84.33)
Query:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV
        MTSSSR DS     SLLPQNA  G  N+V+LSLY PP Y HRRLLRLCA YSAAFLLL+A+AFLLFPSDPSLQLVRLKLN AKVRLLPV+VLDLS SAS+
Subjt:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV
        RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARGSS+VNAT+DLNG+EVIHDAFYLL+DLGKG+IPFD++TEVEG+MGFFFIKFPIKARVSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV

Query:  FVNTKNQTIEHQDCYPE
        FV+TK QTIEHQDCYPE
Subjt:  FVNTKNQTIEHQDCYPE

XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus] (e value: 1.9e-94; % identity: 83.72)
Query:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV
        MTSSS  DS PVPY+L+P NA Q NVVVLSLY PP  RHRRLLRLCA YSAAFLLL AVAFLLFPSDPSLQLVRLKLNR KV L+PVV LDLSFS S+RV
Subjt:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV
        RNKNFFSL+YN+LGVSVGYRGRRLG+VSSEGGRVSARGSS+VNATLDLNG+EV+HD  YLL DLGKG+IPFDTET+VEG MG FFIK PIKARVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV

Query:  NTKNQTIEHQDCYPE
        NT NQTIEHQDCYPE
Subjt:  NTKNQTIEHQDCYPE

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia] (e value: 2.8e-98; % identity: 86.51)
Query:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV
        MTSSSR DS PVPYSLLP NA   NVVVLSLY PPR+R RRLLRLCA YSAAFLLLSAVAFLLFP+DPSLQLVRLKLNR KVRLLPV++LDLSFSASVRV
Subjt:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARG S+VNATLDLNG EVIHD  YL+EDL  G++PFDTETEVEGYMG FFIKFPIKARVSCEVFV
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV

Query:  NTKNQTIEHQDCYPE
        NT ++TIEHQDCYPE
Subjt:  NTKNQTIEHQDCYPE

XP_023530779.1 uncharacterized protein LOC111793228 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.9e-94; % identity: 84.33)
Query:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV
        MTSSSR DS     SLLPQNA  G  N+V+LSLY PP Y HRRLLRLCA YSAAFLLL+A++FLLFPSDPSLQLVRLKLN AKVRLLPV+VLDLS SASV
Subjt:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV
        RVRNKNFFSLDYNYLGVSVGYRG+RLGFVSS+GGRVSARGSS+VNAT+DLNG+EVIHDAFYLL+DLGKG+IPFD++TEVEG+MGFFFIKFPIKARVSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV

Query:  FVNTKNQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKNQTIEHQDCYPE

XP_023530780.1 uncharacterized protein LOC111793228 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 1.9e-94; % identity: 84.33)
Query:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV
        MTSSSR DS     SLLPQNA  G  N+V+LSLY PP Y HRRLLRLCA YSAAFLLL+A++FLLFPSDPSLQLVRLKLN AKVRLLPV+VLDLS SASV
Subjt:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV
        RVRNKNFFSLDYNYLGVSVGYRG+RLGFVSS+GGRVSARGSS+VNAT+DLNG+EVIHDAFYLL+DLGKG+IPFD++TEVEG+MGFFFIKFPIKARVSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV

Query:  FVNTKNQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKNQTIEHQDCYPE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LTV4 LEA_2 domain-containing protein (e value: 9.0e-95; % identity: 83.72)
Query:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV
        MTSSS  DS PVPY+L+P NA Q NVVVLSLY PP  RHRRLLRLCA YSAAFLLL AVAFLLFPSDPSLQLVRLKLNR KV L+PVV LDLSFS S+RV
Subjt:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV
        RNKNFFSL+YN+LGVSVGYRGRRLG+VSSEGGRVSARGSS+VNATLDLNG+EV+HD  YLL DLGKG+IPFDTET+VEG MG FFIK PIKARVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV

Query:  NTKNQTIEHQDCYPE
        NT NQTIEHQDCYPE
Subjt:  NTKNQTIEHQDCYPE

A0A6J1CTN0 uncharacterized protein LOC111014473 (e value: 1.3e-98; % identity: 86.51)
Query:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV
        MTSSSR DS PVPYSLLP NA   NVVVLSLY PPR+R RRLLRLCA YSAAFLLLSAVAFLLFP+DPSLQLVRLKLNR KVRLLPV++LDLSFSASVRV
Subjt:  MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARG S+VNATLDLNG EVIHD  YL+EDL  G++PFDTETEVEGYMG FFIKFPIKARVSCEVFV
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFV

Query:  NTKNQTIEHQDCYPE
        NT ++TIEHQDCYPE
Subjt:  NTKNQTIEHQDCYPE

A0A6J1EZ17 uncharacterized protein LOC111437732 (e value: 3.8e-93; % identity: 82.95)
Query:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV
        MTSSSR DS     SLLPQNA  G  N+V+LSLY PP Y HRRLLRLCA YSAAFLLL+A++FLLFPSDPSLQLVRL+LN AKVRLLPV+VLDLS SAS+
Subjt:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV
        RVRNKNFFSLDYNYLGVSVGYRGR LGFVSS+GGRVSARG S+VNAT+DLNG+EVIHDAFYLL+DLGKG+IPFD++TEVEG+MGFFFIKFPIKARVSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV

Query:  FVNTKNQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKNQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC111485280 (e value: 6.4e-93; % identity: 84.11)
Query:  SSSRVDSAPVPYSLLPQNAGQP-NVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVR
        S S+  S PVPYS +P NA  P NVVVLSLY PP YR RRLLRLCALYSAAFLLLSAV FLLFPSDPSLQLVRLKLN  KVRLLP VVLDLSFSASVRVR
Subjt:  SSSRVDSAPVPYSLLPQNAGQP-NVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFVN
        NKNFFSLDYNYLGVSVG+RGRRLGFVSS+GGRVSARGSS+VNATLDLNG+++IHD F+LLEDL KG+IPFDTETEVEG MG FFIKFPIKA VSCEVFV+
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFVN

Query:  TKNQTIEHQDCYPE
        T +QTIEHQDCYPE
Subjt:  TKNQTIEHQDCYPE

A0A6J1KPJ6 uncharacterized protein LOC111497551 (e value: 7.6e-94; % identity: 84.33)
Query:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV
        MTSSSR DS     SLLPQNA  G  N+V+LSLY PP Y HRRLLRLCA YSAAFLLL+A++FLLFPSDPSLQLVRLKLN AKVRLLPV+VLDLS SASV
Subjt:  MTSSSRVDSAPVPYSLLPQNA--GQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV
        RVRNKNFFSLDYNYLGVSVGYRG RLGFVSS+GGRVSARGSS VNAT+DLNG+EVIHDAFYLL+DLGKG+IPFD++TEVEG+MGFFFIKFPIKARVSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEV

Query:  FVNTKNQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKNQTIEHQDCYPE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.2e-36; % identity: 38.16)
Query:  DSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVRNKNFFS
        D  P+P S    ++ + N  VL   HP     RR +    L S A +L+    ++ +PSDP ++++R+K++   V   PV  +D++   +++V N + +S
Subjt:  DSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVRNKNFFS

Query:  LDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFVNTKNQTI
         D+  L V++ YRG+ LG VSS+GG V+A GSS+++A  +L+G+ V  D  +L+ DL KG + FDT TE  G +G  F +FP+KA+V+C + V+T NQTI
Subjt:  LDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFVNTKNQTI

Query:  EHQDCYP
          Q C P
Subjt:  EHQDCYP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.9e-28; % identity: 36.56)
Query:  DSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVRNKNFFS
        D  P+P S    ++ + N  VL   HP     RR +    L S A +L+    ++ +PSDP ++++R+K++   V   PV  +D++   +++V N + +S
Subjt:  DSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVRNKNFFS

Query:  LDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKAR
         D+  L V++ YRG+ LG VSS+GG V+A GSS+++A  +L+G+ V  D  +L+ DL KG + FDT TE  G +G  F +FP+K R
Subjt:  LDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKAR

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.1e-52; % identity: 48.61)
Query:  SSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHR-----RLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSAS
        +SS+ +   +PY+ LP +    +V++L+ Y     RHR     R LR   L++A  LLLSA  +LL+PSDP + + R+ LN   V     + LDLSFS +
Subjt:  SSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHR-----RLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSAS

Query:  VRVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCE
        ++VRN++FFSLDY+ L VS+GYRGR LG V S+GG + AR SS+++ATL+L+G+EV+HD  YL+ DL KGVIPFDT  +V+G +G      PI+ +VSCE
Subjt:  VRVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCE

Query:  VFVNTKNQTIEHQDCY
        V+VN  NQ I HQDC+
Subjt:  VFVNTKNQTIEHQDCY
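
The unlabeled middle line of each alignment block above follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a percent identity can be derived from one block, the hypothetical helper below counts those symbols. It is an illustration only: the values reported in this record are computed over each full alignment by the database, not by this code.

```python
def midline_stats(midline, length):
    """Count identities and positives in a BLAST-style midline.

    `length` is the number of alignment columns in the block; the midline
    is padded with spaces in case trailing blanks were stripped.
    """
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives

# Final block of the first GenBank hit listed in this record.
query = "FVNTKNQTIEHQDCYPE"
midline = "FV+TK QTIEHQDCYPE"

identical, positives = midline_stats(midline, len(query))
print(f"{identical}/{len(query)} identical, {positives}/{len(query)} positive")
print(f"identity for this block: {100 * identical / len(query):.1f}%")
```

For this 17-column block the count is 15 identities and 16 positives.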


Sequences
CDS sequence
ATGACCTCCAGCTCCAGGGTCGATTCTGCCCCTGTGCCCTATTCTCTTCTTCCCCAAAATGCCGGACAGCCAAACGTCGTCGTTTTATCCCTCTACCATCCCCCTCGATA
CCGACATCGACGACTTCTCCGCCTCTGTGCCCTCTACTCCGCCGCCTTCCTCCTCCTCTCCGCCGTAGCTTTTCTACTTTTCCCCTCCGATCCGTCGCTCCAACTCGTCC
GATTGAAACTCAATCGCGCCAAAGTCCGTTTGTTGCCTGTTGTCGTCCTTGACCTTTCCTTCTCTGCTTCTGTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTAC
AATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTCAGCTCCGAGGGCGGTCGAGTTTCTGCTCGAGGGTCTTCTTTTGTGAATGCCACTCTTGA
TTTGAATGGGATTGAAGTCATTCACGATGCCTTTTACTTGCTTGAGGATTTGGGGAAAGGCGTCATTCCATTCGATACGGAGACGGAAGTCGAAGGATACATGGGGTTTT
TCTTTATCAAATTCCCCATTAAGGCAAGGGTATCATGTGAGGTATTTGTGAATACCAAAAACCAAACAATTGAACATCAAGATTGTTACCCTGAGTGA
mRNA sequence
GCACGGCATCACAAAATTCAGTTCCCGTTTTTAATTTTCTCTACCAAAAAACATGACCTCCAGCTCCAGGGTCGATTCTGCCCCTGTGCCCTATTCTCTTCTTCCCCAAA
ATGCCGGACAGCCAAACGTCGTCGTTTTATCCCTCTACCATCCCCCTCGATACCGACATCGACGACTTCTCCGCCTCTGTGCCCTCTACTCCGCCGCCTTCCTCCTCCTC
TCCGCCGTAGCTTTTCTACTTTTCCCCTCCGATCCGTCGCTCCAACTCGTCCGATTGAAACTCAATCGCGCCAAAGTCCGTTTGTTGCCTGTTGTCGTCCTTGACCTTTC
CTTCTCTGCTTCTGTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTACAATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTCAGCTCCG
AGGGCGGTCGAGTTTCTGCTCGAGGGTCTTCTTTTGTGAATGCCACTCTTGATTTGAATGGGATTGAAGTCATTCACGATGCCTTTTACTTGCTTGAGGATTTGGGGAAA
GGCGTCATTCCATTCGATACGGAGACGGAAGTCGAAGGATACATGGGGTTTTTCTTTATCAAATTCCCCATTAAGGCAAGGGTATCATGTGAGGTATTTGTGAATACCAA
AAACCAAACAATTGAACATCAAGATTGTTACCCTGAGTGAGGGGAAGGATAGAAATTCAGTTTTAGACTTTTATGACGTGAAGTTGGTAAGTGGGAACTCCCCTGATCCT
GCTGAATTTGTCTGTAATTATCACTCGTAGAAAGTTAGGCTCCATTGTGGCATGCTAGGGACTTGAAAGAAGAAAATGGTGAATATCTCTGTGGATTATTACATAAGAAC
CATGTTATTTATTTATTTATAATTTTCTCTATTAGAGTAGAGCTCAGATTCTTACACACGTTATAGTTTGTTCATGTGTACATGTGTGGCAGTCATTGCTATTGCATTAA
TAAGAGTAAATTCTGTTTCTATT
Protein sequence
MTSSSRVDSAPVPYSLLPQNAGQPNVVVLSLYHPPRYRHRRLLRLCALYSAAFLLLSAVAFLLFPSDPSLQLVRLKLNRAKVRLLPVVVLDLSFSASVRVRNKNFFSLDY
NYLGVSVGYRGRRLGFVSSEGGRVSARGSSFVNATLDLNGIEVIHDAFYLLEDLGKGVIPFDTETEVEGYMGFFFIKFPIKARVSCEVFVNTKNQTIEHQDCYPE
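
The relationship between the CDS and protein sequences above can be checked by translating the CDS. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First six codons of the CDS, then its last three codons.
print(translate("ATGACCTCCAGCTCCAGG"))
print(translate("CCTGAGTGA"))
```

The first six codons give `MTSSSR` and the last three give `PE` followed by a stop, which agrees with the start and end of the protein sequence listed above.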