; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034418 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034418
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLEA_2 domain-containing protein
Genome locationchr3:7214325..7216118
RNA-Seq ExpressionLag0034418
SyntenyLag0034418
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus]2.0e-9584.19Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV
        MTS+S DDSVPVPY+L+P NA QQNVVVLSLYRPP  +HRRLLRLCA YSAAFLLL A+AFLLFPSDPSLQLVRLKLNRVKV L+PVV LDLSFS S+RV
Subjt:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV
        RNKNFFSL+YN+LGVSVGYRGRRLG+VSS+GGRVSARGSSYVNAT DLNG EV+HDV YLL DLGKG+IPFDTET+VEG MGLFFIK PIKA+VSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV

Query:  NTKSQTIEHQDCYPE
        NT +QTIEHQDCYPE
Subjt:  NTKSQTIEHQDCYPE

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia]1.3e-9986.98Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV
        MTS+SRDDSVPVPYSLLP NA  QNVVVLSLYRPPR++ RRLLRLCA YSAAFLLLSA+AFLLFP+DPSLQLVRLKLNR+KVRLLPV++LDLSFSASVRV
Subjt:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNAT DLNGFEVIHD  YL+EDL  G++PFDTETEVEGYMGLFFIKFPIKA+VSCEVFV
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV

Query:  NTKSQTIEHQDCYPE
        NT  +TIEHQDCYPE
Subjt:  NTKSQTIEHQDCYPE

XP_022987870.1 uncharacterized protein LOC111485280 [Cucurbita maxima]4.4e-9586.45Show/hide
Query:  STSRDDSVPVPYSLLPQN-AGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR
        S S+D S+PVPYS +P N A  QNVVVLSLYRPP Y+ RRLLRLCALYSAAFLLLSA+ FLLFPSDPSLQLVRLKLN VKVRLLP VVLDLSFSASVRVR
Subjt:  STSRDDSVPVPYSLLPQN-AGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN
        NKNFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNAT DLNG ++IHDVF+LLEDL KG+IPFDTETEVEG MGLFFIKFPIKA VSCEVFV+
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN

Query:  TKSQTIEHQDCYPE
        T SQTIEHQDCYPE
Subjt:  TKSQTIEHQDCYPE

XP_023530779.1 uncharacterized protein LOC111793228 isoform X1 [Cucurbita pepo subsp. pepo]5.7e-9585.25Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNA--GQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASV
        MTS+SRDDSV    SLLPQNA  G QN+V+LSLYRPP Y HRRLLRLCA YSAAFLLL+AL+FLLFPSDPSLQLVRLKLN  KVRLLPV+VLDLS SASV
Subjt:  MTSTSRDDSVPVPYSLLPQNA--GQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEV
        RVRNKNFFSLDYNYLGVSVGYRG+RLGFVSSDGGRVSARGSSYVNAT DLNG EVIHD FYLL+DLGKG+IPFD++TEVEG+MG FFIKFPIKA+VSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEV

Query:  FVNTKSQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKSQTIEHQDCYPE

XP_023530780.1 uncharacterized protein LOC111793228 isoform X2 [Cucurbita pepo subsp. pepo]5.7e-9585.25Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNA--GQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASV
        MTS+SRDDSV    SLLPQNA  G QN+V+LSLYRPP Y HRRLLRLCA YSAAFLLL+AL+FLLFPSDPSLQLVRLKLN  KVRLLPV+VLDLS SASV
Subjt:  MTSTSRDDSVPVPYSLLPQNA--GQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEV
        RVRNKNFFSLDYNYLGVSVGYRG+RLGFVSSDGGRVSARGSSYVNAT DLNG EVIHD FYLL+DLGKG+IPFD++TEVEG+MG FFIKFPIKA+VSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEV

Query:  FVNTKSQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKSQTIEHQDCYPE

TrEMBL top hitse value%identityAlignment
A0A0A0LTV4 LEA_2 domain-containing protein9.5e-9684.19Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV
        MTS+S DDSVPVPY+L+P NA QQNVVVLSLYRPP  +HRRLLRLCA YSAAFLLL A+AFLLFPSDPSLQLVRLKLNRVKV L+PVV LDLSFS S+RV
Subjt:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV
        RNKNFFSL+YN+LGVSVGYRGRRLG+VSS+GGRVSARGSSYVNAT DLNG EV+HDV YLL DLGKG+IPFDTET+VEG MGLFFIK PIKA+VSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV

Query:  NTKSQTIEHQDCYPE
        NT +QTIEHQDCYPE
Subjt:  NTKSQTIEHQDCYPE

A0A6J1CTN0 uncharacterized protein LOC1110144736.3e-10086.98Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV
        MTS+SRDDSVPVPYSLLP NA  QNVVVLSLYRPPR++ RRLLRLCA YSAAFLLLSA+AFLLFP+DPSLQLVRLKLNR+KVRLLPV++LDLSFSASVRV
Subjt:  MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNAT DLNGFEVIHD  YL+EDL  G++PFDTETEVEGYMGLFFIKFPIKA+VSCEVFV
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFV

Query:  NTKSQTIEHQDCYPE
        NT  +TIEHQDCYPE
Subjt:  NTKSQTIEHQDCYPE

A0A6J1HAC8 uncharacterized protein LOC1114615746.8e-9485.05Show/hide
Query:  STSRDDSVPVPYSLLPQN-AGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR
        S S+D S+PVPYS +P N A  QN+VVLSLYRPP Y+ RRLLRLC LYSAAFLLLSA+ FLLFPSDPSLQLVRLKLN V VRLLP VVLDLSFSASVRVR
Subjt:  STSRDDSVPVPYSLLPQN-AGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN
        N NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNAT DLNG ++IHDVF+LLEDL KG+IPFDTETEVEG MGLFFIKFPIKA VSCEVFV+
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN

Query:  TKSQTIEHQDCYPE
        T SQTIEHQDCYPE
Subjt:  TKSQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC1114852802.1e-9586.45Show/hide
Query:  STSRDDSVPVPYSLLPQN-AGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR
        S S+D S+PVPYS +P N A  QNVVVLSLYRPP Y+ RRLLRLCALYSAAFLLLSA+ FLLFPSDPSLQLVRLKLN VKVRLLP VVLDLSFSASVRVR
Subjt:  STSRDDSVPVPYSLLPQN-AGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN
        NKNFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNAT DLNG ++IHDVF+LLEDL KG+IPFDTETEVEG MGLFFIKFPIKA VSCEVFV+
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN

Query:  TKSQTIEHQDCYPE
        T SQTIEHQDCYPE
Subjt:  TKSQTIEHQDCYPE

A0A6J1KPJ6 uncharacterized protein LOC1114975516.8e-9484.79Show/hide
Query:  MTSTSRDDSVPVPYSLLPQNA--GQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASV
        MTS+SRDDSV    SLLPQNA  G QN+V+LSLYRPP Y HRRLLRLCA YSAAFLLL+AL+FLLFPSDPSLQLVRLKLN  KVRLLPV+VLDLS SASV
Subjt:  MTSTSRDDSVPVPYSLLPQNA--GQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASV

Query:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEV
        RVRNKNFFSLDYNYLGVSVGYRG RLGFVSSDGGRVSARGSS VNAT DLNG EVIHD FYLL+DLGKG+IPFD++TEVEG+MG FFIKFPIKA+VSC+V
Subjt:  RVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEV

Query:  FVNTKSQTIEHQDCYPE
        FVNTK QTIEHQDCYPE
Subjt:  FVNTKSQTIEHQDCYPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.5e-3740.59Show/hide
Query:  YSLLPQNAGQQ-NVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVRNKNFFSLDYNY
        Y  LP ++  + N  VL    P     RR +    L S A    S L ++ +PSDP ++++R+K++ V V   PV  +D++   +++V N + +S D+  
Subjt:  YSLLPQNAGQQ-NVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVRNKNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVNTKSQTIEHQDC
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G  V  DV +L+ DL KG + FDT TE  G +G+ F +FP+KAKV+C + V+T +QTI  Q C
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVNTKSQTIEHQDC

Query:  YP
         P
Subjt:  YP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.4e-2938.67Show/hide
Query:  YSLLPQNAGQQ-NVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVRNKNFFSLDYNY
        Y  LP ++  + N  VL    P     RR +    L S A    S L ++ +PSDP ++++R+K++ V V   PV  +D++   +++V N + +S D+  
Subjt:  YSLLPQNAGQQ-NVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVRNKNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAK
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G  V  DV +L+ DL KG + FDT TE  G +G+ F +FP+K +
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAK

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.6e-5349.53Show/hide
Query:  STSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQH-RRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR
        ++S+ +   +PY+ LP +   Q+V++L+ YR  R     R LR   L++A  LLLSA  +LL+PSDP + + R+ LN + V     + LDLSFS +++VR
Subjt:  STSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQH-RRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN
        N++FFSLDY+ L VS+GYRGR LG V S GG + AR SSY++AT +L+G EV+HDV YL+ DL KGVIPFDT  +V+G +G+     PI+ KVSCEV+VN
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVN

Query:  TKSQTIEHQDCY
          +Q I HQDC+
Subjt:  TKSQTIEHQDCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTCCACCTCTAGGGACGATTCTGTCCCTGTGCCCTACTCTCTTCTTCCCCAAAATGCCGGACAGCAAAACGTCGTCGTTTTATCGCTCTACCGTCCCCCTCGCTA
CCAACACCGTCGCCTTCTCCGCCTCTGTGCTCTCTACTCCGCCGCCTTCCTCCTCCTCTCCGCCTTAGCTTTTCTACTTTTCCCGTCCGATCCGTCGCTCCAACTCGTCC
GATTGAAACTCAATCGCGTCAAAGTCCGTCTGCTGCCTGTCGTCGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGCAATAAGAATTTCTTCTCTCTCGACTAC
AATTACCTTGGCGTTTCTGTCGGCTACCGGGGGAGACGACTTGGATTTGTGAGCTCCGACGGCGGTCGAGTTTCTGCCCGAGGGTCTTCTTATGTGAACGCCACTTTCGA
TTTGAATGGGTTCGAGGTCATTCACGACGTCTTTTACTTGCTTGAGGATTTGGGCAAGGGTGTCATTCCATTCGACACGGAGACGGAGGTGGAAGGATACATGGGGCTTT
TCTTTATCAAATTCCCGATTAAGGCAAAGGTTTCATGTGAGGTATTTGTGAATACGAAAAGCCAGACAATCGAACATCAAGATTGCTACCCTGAGGGAAGGATAGAATTT
CAGTTTCATAATGATTTTTGTGGGAACTCCTCTGCTCCTGCTGAATTTGCTGTAAATATCACTCGTAGAAAGTTAGGCTCCATTGTTGTATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCTCCACCTCTAGGGACGATTCTGTCCCTGTGCCCTACTCTCTTCTTCCCCAAAATGCCGGACAGCAAAACGTCGTCGTTTTATCGCTCTACCGTCCCCCTCGCTA
CCAACACCGTCGCCTTCTCCGCCTCTGTGCTCTCTACTCCGCCGCCTTCCTCCTCCTCTCCGCCTTAGCTTTTCTACTTTTCCCGTCCGATCCGTCGCTCCAACTCGTCC
GATTGAAACTCAATCGCGTCAAAGTCCGTCTGCTGCCTGTCGTCGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGCAATAAGAATTTCTTCTCTCTCGACTAC
AATTACCTTGGCGTTTCTGTCGGCTACCGGGGGAGACGACTTGGATTTGTGAGCTCCGACGGCGGTCGAGTTTCTGCCCGAGGGTCTTCTTATGTGAACGCCACTTTCGA
TTTGAATGGGTTCGAGGTCATTCACGACGTCTTTTACTTGCTTGAGGATTTGGGCAAGGGTGTCATTCCATTCGACACGGAGACGGAGGTGGAAGGATACATGGGGCTTT
TCTTTATCAAATTCCCGATTAAGGCAAAGGTTTCATGTGAGGTATTTGTGAATACGAAAAGCCAGACAATCGAACATCAAGATTGCTACCCTGAGGGAAGGATAGAATTT
CAGTTTCATAATGATTTTTGTGGGAACTCCTCTGCTCCTGCTGAATTTGCTGTAAATATCACTCGTAGAAAGTTAGGCTCCATTGTTGTATGTTAG
Protein sequenceShow/hide protein sequence
MTSTSRDDSVPVPYSLLPQNAGQQNVVVLSLYRPPRYQHRRLLRLCALYSAAFLLLSALAFLLFPSDPSLQLVRLKLNRVKVRLLPVVVLDLSFSASVRVRNKNFFSLDY
NYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATFDLNGFEVIHDVFYLLEDLGKGVIPFDTETEVEGYMGLFFIKFPIKAKVSCEVFVNTKSQTIEHQDCYPEGRIEF
QFHNDFCGNSSAPAEFAVNITRRKLGSIVVC