; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007585 (gene) of Snake gourd v1 genome

Gene IDTan0007585
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG11:21252681..21254413
RNA-Seq ExpressionTan0007585
SyntenyTan0007585
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579221.1 hypothetical protein SDJN03_23669, partial [Cucurbita argyrosperma subsp. sororia]7.8e-8784.58Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY +HSRADTGGWQKHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

KAG7016734.1 hypothetical protein SDJN02_21844 [Cucurbita argyrosperma subsp. argyrosperma]7.8e-8784.58Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

XP_022939105.1 uncharacterized protein LOC111445108 [Cucurbita moschata]7.8e-8784.58Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

XP_022992987.1 uncharacterized protein LOC111489148 [Cucurbita maxima]3.0e-8684.08Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGW+KHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

XP_023550275.1 uncharacterized protein LOC111808498 [Cucurbita pepo subsp. pepo]6.6e-8684.08Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         L LGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0KV04 Uncharacterized protein5.7e-8380.2Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL V+ IIASL+LIAFV AIGAEMRRSTA   PD+YDE+T+CVYDSDAST YGLVAFGL+LIS T+LM VTRCLCCGKGL+SGGSTVCA+I F++SW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK + VLP+ NLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEG+ M P N PQ EQH  R  EF 
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  KV
        KV
Subjt:  KV

A0A1S3CSX8 uncharacterized protein LOC1035045961.7e-8279.7Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA +L V+ IIASL+LIAFV AIGAEMRRSTA+  PD+YDE+T+CVYDSDAST YGLVAFGL+LIS T+LM VTRCLCCGKGLKSGGSTVCA+I F++SW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LP+ NLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEG+ M P N PQ E H  R  EFE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  KV
        KV
Subjt:  KV

A0A6J1DHA0 uncharacterized protein LOC1110200445.1e-8483.74Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SLSV+AIIASL+LIAFVLAIGAEMRRSTA   PDEYDE+THCVYDSDASTAYGL AFGL+LISHTLLMAVTRCLCCGKGLKSGGSTVCAI+LFVVSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNE-GVAMVPPNFPQHEQHNSRTGEF
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC  LR GVFAAAAA+TFLSLVFSILYY  HSRADTGGWQKHQNE GV MV  +  QHEQH+ RT  F
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNE-GVAMVPPNFPQHEQHNSRTGEF

Query:  EKV
        EKV
Subjt:  EKV

A0A6J1FLQ5 uncharacterized protein LOC1114451083.8e-8784.58Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

A0A6J1JRH3 uncharacterized protein LOC1114891481.4e-8684.08Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW
        MA SL+ VAIIASL+LIAFV AIGAEMRRSTA  +PDEYDE+THCVYDSDAST YGLVAFGL+LISHT+LM VTRCLCCGKGLKSGGSTVCAIILF+VSW
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSW

Query:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE
         LFLGAESLLLAGS+RNAYHTK +  LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGW+KHQNEGV MVPP+F  HEQH+  TG FE
Subjt:  LLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFE

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)3.0e-2841.72Show/hide
Query:  SLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLF
        S  V  ++ +L L+AF  +I AE RRS  K+  D    +T CVYDSD +T YG+ AF  +L S +LLM+VT+C+C G+ L  G     +II F+ SW+ F
Subjt:  SLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLF

Query:  LGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRA
        L AE+ ++AG+ +NAYHTK    L  +  SCA LR+G+F A A     ++V ++ YY   +++
Subjt:  LGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRA

AT1G52910.1 Protein of unknown function (DUF1218)1.1e-3549.11Show/hide
Query:  VVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGA
        V+ I+  L LIA  LAI AE RRS  K  PD   E  HC Y SD +T+YG  AF L+ IS  ++M  +RC CCGK LK GGS  C I+LF++ W+ FL A
Subjt:  VVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGA

Query:  ESLLLAGSLRNAYHTKIKRVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKH
        E  LLAGS+RNAYHT  +R+  I+N  SC ++R+GVFAA A+    + + S  YY  +SRA  G    H
Subjt:  ESLLLAGSLRNAYHTKIKRVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKH

AT1G68220.1 Protein of unknown function (DUF1218)1.2e-5355.61Show/hide
Query:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGS-TVCAIILFVVS
        MAVS+S++ ++ +L+L+AFV A GAE RRSTA   PD+YDE T C Y ++AST YG+ AFGL+L+S  ++  VT+CLC GKGL +G S TV AI+ FVVS
Subjt:  MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGS-TVCAIILFVVS

Query:  WLLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEG--VAMVPPNFPQHEQHNSRTG
        W+ FLGAE+ LL GS RNAYHTK + +   K LSCA+L  GVFAA AA T +SL+ +ILYY  HS+ADTGGW+KHQN+G  + M  P+    +Q+     
Subjt:  WLLFLGAESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEG--VAMVPPNFPQHEQHNSRTG

Query:  EFEKV
        EF KV
Subjt:  EFEKV

AT3G15480.1 Protein of unknown function (DUF1218)4.4e-3549.69Show/hide
Query:  VVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGA
        VV I+  L LIA  LAI AE RRS  K E D   +  +CVY +D +T+YG  AF L+ +S  L+MA +RC CCGK L  GGS  CAIILF++ W+ FL A
Subjt:  VVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGA

Query:  ESLLLAGSLRNAYHTKIKRVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRA
        E  LLA S+RNAYHT+ +++  +++  SC ++R+GVFAA AA T  + + S  YY  +SRA
Subjt:  ESLLLAGSLRNAYHTKIKRVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRA

AT4G27435.1 Protein of unknown function (DUF1218)3.1e-3349.38Show/hide
Query:  VVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGA
        V AI+    LIAF LA+ AE RRSTA+   D   +  +CVYDSD +T YG+ AF   + S  L+M V+RC CCGK LK GGS   A+ILF+VSW+ FL A
Subjt:  VVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGA

Query:  ESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRA
        E  LLAGS+ NAYHTK + +       C  LR+GVFAA A+  F + + S  YY  +  A
Subjt:  ESLLLAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTTCTTTGTCCGTCGTAGCCATAATTGCTTCTCTCTATCTCATAGCCTTCGTCCTCGCTATCGGAGCTGAAATGCGCCGGAGCACGGCAAAGGCAGAGCCAGA
TGAGTACGATGAGTCTACTCACTGCGTCTACGATTCCGATGCGTCGACGGCGTACGGTCTTGTGGCGTTTGGTTTGGTGTTGATTAGCCACACACTACTTATGGCTGTCA
CCAGATGTCTCTGCTGCGGCAAAGGCTTGAAAAGTGGAGGATCCACTGTTTGTGCGATCATCCTCTTCGTCGTTTCTTGGCTTCTCTTCTTGGGAGCTGAATCTTTGTTG
CTGGCTGGGTCTCTGAGGAACGCCTATCACACCAAGATCAAACGAGTATTGCCTATCAAGAACTTGTCGTGTGCTATGCTTCGCCGTGGGGTGTTCGCCGCTGCAGCAGC
CTTGACGTTTCTGTCGTTGGTGTTTTCGATCCTTTACTACTCGATGCATTCGAGAGCCGATACCGGAGGCTGGCAGAAGCACCAGAATGAAGGTGTTGCCATGGTGCCTC
CTAATTTTCCACAGCACGAGCAGCATAATAGTCGAACTGGTGAATTTGAGAAGGTTTAG
mRNA sequenceShow/hide mRNA sequence
TAAAAACAAAAAGCATCCCAAAGCAAAGATATTATTTCTTGAAAAACTAAAGAAAGGACTGGCTGCCGCAGATCGGAGAGTTCACCAGAGTTGGATTAGTTTCATTACAG
TACAACAATGGCGGTTTCTTTGTCCGTCGTAGCCATAATTGCTTCTCTCTATCTCATAGCCTTCGTCCTCGCTATCGGAGCTGAAATGCGCCGGAGCACGGCAAAGGCAG
AGCCAGATGAGTACGATGAGTCTACTCACTGCGTCTACGATTCCGATGCGTCGACGGCGTACGGTCTTGTGGCGTTTGGTTTGGTGTTGATTAGCCACACACTACTTATG
GCTGTCACCAGATGTCTCTGCTGCGGCAAAGGCTTGAAAAGTGGAGGATCCACTGTTTGTGCGATCATCCTCTTCGTCGTTTCTTGGCTTCTCTTCTTGGGAGCTGAATC
TTTGTTGCTGGCTGGGTCTCTGAGGAACGCCTATCACACCAAGATCAAACGAGTATTGCCTATCAAGAACTTGTCGTGTGCTATGCTTCGCCGTGGGGTGTTCGCCGCTG
CAGCAGCCTTGACGTTTCTGTCGTTGGTGTTTTCGATCCTTTACTACTCGATGCATTCGAGAGCCGATACCGGAGGCTGGCAGAAGCACCAGAATGAAGGTGTTGCCATG
GTGCCTCCTAATTTTCCACAGCACGAGCAGCATAATAGTCGAACTGGTGAATTTGAGAAGGTTTAGTTATGGTAGGACTGAATTATTAGAAGTTTCATATTCAATGTTCA
TTAGACTTTATGCCCAAAAAGACAGGTACAACAAAATATCAACTAGTTCTTGCTCAACTACGAATATGTAAAGTGGTATCATATGTAATCTAGGTGTTGAAAATATAGCT
GGTATATCATATATGTTTTGATTTTTAACTGGAACTCCAAGATTCAATGCTAAGTGAATTTTGGA
Protein sequenceShow/hide protein sequence
MAVSLSVVAIIASLYLIAFVLAIGAEMRRSTAKAEPDEYDESTHCVYDSDASTAYGLVAFGLVLISHTLLMAVTRCLCCGKGLKSGGSTVCAIILFVVSWLLFLGAESLL
LAGSLRNAYHTKIKRVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYSMHSRADTGGWQKHQNEGVAMVPPNFPQHEQHNSRTGEFEKV