; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017235 (gene) of Snake gourd v1 genome

Gene IDTan0017235
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationLG11:4064084..4065824
RNA-Seq ExpressionTan0017235
SyntenyTan0017235
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033185.1 hypothetical protein SDJN02_07239, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-9975.4Show/hide
Query:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL
        ML+SITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSK+QQK RKDQ+L SVSLR FSDIPL E  GKASFD YLEDKPR+VKATFPGKS QL
Subjt:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL

Query:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV
        NQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNITKV++L+MTNWE+NGI R+YRPSS N+CSRG IYSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV

Query:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        PD+L+FVP DV QSIVE  L+ M+ED+K K++DRLVEDYC FRKE++K
Subjt:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata]3.6e-10075.81Show/hide
Query:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL
        ML+SITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSK+QQK RKDQ+L SVSLR FSDIPL E  GKASFD YLEDKPR+VKATFPGKS QL
Subjt:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL

Query:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV
        NQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNITKV++L+MTNWE+NGIHR+YRPSS N+CSRG IYSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV

Query:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        PD+L+FVP DV QSIVE  L+ M+ED+K K++DRLVEDYC FRKE++K
Subjt:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

XP_022990863.1 uncharacterized protein LOC111487627 [Cucurbita maxima]9.5e-9371.89Show/hide
Query:  MMLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQ
        MMLYSITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSKA           SVSLR FSDIPL E  GKASFD YLEDKPR+VKA FPGKS Q
Subjt:  MMLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQ

Query:  LNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFC
        LNQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNIT+V++L+MTNWE+NGI R+Y PSS N+CSRG IYSEK G RSRLKFQL +NLSF 
Subjt:  LNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFC

Query:  VPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        +PD+L+F+P DV QSIVE  L+AM+ED+K K++DRLVEDYC FRKE++K
Subjt:  VPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo]7.3e-10176.21Show/hide
Query:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL
        MLYSITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSK+QQK RKDQ+L SVSLR FSDIPL E  GKASFD YLEDKPR+VKATFPGKS QL
Subjt:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL

Query:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV
        NQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNITKV++L+MTNWE+NGIHR+YRPSS N+CSRG IYS+K G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV

Query:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        PD+L+FVP DV QSIVE  L+AM+ED+K K++DRLVEDYC FRKE++K
Subjt:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

XP_023517005.1 uncharacterized protein LOC111780797 isoform X2 [Cucurbita pepo subsp. pepo]1.1e-9675Show/hide
Query:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL
        MLYSITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSK+QQK RKDQ+L SVSLR FSDIPL E  GKASFD YLEDKPR+VKATFPGKS QL
Subjt:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL

Query:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV
        NQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNITK      TNWE+NGIHR+YRPSS N+CSRG IYS+K G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV

Query:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        PD+L+FVP DV QSIVE  L+AM+ED+K K++DRLVEDYC FRKE++K
Subjt:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

TrEMBL top hitse value%identityAlignment
A0A0A0KSD5 Uncharacterized protein8.7e-8463.49Show/hide
Query:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL
        ML S T+ +CLH       QRE     S LKKQK+ +WKCFA+    QK+    +L SVS   FSD+PL ESPGKASFD YLEDKPR+VKATFPGK++QL
Subjt:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL

Query:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTS-GEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFC
        NQEEWRIETPKI+LLFLK+ PTIDMKIISKT+ GE YP  VPH I K++  +MTNWEINGIH+ YRPSS N+CS G IY +K+G RSRLKFQL+++LSF 
Subjt:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTS-GEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFC

Query:  VPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEKQII
        VPD+L FVPNDV++ I+E  ++AM+EDLKHK++ +LVEDY +FR E+EK+ I
Subjt:  VPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEKQII

A0A1S4E357 uncharacterized protein LOC1034987443.9e-6866.15Show/hide
Query:  KKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQLNQEEWRIETPKIELLFLKVWPTIDM
        ++S LKKQKI  W+CFA+ ++Q+ +  D +L SVS   FSD+ L ESPGKASFD YLEDKPR++KATFPGK +QLNQEEWRIETPKI+LLFLK+WPT+DM
Subjt:  KKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQLNQEEWRIETPKIELLFLKVWPTIDM

Query:  KIISKTS-GEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQSIV
        KIISKT+ GE YP DVP+ I KV+  EMTNWEINGI+++YRPSS N+CS G IY EK+G RS LKF+L+++LSF VPD+L FVPNDV++ ++
Subjt:  KIISKTS-GEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQSIV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X12.9e-7964.54Show/hide
Query:  MMLYSITQQLCLHVTNDVLL--QRESLIKKSHLKKQK-ISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGK
        MM    TQ L  HV N       R +++KK   KKQK I S K  AVSK QQ   + Q+L S S+ FFSDIPL+ESPGKASFD YLEDKPR++KATFPGK
Subjt:  MMLYSITQQLCLHVTNDVLL--QRESLIKKSHLKKQK-ISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGK

Query:  SRQLNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNL
        S+QLNQEEWRIETPK+ELL LK+WP IDMKIISKTSG+ YP  VPH+ITK++ LEMTNWEINGIHRNYRPSS N+ S+G IYSEK G  SRLKFQ  MN 
Subjt:  SRQLNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNL

Query:  SFCVPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEE
        +F VP +LSF+P D+ +SI E  L+ M+EDL +K+ID+LVEDY +FRKE++
Subjt:  SFCVPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEE

A0A6J1HEU2 uncharacterized protein LOC1114623971.8e-10075.81Show/hide
Query:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL
        ML+SITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSK+QQK RKDQ+L SVSLR FSDIPL E  GKASFD YLEDKPR+VKATFPGKS QL
Subjt:  MLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQL

Query:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV
        NQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNITKV++L+MTNWE+NGIHR+YRPSS N+CSRG IYSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCV

Query:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        PD+L+FVP DV QSIVE  L+ M+ED+K K++DRLVEDYC FRKE++K
Subjt:  PDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

A0A6J1JUJ3 uncharacterized protein LOC1114876274.6e-9371.89Show/hide
Query:  MMLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQ
        MMLYSITQQ  L V N V+LQ      KSHLK QK+S WKCFAVSKA           SVSLR FSDIPL E  GKASFD YLEDKPR+VKA FPGKS Q
Subjt:  MMLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQ

Query:  LNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFC
        LNQEEWRIETPKIE LFLK+WPTID+KIISKTSGEGYPSDVPHNIT+V++L+MTNWE+NGI R+Y PSS N+CSRG IYSEK G RSRLKFQL +NLSF 
Subjt:  LNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFC

Query:  VPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
        +PD+L+F+P DV QSIVE  L+AM+ED+K K++DRLVEDYC FRKE++K
Subjt:  VPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G39520.1 Protein of unknown function (DUF1997)2.9e-3941.88Show/hide
Query:  SLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSR--QLNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEI
        S +  +DI L ESP +A FD YLEDK R+ +A FP K +  +LN+EEWRI+   I+  FL   P + M+I  K++G+ YPSDVP +ITKV+EL MT WE+
Subjt:  SLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSR--QLNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEI

Query:  NGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK
         G+ R   P+   +  +G +Y ++ G  +RLK +L   +SF +P  L+ VP DV +++    L  +++++KH+ I+ LV DY +F+ E +K
Subjt:  NGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEK

AT5G39530.1 Protein of unknown function (DUF1997)4.3e-4344.27Show/hide
Query:  SLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGK--SRQLNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEI
        S R  +DIPL+ESP +A FD YLEDK R+ +A FP K  S +LN+EEWRI+   I  LFL VWP +DM++  K++G+ YP DVP +ITKV+EL M  W++
Subjt:  SLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGK--SRQLNQEEWRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEI

Query:  NGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEKQ
         G+ R   P+  ++  +G +Y ++ G  +RL+ QL MN+SF +P  L  VP DV +++    L  ++E++KHK    L+ DY  F+ E + Q
Subjt:  NGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQSIVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTAGCTCCCATGATGCTCTATTCAATTACACAACAACTATGTTTGCATGTTACGAATGATGTTCTTCTTCAAAGAGAGAGCTTGATCAAGAAATCCCATTTGAA
GAAGCAAAAAATTAGTAGCTGGAAGTGCTTTGCAGTGTCAAAAGCACAGCAGAAACTACGAAAAGACCAGAGCTTGTTTTCTGTTTCTTTGAGATTTTTCAGTGACATAC
CACTTGATGAATCTCCAGGGAAAGCTTCTTTTGATCACTACTTGGAAGATAAACCCAGAATAGTCAAAGCAACATTTCCAGGAAAAAGTAGACAGCTCAACCAGGAAGAG
TGGAGGATTGAAACGCCAAAAATCGAGTTGCTGTTTCTCAAGGTATGGCCAACAATTGATATGAAAATAATCAGCAAAACCAGTGGAGAAGGTTACCCATCAGATGTTCC
TCATAATATCACAAAAGTTGTCGAGCTTGAAATGACAAACTGGGAGATCAATGGCATCCACAGAAACTACAGGCCATCTTCAATCAATATTTGTTCTAGAGGAACTATTT
ACAGTGAAAAATTAGGGGCTAGAAGCCGCCTTAAGTTTCAACTCTTAATGAATCTAAGCTTTTGTGTCCCGGACTCTCTGAGTTTCGTTCCGAATGACGTTGTCCAGAGC
ATCGTCGAAATGGCTTTGAGGGCAATGATTGAGGACTTGAAGCATAAAAGTATAGATAGATTGGTTGAGGATTATTGTGAGTTCAGGAAGGAGGAGGAGAAGCAAATTAT
TAGAGCACAAGTAGAAGCATAG
mRNA sequenceShow/hide mRNA sequence
TTCAAAACTTTTGAGATTAAATGAAACTAGCTCCCATGATGCTCTATTCAATTACACAACAACTATGTTTGCATGTTACGAATGATGTTCTTCTTCAAAGAGAGAGCTTG
ATCAAGAAATCCCATTTGAAGAAGCAAAAAATTAGTAGCTGGAAGTGCTTTGCAGTGTCAAAAGCACAGCAGAAACTACGAAAAGACCAGAGCTTGTTTTCTGTTTCTTT
GAGATTTTTCAGTGACATACCACTTGATGAATCTCCAGGGAAAGCTTCTTTTGATCACTACTTGGAAGATAAACCCAGAATAGTCAAAGCAACATTTCCAGGAAAAAGTA
GACAGCTCAACCAGGAAGAGTGGAGGATTGAAACGCCAAAAATCGAGTTGCTGTTTCTCAAGGTATGGCCAACAATTGATATGAAAATAATCAGCAAAACCAGTGGAGAA
GGTTACCCATCAGATGTTCCTCATAATATCACAAAAGTTGTCGAGCTTGAAATGACAAACTGGGAGATCAATGGCATCCACAGAAACTACAGGCCATCTTCAATCAATAT
TTGTTCTAGAGGAACTATTTACAGTGAAAAATTAGGGGCTAGAAGCCGCCTTAAGTTTCAACTCTTAATGAATCTAAGCTTTTGTGTCCCGGACTCTCTGAGTTTCGTTC
CGAATGACGTTGTCCAGAGCATCGTCGAAATGGCTTTGAGGGCAATGATTGAGGACTTGAAGCATAAAAGTATAGATAGATTGGTTGAGGATTATTGTGAGTTCAGGAAG
GAGGAGGAGAAGCAAATTATTAGAGCACAAGTAGAAGCATAGAAGATCATCTACCACATTGGATTACAATATTTGTTGTATAACAAGTGAGGCGTGGGGATTCAAACTTT
TAATCTCTATTTAATCCCAGGGTATATATATGTTTTAATTTAAGTAATTCAACAATGCTCAAATTGCCTCATGTTTGTTATTTATTATGAAAATAAAAGCATGAGAATAT
ATATGACTTTGTCC
Protein sequenceShow/hide protein sequence
MKLAPMMLYSITQQLCLHVTNDVLLQRESLIKKSHLKKQKISSWKCFAVSKAQQKLRKDQSLFSVSLRFFSDIPLDESPGKASFDHYLEDKPRIVKATFPGKSRQLNQEE
WRIETPKIELLFLKVWPTIDMKIISKTSGEGYPSDVPHNITKVVELEMTNWEINGIHRNYRPSSINICSRGTIYSEKLGARSRLKFQLLMNLSFCVPDSLSFVPNDVVQS
IVEMALRAMIEDLKHKSIDRLVEDYCEFRKEEEKQIIRAQVEA