; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1569 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1569
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDNL-type domain-containing protein
Genome locationMC09:21410086..21421047
RNA-Seq ExpressionMC09g1569
SyntenyMC09g1569
Gene Ontology termsGO:0006457 - protein folding (biological process)
GO:0030150 - protein import into mitochondrial matrix (biological process)
GO:0050821 - protein stabilization (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0051087 - chaperone binding (molecular function)
InterPro domainsIPR007853 - Zinc finger, DNL-type
IPR024158 - Mitochondrial import protein TIM15


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601335.1 DNL-type zinc finger protein, partial [Cucurbita argyrosperma subsp. sororia]1.03e-9180.59Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        M AAKTL+ SSPALLQ S NSSQ  P+PSI+AFRPI+SSKANPSND FIRSR+F TAPV RE R KV  VSRLVDGNSG+D++SA+RNSD GAAIDIK P
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLV FTCNQC  RTKR+INRLAYERGLVFVQCAGC KYHKLVDNLGLIVEYDF+E+D+DV+SNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

KAG7032119.1 DNL-type zinc finger protein [Cucurbita argyrosperma subsp. argyrosperma]1.62e-8879.41Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        M AAKTL  SSPALLQ S N SQ  P+PSI AFRPI+SSKANPSN  FIRSR F TAPV RE R KV  VSRLVDGNSG+D++SA+RNSD GAAIDIK P
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLV FTCNQC  RTKR+INRLAYERGLVFVQCAGC KYHKLVDNLGLIVEYDF+E+D+DV+SNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

XP_004135125.1 uncharacterized protein C24H6.02c isoform X3 [Cucumis sativus]7.31e-8876.16Show/hide
Query:  AAMTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIK
        A+M AA T+  SSP LLQHS N SQ  P+PSI +F+PI+SSKANPSN VFIRSRNF TAPV RERRYKV  VS LVDG +G+D++ ++RNSD GAAIDIK
Subjt:  AAMTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIK

Query:  FPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
         PRRSL+V FTCNQC+ RTKR+INRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDF+EED+D+DS+SDQV
Subjt:  FPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

XP_022149076.1 uncharacterized protein LOC111017577 [Momordica charantia]1.51e-118100Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

XP_023511593.1 DNL-type zinc finger protein isoform X1 [Cucurbita pepo subsp. pepo]3.99e-8979.41Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        M AAKTL  SSPALLQ S N SQ  P+PSI+AFRPI+SSKANPSN  FIRSR+F TAPV RE R KV  VSRLVDGNSG D++SA+RNSD GAAIDIK P
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLV FTCNQC  RTKR+INRLAYERGLVFVQCAGC KYHKLVDNLGLIVEYDF+E+D+DV+SNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

TrEMBL top hitse value%identityAlignment
A0A0A0KR03 DNL-type domain-containing protein3.54e-8876.16Show/hide
Query:  AAMTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIK
        A+M AA T+  SSP LLQHS N SQ  P+PSI +F+PI+SSKANPSN VFIRSRNF TAPV RERRYKV  VS LVDG +G+D++ ++RNSD GAAIDIK
Subjt:  AAMTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIK

Query:  FPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
         PRRSL+V FTCNQC+ RTKR+INRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDF+EED+D+DS+SDQV
Subjt:  FPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

A0A5A7SVJ1 Mitochondrial protein import protein ZIM17 isoform X21.85e-8776.47Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        M AAKT+  SSP LL H  N SQ  P+PSI +F+PI+SSKANPSN VFIRSRNF TAPV R RRYKV  VS LVDG +G+D++S++RNSD GAAIDIK P
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSL+V FTCNQC+ RTKR+INRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDF+EED D+DSNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

A0A6J1D799 uncharacterized protein LOC1110175777.33e-119100Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

A0A6J1GZ90 uncharacterized protein LOC111458229 isoform X16.45e-8878.82Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        M AAKT   SSPALLQ S N SQ  P+PSI AFRPI+SSKANPSN  FIRSR F TAPV RE R KV  VSRLVDGNSG+D++SA+RNSD GAAIDIK P
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLV FTCNQC  RTKR+INRLAYERGLVFVQCAGC KYHKLVDNLGLIVEYDF+E+D+DV+SNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

A0A6J1J9C7 DNL-type zinc finger protein isoform X11.30e-8777.65Show/hide
Query:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP
        M AAKTL  SSPALLQ S N SQ  P+PSI+AFRPI+SSKANPSN  FI SR+F TAPV RE R KV  VSRLVDG+SG+D++SA+RNSD GAAIDIK P
Subjt:  MTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFP

Query:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV
        RRSLLV FTCNQC  RTKR+INRLAYERGLVFVQCAGC KYHKLVDNLGLIVEYDF+++D+DV+SNSDQV
Subjt:  RRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV

SwissProt top hitse value%identityAlignment
A1L1P7 DNL-type zinc finger protein2.3e-0543.9Show/hide
Query:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNL
        +TC  C+ R+ + I++LAY +G+V V C GC+ +H + DNL
Subjt:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNL

Q0IH40 DNL-type zinc finger protein6.0e-0642.86Show/hide
Query:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLG
        +TC  C  R+ + I+++AY +G+V V+C GC+ +H + DNLG
Subjt:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLG

Q5SXM8 DNL-type zinc finger protein2.7e-0647.62Show/hide
Query:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLG
        +TC  C  R+ + I++LAY +G+V V C GCQ +H + DNLG
Subjt:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLG

Q9D113 DNL-type zinc finger protein1.3e-0546.34Show/hide
Query:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNL
        +TC  C  R+ + I++LAY +G+V V C GCQ +H + DNL
Subjt:  FTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNL

Arabidopsis top hitse value%identityAlignment
AT1G68730.1 Zim17-type zinc finger protein1.2e-2857.8Show/hide
Query:  PVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQ
        P  R R + VF    L D +  N ++ +  ++++ A+IDIK PRRSL VEFTCN C  RTKR+INR AYE+GLVFVQCAGC K+HKLVDNLGLIVEYDF+
Subjt:  PVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQ

Query:  EEDMDVDSN
        E   D+ ++
Subjt:  EEDMDVDSN

AT3G54826.1 Zim17-type zinc finger protein5.6e-0738Show/hide
Query:  PRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLG
        PR   ++ FTC  C+ R+ ++ +R +YE G+V V+C GC   H + D  G
Subjt:  PRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLG

AT5G27280.1 Zim17-type zinc finger protein6.8e-1352.24Show/hide
Query:  KFPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVD
        K PRR + V FTCN C  RT R IN  AY  G VFVQC GC  +HKLVDNL L  E  +       D
Subjt:  KFPRRSLLVEFTCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCAGCCATGACAGCGGCAAAAACTCTGTTCGTTTCCTCTCCCGCGCTTCTTCAGCACTCCTTGAACTCTTCTCAGATTTGGCCGTACCCTTCTATAAGGGCATTCAGACC
TATTATCTCTTCGAAAGCAAACCCTAGCAATGATGTCTTCATTAGGAGTAGGAACTTCCATACAGCACCAGTTATTCGGGAACGCAGATACAAGGTGTTTGCGGTTTCTA
GATTGGTGGATGGTAATTCTGGCAACGACAATGATTCTGCACGCAGAAATTCGGATATGGGGGCAGCCATTGACATAAAGTTTCCAAGAAGAAGTTTGCTGGTGGAATTT
ACATGTAACCAATGTAATGGGAGGACAAAGAGGATTATAAATCGATTAGCTTACGAACGGGGGCTTGTTTTTGTTCAATGTGCAGGTTGTCAAAAGTATCACAAACTGGT
CGACAATCTTGGCCTCATAGTAGAGTATGACTTCCAGGAGGAAGACATGGATGTGGACTCGAATTCAGATCAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
GCAGCCATGACAGCGGCAAAAACTCTGTTCGTTTCCTCTCCCGCGCTTCTTCAGCACTCCTTGAACTCTTCTCAGATTTGGCCGTACCCTTCTATAAGGGCATTCAGACC
TATTATCTCTTCGAAAGCAAACCCTAGCAATGATGTCTTCATTAGGAGTAGGAACTTCCATACAGCACCAGTTATTCGGGAACGCAGATACAAGGTGTTTGCGGTTTCTA
GATTGGTGGATGGTAATTCTGGCAACGACAATGATTCTGCACGCAGAAATTCGGATATGGGGGCAGCCATTGACATAAAGTTTCCAAGAAGAAGTTTGCTGGTGGAATTT
ACATGTAACCAATGTAATGGGAGGACAAAGAGGATTATAAATCGATTAGCTTACGAACGGGGGCTTGTTTTTGTTCAATGTGCAGGTTGTCAAAAGTATCACAAACTGGT
CGACAATCTTGGCCTCATAGTAGAGTATGACTTCCAGGAGGAAGACATGGATGTGGACTCGAATTCAGATCAAGTTTGATTATGTTCTAACGATATCATTTCAATGTTTC
GACTTTCGTTTTGGATCGTTTCCATGACGTTTGTTCCACAGATGATGATATCTTGAAATTTCATGCCATTATTTGCCATTCCATTGTTTCCTGGAGGGACAGCAATATCA
TAATTTGACAGGCAAGAATGATTGTTGCTCTAGCATTCTGATCTCAAGGCGTGGAGTAAAAGTAGTTAGCCTTGTTGATGTATAAGTTTTACGGTCATCTGAAAGGAGAT
TGTTTTTCATTTAAAAAAAAAAAATTGAGGTTGCAATTGACAAAGGAAAAATCAATTTACTACTCTGCTTGCTTATTCTTAGGTACAAAACGACGTATGAATTCTATCAT
TTCCATTACAAACAGAACAATATGTGGAAAATATGGTGCATCTTGGTAAGAAATTTTGATGGGTGATGTGCAATGGATAGAGATGGAGTGTTTTCTTTCTTTTGCTTCTT
CCTTCTCCTTAGCTGGAAGAATTGTTGCATTACATCTGCACATTCTGATGCCAATACACCTCGTCGAATCGTCATCTTCGGGTGGAATGGATGGACAGGTGCTGGCTTCT
CAGGCCGTTCTGAGACGTTCTCCTCCCCACCATCAGGAAAAAGTCTGTGGAGGGTCAAATAAATCAGAGGAAACGAGAGTGATTGTTTATAGATTGAAATGTCAAATTTC
TAAGTCAAGTCAAACAGAACTCTTTCACGAAACAAAGTAGAAACAGAAATCAAACTGCTATCTGCCTAATCACATTCCTTATACATGTGACTATATATCAAATCCCATTG
AATGAGGAATGGCCAGAACAAAGCTAAGAGTTATTATAATTAATGGATGCTTTGATGGTAAATTGCTTACCTGATCCAGCTGCCATCAGCTCCAAGAAGCTTATTTGGTG
CTCCCCACACAAGATTTTCAATTCGAGCTTGTAGTATCGCTCCAGCGCACATAGGGCACGGCTCAAGTGTCACGTAGAGGGTGGTCTCCTGAGAATATCAACAAATGATC
CAATACCAATCATATGGTTAATAATAATAATATAAAGAGGATGCATAGCATATCTTCCTCTTCGTACAACATAAATGTTCAATGTAAGAACCGATTTAGAATTGATAAAT
GCATTGTGCTTGATTACGAAGACCACATGAATAAATAGGCGGTAAAAGTAGTAACAGAAAATTACAGCAAGCCTCCACGTCTTTAATAGCTTTGAAGCCTCCCGAATGCA
GATCATTTCGGCATGGGCTGTGGAATCTCGAAGCTCTTCCACCCTAAAAATTGAACTCATCAACATCCAAAAGAAAGCAATCAACTGATCAAGATAGGCATTCTGCCAAA
GTACTTACAGATTGCAGCCACGAGCAATAATTTTTCCATGTTTGACCAATACTGCTCCAACAGGCACCTCCCAAGTATCAGCCGCCTTCTTGGCTTCAGCAAGTGCTTCC
CTCATAAACATTTCATCAATTTTTCGCTGCTCACTTTCGAGTATGTATGCTTCTTCCCATTCATCAAACCGATCTCTTAGAACTTGCTTGTTCCTTTGAAGCTTCCTTTG
TTTCACTTCCCCATCCTTGGTTCCAGTTGTTGATATCTCTGGTAATGTTGCATCAGAAGAACGTCCCAACTGCTCCGCAGAGCTGCTTGCTAAGGCATCATCTTTACCAC
TTCGAGAAATTTCTTCAATAATTGGAGATCTTCTCATGCCTTGGGGAGGCAATGGTAATGATGACTGTTCTATATCCAAACCAGAAGAAATAACGTCAACCTTCCCACCA
TCTGTCAAGATAACTTCTCCAGAAACCAACAAATTCCCACCTGACGGCTTCCTCTCCATTATATTTGGGGAAGATGGGGTATCTACTTCATAATATATATATTTTTCTTT
CCTATCACCAGATGAATCTGGACTCCGGGCAGAAGGATTTGGTTCTTCCAGCTGAACTGAAGAAGAAGTGGTTCTTCCCATTTTTGTGTTATCAGTCTCCTCATGTTCAC
GGCCAGAAAACCATGTCTCATTACTAACAGACTCATTTGGTGAATTCCTTCCACCTGATCTTAAAGCTGAATCAGAGGTTTCAGCACGTGAACTCCAACGGAGATGAACT
ATGTCTGAAATGACGTTCCACAAAGACCTACCACTTCTCTTGACAATTGCATTTTCACTATGTGCACTAACCTCAGGATCATCAGTTTTGGGAGGCTGCTCAGTAGTTGA
GTCCATTACATGCCACATCTCATCAGAAGGACCCTTGGTTCCAGAGCTCCGTGATGAGAGCCTTGAGTCATGGTCCTTCATTTGAGAATCTACAGAAATCTTCTGCCCAT
AATTCTCTTCCTGGAGTAAATCTACTTCAGAGATATCCTCTTCCTTCTGGGTTTCAGAAATTAAGAGTTCATTCCTGGACTTCTCAATGAACTCTCCAACAAACTGTGCA
GACGATCTCTCTAGACGGTCAGCTGAACCTAGAGAATCATCAGGAGTAATAAGGTATACAGGCTCCCCAGTACTCTCATCTGCCCCACTTTCTCTATAAGATTTAGAGTG
CAAAGCTGGACTTCCCCGTGACTGCATGTAGGGAGCACCTGAACTGCTTCCTGATGTTCTTCTGGATACTACTTGGCTTGCCATTTCACTAGTAGAATCTACTCGCAATG
GATCTCTAGCTACAAGCTGAGATGGAGGAGGCATTAACACTGCCTGTGTGCTTTTATTTTCTTCCTCATTTACACCAATCCGTTGATGACTACTTTGGGAAATTAAATTC
ATGGATCCTTGAACTTTTGATTTTTTGACTTGATTTTTAGCCTCTTTTCCCAATTCAGTATGCACCACACGGTCATTATTTTGACTATTTCTATTGGTTTCGTCGACAGT
TTTTTCCACTAATGATTTATTAGTTTCTTGCTTGAAAATTCTATTTTCAAATGTTTGTTCAGAGGATGAAGCTGATAATTTATCGGTAGCGTGAACCACTGAAATGTTAG
AGGCATCTCGGGAACCTTTTCTGGACATAACTTGCTGAGAATCCTTTTGATCTGTCGAAGACTTTATGTCTTTAGCCACACGATTTGATGCATTAACCGATACTGCATTT
TGTTCCTCATTTCGTCTTCTAATTTCAGAACTTGAAACAGAGGTGTTGGTGTTGCCCACTTTGGTTACTGATAGAACACCAAGATGAAGAAAATTTTGTCTTGAATCCTT
TTCTCCAATGGCATGATTTCTCATTTGACGATGTCTTTCTTCAATCTCTTGATTAGAAGTTACAAGCACATCTGTACTTTCACCTGCATTTATCACTGCATCAGACTGAC
TTGTCTGATGAGAGATCCTTTCTCTCTCAATTTTCCTATTTTCTGAAACTCCCAAAAGTTGTTGAAATGAAGAACTTCGCCTTGAAATTTTTTCTTTATGGACATGACCA
ACAGTTTCATGATTGTCGCTGCTTGCTTTTTTTATGAGATTTGCATCTAATTCTAGGTTTTCATTCCCACTGGTAAGTCTTTTCTGAGAACTAGAAATTGCTTCTGCATT
ACTGCTGTTTATTTGTGATACTTCTGCTAGTTTTTTCCCTTCATTTATAAACACACCACCAACTTTCTTACCCGATACATCATATTGTTTTGATTGCTCATTCAAAGTCA
CTGCCATTTTCAATTCCTCTTCCTTGTCAACCAACTTCTTCTTGGAACTTGAAGTACTTTCATAGCCACTTTTCCTGGTCTTTGACAATCTTGAGTTCATCTCAGAGCTT
TCATTTAAGGAGGATGTTACTCTTGTTGATACTTCACTAATTTTCTTCTCTGAGTTCTTCCTCAAATGCCAATCAGCACAATTCCCTATCGTAATATCACCTAGCACTGA
TTCCTCTTCTCGCTCCTTCAAATTGTCTCCTTGTCTCTTGAACTCTTCCTTGATCTGTCCATCTAATCTTTCTCCCACATCACTCATCGTGTCATATCTGTACCCACTCG
ACGATTCCTCCACAAATTGCACATTCTTGTCTTCAACTTCAGCATCGTTCTCAATATCCCCAGACGAGGAAAGTGAATAATAAGATGAACAAGTAGACCCTTCTTTTCTT
AAACACTGACTGTCACCTCTTGAATGAACTAACTGATCTCCATTCCGTCTATATCCATCTTTGCTCAACTCAACTGCAATTGATCCAACTCTATTACTTTGTTTACTATT
TAAAGAACCATACTTATTGATTTTCCTCTCTTCTCCATCACCTCTACTATCGCTCTCTCTTTTTCTGTCTACTCTTATCTTTTCCAATGAACTTAACTTTTTCTCCTGAA
CACCTCTTCTCCCCCTTCTCTCCAGTTCTACTCTTCTCGACGAACTTATACTTCTCTCCTTAGCCCCGGACTTTCCTCGGCTCCCCAGTTCCACCCCCCTAGATGACTTA
ATCTTTTCCTGGCTACTAAAACCCTCCCCTATCAAGCTGATCATTGCCTCAGCAATGTCTTCTTCATCATATCCATCCGACCTATCAAATCCACAATCACTCTCAGAAAT
CATACAACAGCTTCTGTATCTCCTTCCCTCACTACAATGACAACTCCTACCATCACTAATGGAGTAGGGTACCTCATAACAACCCTGAACACGCCCATACTCCGGAAGCG
TGTAACAGAACCGGTTTCGGCCACCCAACATCAACCTTCTAGAAATCGACCATTGAAGAAGGGTGGATTGCCTCGGCCCATAGAAAATGCTAGGGCCTAAAGCCACTCTG
TGGATTGGAAATGCATAACAAACGCAACAAGAACAACACGATGATGATGAAAGATTCAATGGGTGGTTTCCATTACATCTTTCATTTAAGAAGATGGGACGCTCATTAAA
GCAATGGGAAATAGGCCCTTTGCTCCGGATGGAGTATACAGTGGAGCTGACATAATTGTTGTACATAATTTTTGTAATTCTACTCACCTGAAGAGAAGGCTCTCGAGGAG
TTGAGGGATCGCGAAAACTGAAGCTCGGAGAAGAACAACAACCATCTGGGGCCGCGTGGATTTTGATTTTTGAGTAGAGTAACTCACCTTCTGCAACAATGGAAGTAAAA
AACAATTGACAGCTCGATGCCGCAGCTTTGAATCGGTGGGTTGACTCGGAGAGAGAAGCTACTCCACGCTATTGTCTCCATAACTTCGGCCAGATTGCTGGGCTTGTAAT
GAGTTATATTTCAGGCTTGGCCCAAATGACATTTAGAAATGGGCTTTTATTAACACAGCCTTGCCCAAATAACATTAGGGATGGGCTATTATTAACGTGGCCCGGCCCAA
ATACACATTTTCCTTAAAATACAAAAAAAT
Protein sequenceShow/hide protein sequence
AAMTAAKTLFVSSPALLQHSLNSSQIWPYPSIRAFRPIISSKANPSNDVFIRSRNFHTAPVIRERRYKVFAVSRLVDGNSGNDNDSARRNSDMGAAIDIKFPRRSLLVEF
TCNQCNGRTKRIINRLAYERGLVFVQCAGCQKYHKLVDNLGLIVEYDFQEEDMDVDSNSDQV