; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G200630 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G200630
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationCicolChr10:27601836..27606400
RNA-Seq ExpressionCcUC10G200630
SyntenyCcUC10G200630
Gene Ontology termsGO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136694.1 uncharacterized protein LOC101213732 isoform X1 [Cucumis sativus]7.6e-10374.81Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK
        M H++VAVS Q PQL+INPN +  SKCYVH KKK+    YSN++CFA+K NNSN N  Q+PPIFSL+FSSF PLSESPQASFDDYIEDEARLLRATF GK
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK

Query:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLL
        SEK+NQD+WRVEMPSFQ+L +KVSPVADVRL+C+  SST+D PIHIP +VSKFIDLQLM WE+KGL  D K  K  INV GA+YAERT+SKSVLTNNLLL
Subjt:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLL

Query:  NLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY
        NL+N A   P+DFFAQDF QPL EKGLKGMMEE M EFTENLLLDY+KYKKE Q+NEVP+NY
Subjt:  NLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY

XP_008443384.1 PREDICTED: uncharacterized protein LOC103486982 [Cucumis melo]3.2e-10173.86Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA
        M HNLVAVS Q PQLIINPN +  SKCYVH KKK+    YSN++CFA+K N      N+NN +Q+PPIFSL+FSSFHPLSESPQASFDDYIEDE RLLRA
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA

Query:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLT
        TF GKSEK++QD WRVEMP+FQ+L +KVSPVADVRL+C+SC  T+D PIHIPH+VSKFIDLQLM WE+KGL  D K  K  INV GA+YAERT+SKSVL 
Subjt:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLT

Query:  NNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV
        NNLLLNL+N A P P+DFFAQDF QPLAEKGLKGMMEE M EF ENLLLDY+KYKKEKQ +NEV
Subjt:  NNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV

XP_022934711.1 uncharacterized protein LOC111441814 isoform X2 [Cucurbita moschata]5.1e-8367.07Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLIIN     KC  HQ++  + +   FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G D KPQ F I+V G +YA RT  ESKS+L N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
         +P P      DF QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

XP_038905853.1 uncharacterized protein LOC120091799 isoform X1 [Benincasa hispida]4.0e-12083.59Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNLVAVSFQ PQLIIN N RSKCYVH KKK+YS+++CFAVKNNNSN++HQ+PPIFSL+FSSFHPLSESPQASFDDYIEDEARLLR TF GKSEK+NQD
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA
        EWR++MPSFQL   +VS VADVRLNCRS ++ QDYPIHIPHHVSKFIDLQLMRWE+KGLGT+ KPQ+FTINV GALYAERTESKS+LTNN +LNLHNFAA
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA

Query:  PTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG
        PTP DFFAQDF QP AEKGLKGMMEETMNEFTE LLLDYSKYKKEKQ+NEV AN G
Subjt:  PTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG

XP_038905855.1 uncharacterized protein LOC120091799 isoform X2 [Benincasa hispida]2.2e-8667.58Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNLVAVSFQ PQLIIN N RSKCYVH KKK+YS+++CFAVKNNNSN++HQ+PPIFSL+FSSFHPLSESPQASFDDYIEDEARLLR TF GKSEK+NQ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA
                                                           MRWE+KGLGT+ KPQ+FTINV GALYAERTESKS+LTNN +LNLHNFAA
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA

Query:  PTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG
        PTP DFFAQDF QP AEKGLKGMMEETMNEFTE LLLDYSKYKKEKQ+NEV AN G
Subjt:  PTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG

TrEMBL top hitse value%identityAlignment
A0A0A0LC26 Uncharacterized protein3.7e-10374.81Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK
        M H++VAVS Q PQL+INPN +  SKCYVH KKK+    YSN++CFA+K NNSN N  Q+PPIFSL+FSSF PLSESPQASFDDYIEDEARLLRATF GK
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK

Query:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLL
        SEK+NQD+WRVEMPSFQ+L +KVSPVADVRL+C+  SST+D PIHIP +VSKFIDLQLM WE+KGL  D K  K  INV GA+YAERT+SKSVLTNNLLL
Subjt:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLL

Query:  NLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY
        NL+N A   P+DFFAQDF QPL EKGLKGMMEE M EFTENLLLDY+KYKKE Q+NEVP+NY
Subjt:  NLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY

A0A1S3B8N8 uncharacterized protein LOC1034869821.5e-10173.86Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA
        M HNLVAVS Q PQLIINPN +  SKCYVH KKK+    YSN++CFA+K N      N+NN +Q+PPIFSL+FSSFHPLSESPQASFDDYIEDE RLLRA
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNYMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA

Query:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLT
        TF GKSEK++QD WRVEMP+FQ+L +KVSPVADVRL+C+SC  T+D PIHIPH+VSKFIDLQLM WE+KGL  D K  K  INV GA+YAERT+SKSVL 
Subjt:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLT

Query:  NNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV
        NNLLLNL+N A P P+DFFAQDF QPLAEKGLKGMMEE M EF ENLLLDY+KYKKEKQ +NEV
Subjt:  NNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV

A0A6J1F2K4 uncharacterized protein LOC111441814 isoform X22.5e-8367.07Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLIIN     KC  HQ++  + +   FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G D KPQ F I+V G +YA RT  ESKS+L N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
         +P P      DF QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

A0A6J1F3D2 uncharacterized protein LOC111441814 isoform X15.7e-8061.76Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESP-----------------------QASFDD
        M HNL AVSF FPQLIIN     KC  HQ++  + +   FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP                       QASFD+
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESP-----------------------QASFDD

Query:  YIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALY
        YI DE RLLRATF GKSEKLN+ EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G D KPQ F I+V G +Y
Subjt:  YIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALY

Query:  AERT--ESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
        A RT  ESKS+L N+L+L+LH+F +P P      DF QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AERT--ESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

A0A6J1J0I3 uncharacterized protein LOC1114823523.2e-8366.4Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLII+     KC  HQK+ ++ +   FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SP+ DVRL+CRSC+  +DYPIHIP HVSKF+DLQ+MRWEV+G+G D K Q F I+V GA YA RT  ESKSVL N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEV
         +  P      DF QP AEKGLKGMM+E+M +FT+NL+LDY+KYKKEKQ   V
Subjt:  AAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G39520.1 Protein of unknown function (DUF1997)5.3e-3033.85Show/hide
Query:  FSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSE--KLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRW
        +S + S+   L ESPQA FD+Y+ED++R+  A F  K +  +LN++EWR++M   +   +   PV  +R+ C+  S+ QDYP  +P H++K ++L + +W
Subjt:  FSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSE--KLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRW

Query:  EVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQEN
        E++GL   ++P  FT+ V GALY +R    + L   L   + +F  P+ L    +D  + +A   L G+++   +   E+L+ DYSK+K E++++
Subjt:  EVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQEN

AT5G39530.1 Protein of unknown function (DUF1997)5.5e-3538.27Show/hide
Query:  PPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK--SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQL
        P  +S R S+  PL+ESPQA FD+Y+ED++R+  A F  K  S +LN++EWR++M     L + V PV D+RL C+  S+ QDYP  +P  ++K ++L +
Subjt:  PPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK--SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQL

Query:  MRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
        MRW+++GL   ++P  F++ V GALY +R    + L   L +N+ +F  P  L+   +D  + LA   L G++E   ++   +LL DYS++K E++
Subjt:  MRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCATAATTTGGTTGCTGTTTCTTTTCAATTCCCACAGCTTATTATCAATCCAAACAACAGATCAAAATGTTATGTTCATCAGAAAAAGAAGAACTATTCTAATTA
TATGTGTTTTGCAGTGAAAAATAATAATAGTAATAATAATCATCAAAGCCCTCCAATTTTCTCTCTCAGATTCTCAAGTTTCCATCCACTTTCTGAGTCTCCTCAGGCTT
CCTTTGATGATTACATTGAAGATGAAGCTAGATTGTTGAGAGCCACTTTTGTTGGAAAAAGTGAAAAATTAAACCAGGATGAATGGAGAGTTGAAATGCCATCTTTCCAA
TTGCTTTTGGTCAAGGTCAGCCCGGTAGCTGACGTAAGATTAAATTGTAGAAGCTGTAGCTCTACTCAAGATTACCCTATTCATATTCCTCACCATGTCTCCAAATTTAT
TGACCTTCAATTGATGAGATGGGAGGTGAAGGGATTGGGCACAGATTTGAAACCACAAAAGTTCACAATCAATGTAAATGGAGCTTTGTATGCTGAAAGAACAGAATCAA
AAAGTGTTCTCACAAATAATTTACTTCTCAATCTTCACAATTTTGCTGCCCCAACACCCCTTGATTTCTTTGCACAAGATTTTTTTCAACCTCTTGCAGAAAAGGGATTG
AAGGGAATGATGGAGGAAACAATGAATGAATTTACAGAAAATTTGCTCTTGGATTACAGCAAATACAAGAAGGAGAAGCAAGAGAATGAAGTTCCAGCCAATTATGGCTA
A
mRNA sequenceShow/hide mRNA sequence
ATGGGTCATAATTTGGTTGCTGTTTCTTTTCAATTCCCACAGCTTATTATCAATCCAAACAACAGATCAAAATGTTATGTTCATCAGAAAAAGAAGAACTATTCTAATTA
TATGTGTTTTGCAGTGAAAAATAATAATAGTAATAATAATCATCAAAGCCCTCCAATTTTCTCTCTCAGATTCTCAAGTTTCCATCCACTTTCTGAGTCTCCTCAGGCTT
CCTTTGATGATTACATTGAAGATGAAGCTAGATTGTTGAGAGCCACTTTTGTTGGAAAAAGTGAAAAATTAAACCAGGATGAATGGAGAGTTGAAATGCCATCTTTCCAA
TTGCTTTTGGTCAAGGTCAGCCCGGTAGCTGACGTAAGATTAAATTGTAGAAGCTGTAGCTCTACTCAAGATTACCCTATTCATATTCCTCACCATGTCTCCAAATTTAT
TGACCTTCAATTGATGAGATGGGAGGTGAAGGGATTGGGCACAGATTTGAAACCACAAAAGTTCACAATCAATGTAAATGGAGCTTTGTATGCTGAAAGAACAGAATCAA
AAAGTGTTCTCACAAATAATTTACTTCTCAATCTTCACAATTTTGCTGCCCCAACACCCCTTGATTTCTTTGCACAAGATTTTTTTCAACCTCTTGCAGAAAAGGGATTG
AAGGGAATGATGGAGGAAACAATGAATGAATTTACAGAAAATTTGCTCTTGGATTACAGCAAATACAAGAAGGAGAAGCAAGAGAATGAAGTTCCAGCCAATTATGGCTA
A
Protein sequenceShow/hide protein sequence
MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNYMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQ
LLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDLKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDFFQPLAEKGL
KGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG