; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G203840 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G203840
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionprotein HAPLESS 2 isoform X1
Genome locationCiama_Chr11:3260410..3264321
RNA-Seq ExpressionCaUC11G203840
SyntenyCaUC11G203840
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142706.1 protein HAPLESS 2 isoform X1 [Cucumis sativus]4.7e-11287.82Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGM+ ILSYFVLLLLYLLHKIGIF CIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC FMCIKLA+VKRTRRRH RRRD+EEEFE EEGKC+H STSDS+NV +HVES+SSRRVS++WRRNHR SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        +RK V+VGNHLNEI SFGMYGSSK VHKERKYRRGR R
Subjt:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

XP_008444189.1 PREDICTED: protein HAPLESS 2 isoform X1 [Cucumis melo]1.5e-11389.08Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANL+KMGM+ ILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC+FMCIKLA+VKRTRRRH RRRDLEEEFESEEGKCQH STSDSSNV +HVESRSSR  SR+WRRNH+ SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        +RK V+VGNH NEI SFGMYGSSK VHKERKYRRGRQR
Subjt:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

XP_022154011.1 uncharacterized protein LOC111021352 [Momordica charantia]3.3e-9776.99Show/hide
Query:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF
        +MGNVASS+ASG FSA+GK+F SPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLK+GM+LILS FV+LLLYLLHKIGIFGCI RGLCRM WTC++SYF
Subjt:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF

Query:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK
        YAW+YCCTFMCIKL +VKRTRRR  RRRDLEEEFESE GK ++GS+SDSS+VP+ +E RSS+R SR+WR NHR SQMRK+LRPK  GIRVRSGR LVYGK
Subjt:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        H+RK  +V N L EIHS G +GSSK VH+E +Y+RGRQ+
Subjt:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

XP_038897053.1 uncharacterized protein LOC120085227 isoform X1 [Benincasa hispida]1.4e-11690.76Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSL SGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCV+NLLKMGM+LILSYFVLL L LLHKIGIFGCIGRGLC+MIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCCTFMCIKLA+VKRTRRRH RRRDLEEEFESEEGKCQHGSTSDSSNVP+HVESRSSRR  R+WRRNHR S+MRKSLRP+GHGIRVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        +RK V+VGNHLNEIHSFGMYGSSK VHKERKYRR  QR
Subjt:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

XP_038897054.1 uncharacterized protein LOC120085227 isoform X2 [Benincasa hispida]9.5e-9779.83Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSL SGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCV+NLLKMGM+LILSYF                             +SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCCTFMCIKLA+VKRTRRRH RRRDLEEEFESEEGKCQHGSTSDSSNVP+HVESRSSRR  R+WRRNHR S+MRKSLRP+GHGIRVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        +RK V+VGNHLNEIHSFGMYGSSK VHKERKYRR  QR
Subjt:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

TrEMBL top hitse value%identityAlignment
A0A0A0L1V0 Uncharacterized protein2.3e-11287.82Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGM+ ILSYFVLLLLYLLHKIGIF CIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC FMCIKLA+VKRTRRRH RRRD+EEEFE EEGKC+H STSDS+NV +HVES+SSRRVS++WRRNHR SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        +RK V+VGNHLNEI SFGMYGSSK VHKERKYRRGR R
Subjt:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

A0A1S4DVV8 protein HAPLESS 2 isoform X17.1e-11489.08Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANL+KMGM+ ILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC+FMCIKLA+VKRTRRRH RRRDLEEEFESEEGKCQH STSDSSNV +HVESRSSR  SR+WRRNH+ SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        +RK V+VGNH NEI SFGMYGSSK VHKERKYRRGRQR
Subjt:  KRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

A0A6J1DMF3 uncharacterized protein LOC1110213521.6e-9776.99Show/hide
Query:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF
        +MGNVASS+ASG FSA+GK+F SPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLK+GM+LILS FV+LLLYLLHKIGIFGCI RGLCRM WTC++SYF
Subjt:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF

Query:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK
        YAW+YCCTFMCIKL +VKRTRRR  RRRDLEEEFESE GK ++GS+SDSS+VP+ +E RSS+R SR+WR NHR SQMRK+LRPK  GIRVRSGR LVYGK
Subjt:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        H+RK  +V N L EIHS G +GSSK VH+E +Y+RGRQ+
Subjt:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

A0A6J1FIX0 uncharacterized protein LOC1114458932.6e-8474.37Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLASG+F A+ KVFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGM++IL+YFVLLLLYL HKIGIFGCIGRGLCRMIWTCL+SY +
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEE-GKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK
        AWEYCCTFMCIKLA+VKRTRRRH RRRDLEEE ESEE  K ++GS+SDSSN    +ESR S+RVSR+ RR+HR SQ  K+LRP  HGIRVRSGRVLVY K
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEE-GKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQ
        H                    GSSK V KER YRRGRQ
Subjt:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQ

A0A6J1IVE5 uncharacterized protein LOC1114803121.4e-8272.8Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLASG+F A+ KVFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGM++IL+YFVLLLLYL HKIGIFGCIGRG CRMIWTCL+SY +
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFES-EEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK
        AWEYCCTFMC+KLA+VKRTRR H RRRDLEEE ES EE K ++GS+ DSSN  + +ESR S RVSR+ RR+HR SQ  K+LRP  HGIRVRSGRVLVY K
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFES-EEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR
        H                    GSSK V K+RKYRRGRQR
Subjt:  HKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR

SwissProt top hitse value%identityAlignment
F4JP36 Protein HAPLESS 24.4e-0436.51Show/hide
Query:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G++L L     LLL+LLH+ G+F
Subjt:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIF

Arabidopsis top hitse value%identityAlignment
AT1G21722.1 unknown protein1.3e-3847.09Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNV  S  +G   +IG  FGSPLDFLSG+SCSSVC S WDFICY+ENFCVANL K  +ILILSYF L  +Y+L+K+G + CI  G  +++W  +S +FY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRV
           YCC+F C  L + KR RRR    R +EE+++            D+S+  D V+   S    R  R   +  ++RKSLRP+ H +RV
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRV

AT1G78922.1 unknown protein4.4e-1529.35Show/hide
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGN     ++ I   IG +F +PL    GRSC  VC   WD  C+IE+FC+ ++ K+ ++  L + +L+ + LL K+GI  C+ + +C+M     ++Y++
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEE---EFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLR
              + +C  L N+ R  RR  R  D+E    ++ S++      S+SDS +  D++  +  RR       +H     R + R
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEE---EFESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLR

AT4G11720.1 hapless 23.1e-0536.51Show/hide
Query:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G++L L     LLL+LLH+ G+F
Subjt:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTTGCTAATGGGCCGACAGTCCAAATCCTCCAAATTTAACACTCAAAACTTCTTCTGCTCTGCCCGACGGTTACCCGCCATTCTTCCTCCTTCTGCTGTT
CTTCATTTCCCGGTGCTCTTCAACCATTATAATCCATGTCTTGGGTTTTCACTGTTTCGTCTTATTTCGCAGGAGTGTTTTGTAATGGGTAATGTAGCCAGTTCA
TTGGCCTCTGGTATTTTTTCGGCCATTGGCAAAGTCTTTGGATCCCCACTTGATTTTCTTTCTGGAAGATCCTGCAGTTCAGTTTGTGGATCAACATGGGATTTC
ATATGCTACATAGAAAACTTCTGTGTTGCCAATTTGCTAAAGATGGGAATGATCTTGATCCTTTCATACTTTGTTCTTTTGCTCCTGTATTTATTACATAAAATT
GGCATCTTTGGATGTATTGGTCGGGGGCTCTGCAGAATGATATGGACTTGTTTATCTTCTTATTTCTATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAG
CTTGCCAATGTCAAAAGAACAAGAAGACGGCACTTTAGGAGAAGAGACCTTGAAGAAGAGTTTGAAAGCGAAGAAGGAAAATGTCAGCATGGGTCAACAAGTGAC
TCGAGTAATGTCCCCGACCATGTCGAGTCGAGAAGTAGCAGACGGGTGTCTCGTCAATGGAGGAGGAACCACAGGCGTTCTCAAATGAGAAAATCGTTGAGGCCG
AAAGGTCATGGAATTCGAGTCAGGAGTGGTAGAGTGTTGGTCTATGGTAAGCATAAAAGAAAATGTGTTAAGGTTGGGAATCATTTGAATGAGATACATAGCTTT
GGGATGTATGGATCATCCAAATGTGTGCATAAAGAAAGGAAATATAGAAGAGGAAGGCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
AGAGAAACAATTGGGCTTTTATGGGCTTGCTAATGGGCCGACAGTCCAAATCCTCCAAATTTAACACTCAAAACTTCTTCTGCTCTGCCCGACGGTTACCCGCCA
TTCTTCCTCCTTCTGCTGTTCTTCATTTCCCGGTGCTCTTCAACCATTATAATCCATGTCTTGGGTTTTCACTGTTTCGTCTTATTTCGCAGGAGTGTTTTGTAA
TGGGTAATGTAGCCAGTTCATTGGCCTCTGGTATTTTTTCGGCCATTGGCAAAGTCTTTGGATCCCCACTTGATTTTCTTTCTGGAAGATCCTGCAGTTCAGTTT
GTGGATCAACATGGGATTTCATATGCTACATAGAAAACTTCTGTGTTGCCAATTTGCTAAAGATGGGAATGATCTTGATCCTTTCATACTTTGTTCTTTTGCTCC
TGTATTTATTACATAAAATTGGCATCTTTGGATGTATTGGTCGGGGGCTCTGCAGAATGATATGGACTTGTTTATCTTCTTATTTCTATGCATGGGAGTACTGCT
GCACTTTCATGTGTATCAAGCTTGCCAATGTCAAAAGAACAAGAAGACGGCACTTTAGGAGAAGAGACCTTGAAGAAGAGTTTGAAAGCGAAGAAGGAAAATGTC
AGCATGGGTCAACAAGTGACTCGAGTAATGTCCCCGACCATGTCGAGTCGAGAAGTAGCAGACGGGTGTCTCGTCAATGGAGGAGGAACCACAGGCGTTCTCAAA
TGAGAAAATCGTTGAGGCCGAAAGGTCATGGAATTCGAGTCAGGAGTGGTAGAGTGTTGGTCTATGGTAAGCATAAAAGAAAATGTGTTAAGGTTGGGAATCATT
TGAATGAGATACATAGCTTTGGGATGTATGGATCATCCAAATGTGTGCATAAAGAAAGGAAATATAGAAGAGGAAGGCAAAGATGATATCAGCTGGCTGGGATAT
GAACTCTATATTTGTGCTAAAATTCATTCTAAAAATGTCTTGATCTGTTTGTCTTTTGAGAAGGTTTTTTGCGCTTTTTTCCCCCAACTCTCATGATTATTTTAT
CATTCTCCATAGTTTCTATCCTATATAGGTAGCACTTTTAGAGTTAATAATTAAGTTTTCACGACATTTTTAAAGAATTGCAAATAT
Protein sequenceShow/hide protein sequence
MGLLMGRQSKSSKFNTQNFFCSARRLPAILPPSAVLHFPVLFNHYNPCLGFSLFRLISQECFVMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDF
ICYIENFCVANLLKMGMILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFYAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEFESEEGKCQHGSTSD
SSNVPDHVESRSSRRVSRQWRRNHRRSQMRKSLRPKGHGIRVRSGRVLVYGKHKRKCVKVGNHLNEIHSFGMYGSSKCVHKERKYRRGRQR