; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G14570 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G14570
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionHomeobox Hox-B3-like protein
Genome locationClcChr05:14911800..14916590
RNA-Seq ExpressionClc05G14570
SyntenyClc05G14570
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607056.1 hypothetical protein SDJN03_00398, partial [Cucurbita argyrosperma subsp. sororia]1.3e-9288.15Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA  FDH TQTHL FHSIDPRSLLLHQNSA  H   LQLT E FSMERGPRYRAYAELRESKLRLRN MYRD EQPEKSTPPAKKQV+FVGSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGR
         VAQSVPDFSA LRKENR+PPPGLSPVMEMTPPGKTWGK NIGG+S NSRGSKSASAGEKRGGGL A RKSYAGFEELKGFSTAAANAINGEN+R GGGR
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGR

Query:  RGKTVLGIRQI
        RGKTVLG+RQI
Subjt:  RGKTVLGIRQI

KAG7036757.1 hypothetical protein SDJN02_00377, partial [Cucurbita argyrosperma subsp. argyrosperma]4.3e-9388.63Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA  FDH TQTHL FHSIDPRSLLLHQNSA  H   LQLT E FSMERGPRYRAYAELRESKLRLRNAMYRD EQPEKSTPPAKKQV+FVGSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGR
         VAQSVPDFSA LRKENR+PPPGLSPVMEMTPPGKTWGK NIGG+S NSRGSKSASAGEKRGGGL A RKSYAGFEELKGFSTAAANAINGEN+R GGGR
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGR

Query:  RGKTVLGIRQI
        RGKTVLG+RQI
Subjt:  RGKTVLGIRQI

XP_022998093.1 uncharacterized protein LOC111492844 [Cucurbita maxima]4.6e-9588.57Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA  FDHLTQTHLPFHSIDPRSLLLHQNSA  H   LQLT E FSMERGPRYRAYAELRESKLRLRN+MYRD EQPEKSTPPAKKQV+FVGSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRR
         VAQSVPDFS+ LRKEN++PPPGLSPVMEMTPPGKTWGK NIGG+S NSRGSKSASAGEKRGGGL A RKSYAGFEELKGFSTAAANAINGEN+RGGGRR
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRR

Query:  GKTVLGIRQI
        GKTVLG+RQI
Subjt:  GKTVLGIRQI

XP_023524771.1 uncharacterized protein LOC111788608 [Cucurbita pepo subsp. pepo]1.3e-9489.15Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA  FDHLTQTHLPFHSIDPRSLLLHQNSA  H   LQLT E FSMERGPRYRAYAELRESKLRLRNAMYRD EQPEKSTPPAKKQV+FVGSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK--NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGG
         VAQSVPDFSA LRKENR+PPPGLSPVMEMTPPGKTWGK  NIGG+S NSRGSKSASAGEKRGGGL A RKSYAGFEELKGFSTAAANAINGEN+R GGG
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK--NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGG

Query:  RRGKTVLGIRQI
        RRGKTVLG+RQI
Subjt:  RRGKTVLGIRQI

XP_038903179.1 uncharacterized protein LOC120089840 [Benincasa hispida]2.6e-9893.33Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA KF+HL QTHLPFHSIDPRSLLLHQNSAAD  ISLQLT EPFSMERGPRYRAYAELRESKLRLRNAMYRD EQPEKSTPPAKKQVKF+ SET+RKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGRR
        AVAQSVPDFSA LRKENRKPPPGLSPVMEMTPPGKTWGK IGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEE+KGFSTAAANAINGENRR GGGRR
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGRR

Query:  GKTVLGIRQI
        GKTVLGIRQI
Subjt:  GKTVLGIRQI

TrEMBL top hitse value%identityAlignment
A0A0A0L9P8 Uncharacterized protein2.3e-9290Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MAHKF      HLPFHSID RSLLLHQNSAADH ISL LTPEPFSMERGPRYRAYAELRESKLRLRNAMYR +E PEKSTPP KKQVKF+GSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPG-LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRR
         VAQSVPDFSA LRKENRKPPPG LSPVMEMTPPGKTWGKNIGGLST SRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENR+ GGRR
Subjt:  AVAQSVPDFSAALRKENRKPPPG-LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRR

Query:  GKTVLGIRQI
        GKTVLG+RQI
Subjt:  GKTVLGIRQI

A0A1S3CEQ6 uncharacterized protein LOC1035001181.0e-7986.01Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAA-DHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRS
        MA KF      HLPFHSID RSLLLHQNSAA DH ISL LTPEPFSMERGPRY+AYAELRESKLR RNAMYR +E PEKSTPP KKQ+KF+GSETVRKR+
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAA-DHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRS

Query:  AAVAQSVPDFSAALRKENRKPPPG-LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGE
        A VAQSVPDFSA LRKENRKPPPG LSPVMEMTPPGKTWGKN+GGLSTNSRGSKSASAGEKRGGGLT VRKSYAGFEELKGFSTA A AINGE
Subjt:  AAVAQSVPDFSAALRKENRKPPPG-LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGE

A0A5A7U2K6 Uncharacterized protein8.5e-8785.31Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAA-DHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRS
        MA KF      HLPFHSID RSLLLHQNSAA DH ISL LTPEPFSMERGPRY+AYAELRESKLR RNAMYR +E PEKSTPP KKQ+KF+GSETVRKRS
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAA-DHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRS

Query:  AAVAQSVPDFSAALRKENRKPPPG-LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGR
        A VAQSVPDFSA LRKENRKPPPG LSPVMEMTPPGKTWGKN+GGLSTNSRGSKSASAGEKRGGGLT VRKSYAGFEELKGFSTA A AINGENR+ GGR
Subjt:  AAVAQSVPDFSAALRKENRKPPPG-LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGR

Query:  RGKTVLGIRQI
        +GKTVLG RQ+
Subjt:  RGKTVLGIRQI

A0A6J1GA97 uncharacterized protein LOC1114522046.1e-9388.15Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA  FDH +QTHL FHSIDPRSLLLHQNSA  H   LQLT E FSMERGPRYRAYAELRESKLRLRNAMYRD EQPEKSTPPAKKQV+FVGSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGR
         VAQSVPDFSA LRKENR+PPPGLSPVMEMTPPGKTWGK NIGG+S NSRGSKSASAGEKRGGGL A RKSYAGFEELKGFSTAAANAINGEN+R GGGR
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR-GGGR

Query:  RGKTVLGIRQI
        RGKTVLG+RQI
Subjt:  RGKTVLGIRQI

A0A6J1KFU5 uncharacterized protein LOC1114928442.2e-9588.57Show/hide
Query:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA
        MA  FDHLTQTHLPFHSIDPRSLLLHQNSA  H   LQLT E FSMERGPRYRAYAELRESKLRLRN+MYRD EQPEKSTPPAKKQV+FVGSETVRKRSA
Subjt:  MAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVKFVGSETVRKRSA

Query:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRR
         VAQSVPDFS+ LRKEN++PPPGLSPVMEMTPPGKTWGK NIGG+S NSRGSKSASAGEKRGGGL A RKSYAGFEELKGFSTAAANAINGEN+RGGGRR
Subjt:  AVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGK-NIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRR

Query:  GKTVLGIRQI
        GKTVLG+RQI
Subjt:  GKTVLGIRQI

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.1e-0437.18Show/hide
Query:  IILYCDNNVAIENSYEPRSHKCGKHMEHKYHLIREIVQRGDVIVKQIASNHNIAYLFTKALMAK---VLEGYLGLGQE
        I +Y DN   I  +  P  HK  KH++ KYH  RE VQ   + ++ I + + +A +FTK L A     L   LGL Q+
Subjt:  IILYCDNNVAIENSYEPRSHKCGKHMEHKYHLIREIVQRGDVIVKQIASNHNIAYLFTKALMAK---VLEGYLGLGQE

Arabidopsis top hitse value%identityAlignment
AT1G67035.1 unknown protein2.3e-1234.95Show/hide
Query:  SLLLHQNSAADHLISLQLTPE--PFSMERGPRYRAYAELRESKLRLRNAMYR-----------DNEQPEKSTPPAKKQVKFVGSE--TVRKRSAAVAQSV
        SLL   N  +D    L+L  +   +  ERG RY  YA LRESKLR++    +             E+  +  P  K +  F  ++  T    S+++AQSV
Subjt:  SLLLHQNSAADHLISLQLTPE--PFSMERGPRYRAYAELRESKLRLRNAMYR-----------DNEQPEKSTPPAKKQVKFVGSE--TVRKRSAAVAQSV

Query:  PDFSAALRKENRKPPPG---LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRRGKTV
        PDFS+ +RKENR+PP     L    E+TPP            +  RGS SASAGEK+GG     RKS                        GGG  G+T+
Subjt:  PDFSAALRKENRKPPPG---LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGRRGKTV

Query:  LGIRQI
        LG RQI
Subjt:  LGIRQI

AT1G67035.2 unknown protein8.8e-2038.36Show/hide
Query:  SLLLHQNSAADHLISLQLTPE--PFSMERGPRYRAYAELRESKLRLRNAMYR-----------DNEQPEKSTPPAKKQVKFVGSE--TVRKRSAAVAQSV
        SLL   N  +D    L+L  +   +  ERG RY  YA LRESKLR++    +             E+  +  P  K +  F  ++  T    S+++AQSV
Subjt:  SLLLHQNSAADHLISLQLTPE--PFSMERGPRYRAYAELRESKLRLRNAMYR-----------DNEQPEKSTPPAKKQVKFVGSE--TVRKRSAAVAQSV

Query:  PDFSAALRKENRKPPPG---LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR---------
        PDFS+ +RKENR+PP     L    E+TPP            +  RGS SASAGEK+G G+  +RKSYA  ++LK  S AAA+AING   +         
Subjt:  PDFSAALRKENRKPPPG---LSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRR---------

Query:  ----GGGRRGKTVLGIRQI
            GGG  G+T+LG RQI
Subjt:  ----GGGRRGKTVLGIRQI

AT5G38300.1 unknown protein1.1e-2539.84Show/hide
Query:  FHSIDPRSLLLHQNSAADHLISLQLTPEPFS-MERGPRYRAYAELRESKLRLRNAMYRD-NEQPEKSTPPAKKQVKFVGS--------------------
        F S+DP SL+L QNS       L+L  + FS  ERGPRY  Y+ LRESKLR++    +   E+ E   P  KKQV+F G+                    
Subjt:  FHSIDPRSLLLHQNSAADHLISLQLTPEPFS-MERGPRYRAYAELRESKLRLRNAMYRD-NEQPEKSTPPAKKQVKFVGS--------------------

Query:  --------------ETVRKRS------------AAVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGKNIGGLSTN--SRGSKSASAGEKRGGGL
                      E V+K+S            +++AQSVPDFSA +RKENR+P      +  +TPP  T     GG+ T   SRGSKSASAGEK+  G+
Subjt:  --------------ETVRKRS------------AAVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGKNIGGLSTN--SRGSKSASAGEKRGGGL

Query:  TAV---RKSYAGFEELKGFSTAAANAINGENRRGGGRR--------GKTVLGIRQI
          +   RKSYA  E+LK  S AAA+AING    GGG R         +T+LG RQI
Subjt:  TAV---RKSYAGFEELKGFSTAAANAINGENRRGGGRR--------GKTVLGIRQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCTGTCTATCATTCTCTATTGTGATAACAATGTGGCAATTGAAAATTCTTACGAACCCAGAAGCCATAAGTGTGGAAAACACATGGAACACAAATATCATCTCAT
TAGGGAGATTGTGCAAAGAGGAGACGTAATCGTCAAGCAGATTGCGTCAAACCACAACATTGCTTATCTGTTTACAAAGGCTCTCATGGCTAAAGTGTTGGAGGGTTACC
TAGGTCTTGGACAAGAGGCCCCACCATCTCAAAAGCACATTAGAGATCTAATGATTGGGAAAACTGGCACACAGGCGTTTGAAAACGCTCCAAATACTCGAATAAGGACT
TGCCAGCTACCTGTGAAATCCCGTATATTTCCTTTTTTATCCGTGCAGCTTGCGAAGTGCTTCTGCGGATCCTCACCTGCATGTCTAGAGAAAGTTGGAAGGGCCGAGAG
GTCTCTTAACGTGTGCACTTCAGGTTGCATCATTATTCTTTCCTCTTCTTCTTCTAAAAATTCCTCAAAAAGCGTGCCTTTTGCACTGAGCATCGCGACGCTATTGCGAA
ATCGTACGCATGGATTCTTTGGAGCGCCACGTCGCCCAAAGGCAGCGATGCTACGCTATGCATTTTGCACAAGTCCTTCCCATTCTATAAACAACGCTGAACGCAGCACC
ATGGGGCGTCAGGCGTTGTGTCTGCAATGCACCAACAACAAAACAGAGCTTCCTCATCCTTCAACAATGGCTCACAAATTCGACCATCTCACCCAAACCCATCTCCCCTT
CCACTCCATCGACCCCAGATCTCTCCTTCTCCACCAGAACTCCGCTGCCGATCACCTCATTTCTCTCCAGCTCACACCGGAGCCTTTTTCCATGGAAAGAGGGCCAAGGT
ACAGGGCCTATGCAGAGCTCAGAGAATCCAAACTCCGCTTGAGAAACGCCATGTACCGGGACAACGAACAGCCGGAAAAGTCCACGCCGCCGGCGAAGAAGCAAGTTAAA
TTTGTGGGTTCGGAGACTGTTCGGAAAAGGTCGGCGGCGGTGGCGCAATCGGTACCAGATTTCTCTGCAGCGCTGAGAAAGGAGAACAGAAAGCCACCGCCGGGGTTGTC
GCCGGTGATGGAGATGACGCCACCAGGGAAGACGTGGGGGAAGAACATTGGGGGATTGTCGACGAATTCGAGGGGGAGTAAGTCGGCGAGTGCAGGGGAGAAAAGGGGCG
GCGGATTGACGGCCGTGAGGAAGAGCTATGCCGGATTTGAGGAGCTGAAAGGGTTTTCTACGGCGGCGGCGAACGCCATTAATGGTGAAAATAGGAGAGGAGGAGGAAGG
AGGGGGAAGACTGTACTGGGAATTAGACAAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCTGTCTATCATTCTCTATTGTGATAACAATGTGGCAATTGAAAATTCTTACGAACCCAGAAGCCATAAGTGTGGAAAACACATGGAACACAAATATCATCTCAT
TAGGGAGATTGTGCAAAGAGGAGACGTAATCGTCAAGCAGATTGCGTCAAACCACAACATTGCTTATCTGTTTACAAAGGCTCTCATGGCTAAAGTGTTGGAGGGTTACC
TAGGTCTTGGACAAGAGGCCCCACCATCTCAAAAGCACATTAGAGATCTAATGATTGGGAAAACTGGCACACAGGCGTTTGAAAACGCTCCAAATACTCGAATAAGGACT
TGCCAGCTACCTGTGAAATCCCGTATATTTCCTTTTTTATCCGTGCAGCTTGCGAAGTGCTTCTGCGGATCCTCACCTGCATGTCTAGAGAAAGTTGGAAGGGCCGAGAG
GTCTCTTAACGTGTGCACTTCAGGTTGCATCATTATTCTTTCCTCTTCTTCTTCTAAAAATTCCTCAAAAAGCGTGCCTTTTGCACTGAGCATCGCGACGCTATTGCGAA
ATCGTACGCATGGATTCTTTGGAGCGCCACGTCGCCCAAAGGCAGCGATGCTACGCTATGCATTTTGCACAAGTCCTTCCCATTCTATAAACAACGCTGAACGCAGCACC
ATGGGGCGTCAGGCGTTGTGTCTGCAATGCACCAACAACAAAACAGAGCTTCCTCATCCTTCAACAATGGCTCACAAATTCGACCATCTCACCCAAACCCATCTCCCCTT
CCACTCCATCGACCCCAGATCTCTCCTTCTCCACCAGAACTCCGCTGCCGATCACCTCATTTCTCTCCAGCTCACACCGGAGCCTTTTTCCATGGAAAGAGGGCCAAGGT
ACAGGGCCTATGCAGAGCTCAGAGAATCCAAACTCCGCTTGAGAAACGCCATGTACCGGGACAACGAACAGCCGGAAAAGTCCACGCCGCCGGCGAAGAAGCAAGTTAAA
TTTGTGGGTTCGGAGACTGTTCGGAAAAGGTCGGCGGCGGTGGCGCAATCGGTACCAGATTTCTCTGCAGCGCTGAGAAAGGAGAACAGAAAGCCACCGCCGGGGTTGTC
GCCGGTGATGGAGATGACGCCACCAGGGAAGACGTGGGGGAAGAACATTGGGGGATTGTCGACGAATTCGAGGGGGAGTAAGTCGGCGAGTGCAGGGGAGAAAAGGGGCG
GCGGATTGACGGCCGTGAGGAAGAGCTATGCCGGATTTGAGGAGCTGAAAGGGTTTTCTACGGCGGCGGCGAACGCCATTAATGGTGAAAATAGGAGAGGAGGAGGAAGG
AGGGGGAAGACTGTACTGGGAATTAGACAAATCTGA
Protein sequenceShow/hide protein sequence
MHLSIILYCDNNVAIENSYEPRSHKCGKHMEHKYHLIREIVQRGDVIVKQIASNHNIAYLFTKALMAKVLEGYLGLGQEAPPSQKHIRDLMIGKTGTQAFENAPNTRIRT
CQLPVKSRIFPFLSVQLAKCFCGSSPACLEKVGRAERSLNVCTSGCIIILSSSSSKNSSKSVPFALSIATLLRNRTHGFFGAPRRPKAAMLRYAFCTSPSHSINNAERST
MGRQALCLQCTNNKTELPHPSTMAHKFDHLTQTHLPFHSIDPRSLLLHQNSAADHLISLQLTPEPFSMERGPRYRAYAELRESKLRLRNAMYRDNEQPEKSTPPAKKQVK
FVGSETVRKRSAAVAQSVPDFSAALRKENRKPPPGLSPVMEMTPPGKTWGKNIGGLSTNSRGSKSASAGEKRGGGLTAVRKSYAGFEELKGFSTAAANAINGENRRGGGR
RGKTVLGIRQI