; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g05610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g05610
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionATP-dependent DNA helicase
Genome locationchr6:4065673..4069214
RNA-Seq ExpressionMoc06g05610
SyntenyMoc06g05610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443832.1 PREDICTED: uncharacterized protein LOC103487331 isoform X3 [Cucumis melo]1.6e-21172.27Show/hide
Query:  NSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ----------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFD
        +S++ L  +KS VKR+CDDSF   H + DKRFKPD H+                NNP DEPSPLGL LRKSPSLLDLIQMKLS GS+ST AG+SN+E FD
Subjt:  NSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ----------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFD

Query:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF
        F+VK E +DAT+PGT EKLKASNFPASLLKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGP  LNVVL+R PLF
Subjt:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP
        FRETNPQPRKHTLWQATADFTDGEAS+ RQHF+QCP GLLNKHFEKL+QCDS LNFLS+QP IVLGSPYFEP+AS FTTLEQAS  GLE+A +  +  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP

Query:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS
         FQ V S + A+SL IEQ SPQMV EP T+E PSPSSVMD HEIEEN S+KVT +PRNWE IKVPGLHPSMSMSDLVNHIGHHITEQMAST  PFVD GS
Subjt:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS

Query:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS
         EYQ MLD+IA YLL+DNQ SA SDEV LMSRV+SLCCLL KEP  VQS QT+ ++    PN+K+D QLK  AEL+D KN+  HI  ++ P ++  +  S
Subjt:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS

Query:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD
             +MSRKDS+G+LLLHLP+IASLPKFLF+ISDGDEGQD
Subjt:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD

XP_022135007.1 uncharacterized protein LOC111007116 isoform X1 [Momordica charantia]2.1e-30198.28Show/hide
Query:  MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPG
        MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPG
Subjt:  MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPG

Query:  TTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLW
        TTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLW
Subjt:  TTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLW

Query:  QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKI
        QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKI
Subjt:  QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKI

Query:  EQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLND
        EQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLND
Subjt:  EQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLND

Query:  NQSSAASDEVKLMSRVDSLCCLLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLL
        NQSSAASDEVKL SRVDSLCCLLKEPAAVQSFQTSSKSCVG P+HKED QLKNAAEL+DDKNMGGHIKVHLHP EETKDVSVSKQTPAA+SRKDSFGDLL
Subjt:  NQSSAASDEVKLMSRVDSLCCLLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLL

Query:  LHLPQIASLPKFLFNISDGDEGQD
        LHLP+I SLPKFLFNISDGDEGQD
Subjt:  LHLPQIASLPKFLFNISDGDEGQD

XP_022135008.1 uncharacterized protein LOC111007116 isoform X2 [Momordica charantia]5.3e-22897.77Show/hide
Query:  YKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQC
        YKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQC
Subjt:  YKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQC

Query:  PQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS
        PQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS
Subjt:  PQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS

Query:  VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCC
        VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKL SRVDSLCC
Subjt:  VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCC

Query:  LLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDE
        LLKEPAAVQSFQTSSKSCVG P+HKED QLKNAAEL+DDKNMGGHIKVHLHP EETKDVSVSKQTPAA+SRKDSFGDLLLHLP+I SLPKFLFNISDGDE
Subjt:  LLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDE

Query:  GQD
        GQD
Subjt:  GQD

XP_038879515.1 uncharacterized protein LOC120071357 isoform X1 [Benincasa hispida]5.5e-21773.29Show/hide
Query:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ--------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST--AGASNSENFD
        ++S M L PEK  VKR+ DDSF   H +LDKRFKPDFH+              NNP DEPSPLGL LRKSPSLLDLIQMKLS GSAST  AG SN+E F+
Subjt:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ--------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST--AGASNSENFD

Query:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF
        F+VK E +DAT+PGT EKLKASNFPASLL+IGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMAL ANCPDDGP  LNVVL+R PLF
Subjt:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP
        FRETNPQPRKHTLWQATADFTDGEAS++RQHFLQCP GLLNKHFEKLIQCDS LNFLS+QPEIVLGSPYFEP AS FTTL+QAS HGLE+AE+  +  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP

Query:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS-------------VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQ
        TFQ V S S  SSLKIE+ SPQMV EP T+E PSPSS             VMD HEIEENRSTKVT +PRNWEQIKVPG+HPS+SMSDLVNHIGHHITEQ
Subjt:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS-------------VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQ

Query:  MASTNLPFVDGGS-EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKV
        MAST  PFVD GS EYQ+MLD+IA YLL+DNQ SAASDEV LMSRVDSLCCLL KEP  VQS QT+ ++CV   N+++D      A L+D KN+  HIK+
Subjt:  MASTNLPFVDGGS-EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKV

Query:  HLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD
          HPE T++VS S Q  +AMSRKDS+GDLLLHLP+IASLPK LF+ISDGDEGQD
Subjt:  HLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD

XP_038879516.1 uncharacterized protein LOC120071357 isoform X2 [Benincasa hispida]9.1e-22075.05Show/hide
Query:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ--------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST--AGASNSENFD
        ++S M L PEK  VKR+ DDSF   H +LDKRFKPDFH+              NNP DEPSPLGL LRKSPSLLDLIQMKLS GSAST  AG SN+E F+
Subjt:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ--------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST--AGASNSENFD

Query:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF
        F+VK E +DAT+PGT EKLKASNFPASLL+IGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMAL ANCPDDGP  LNVVL+R PLF
Subjt:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP
        FRETNPQPRKHTLWQATADFTDGEAS++RQHFLQCP GLLNKHFEKLIQCDS LNFLS+QPEIVLGSPYFEP AS FTTL+QAS HGLE+AE+  +  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP

Query:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS
        TFQ V S S  SSLKIE+ SPQMV EP T+E PSPSSVMD HEIEENRSTKVT +PRNWEQIKVPG+HPS+SMSDLVNHIGHHITEQMAST  PFVD GS
Subjt:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS

Query:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS
         EYQ+MLD+IA YLL+DNQ SAASDEV LMSRVDSLCCLL KEP  VQS QT+ ++CV   N+++D      A L+D KN+  HIK+  HPE T++VS S
Subjt:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS

Query:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD
         Q  +AMSRKDS+GDLLLHLP+IASLPK LF+ISDGDEGQD
Subjt:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD

TrEMBL top hitse value%identityAlignment
A0A0A0LTD1 Uncharacterized protein4.1e-21072.27Show/hide
Query:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ---------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFD
        ++S++ L P+KS VKR+CDDSF   H + DKRFKPDFH+               NNP DEPSPLGL LRKSPSLLDLIQMKLS GS++T AG SN+E FD
Subjt:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ---------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFD

Query:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF
        F+VK E +DAT+PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGP  LNVVL+R PLF
Subjt:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP
        FRETNPQPRKHTLWQATADFTDGEAS++R+H LQCP GLLNKHFEKLIQCDS LNFLS+QPEIVLGSPYFEP+AS F TLEQAS  G E+A +  +  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP

Query:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS
         FQ V S + A+SL IEQ SPQMV EP T++ PSPSSVMD  EIEENRSTKVT +P NWEQIKVPGLHPSMSMSDLVNHIGHHITEQMAST  PFVD GS
Subjt:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS

Query:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS
         EYQ MLD+IA YLL+DNQ SA SDEV LMSRV+SLCCLL KEP  VQS QT+ ++  G PN+K+D QLK+ AEL+D KN+    + H+ P ++  +  S
Subjt:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS

Query:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD
             AMSRKDS+G+LLLHLP+IASLPKFLF+ISDGDEGQD
Subjt:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD

A0A1S3B908 uncharacterized protein LOC103487331 isoform X37.5e-21272.27Show/hide
Query:  NSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ----------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFD
        +S++ L  +KS VKR+CDDSF   H + DKRFKPD H+                NNP DEPSPLGL LRKSPSLLDLIQMKLS GS+ST AG+SN+E FD
Subjt:  NSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ----------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFD

Query:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF
        F+VK E +DAT+PGT EKLKASNFPASLLKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGP  LNVVL+R PLF
Subjt:  FLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP
        FRETNPQPRKHTLWQATADFTDGEAS+ RQHF+QCP GLLNKHFEKL+QCDS LNFLS+QP IVLGSPYFEP+AS FTTLEQAS  GLE+A +  +  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR--LP

Query:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS
         FQ V S + A+SL IEQ SPQMV EP T+E PSPSSVMD HEIEEN S+KVT +PRNWE IKVPGLHPSMSMSDLVNHIGHHITEQMAST  PFVD GS
Subjt:  TFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS

Query:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS
         EYQ MLD+IA YLL+DNQ SA SDEV LMSRV+SLCCLL KEP  VQS QT+ ++    PN+K+D QLK  AEL+D KN+  HI  ++ P ++  +  S
Subjt:  -EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVS

Query:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD
             +MSRKDS+G+LLLHLP+IASLPKFLF+ISDGDEGQD
Subjt:  KQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD

A0A6J1C1F5 uncharacterized protein LOC111007116 isoform X11.0e-30198.28Show/hide
Query:  MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPG
        MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPG
Subjt:  MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPG

Query:  TTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLW
        TTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLW
Subjt:  TTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLW

Query:  QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKI
        QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKI
Subjt:  QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKI

Query:  EQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLND
        EQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLND
Subjt:  EQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLND

Query:  NQSSAASDEVKLMSRVDSLCCLLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLL
        NQSSAASDEVKL SRVDSLCCLLKEPAAVQSFQTSSKSCVG P+HKED QLKNAAEL+DDKNMGGHIKVHLHP EETKDVSVSKQTPAA+SRKDSFGDLL
Subjt:  NQSSAASDEVKLMSRVDSLCCLLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLL

Query:  LHLPQIASLPKFLFNISDGDEGQD
        LHLP+I SLPKFLFNISDGDEGQD
Subjt:  LHLPQIASLPKFLFNISDGDEGQD

A0A6J1C3L2 uncharacterized protein LOC111007116 isoform X22.6e-22897.77Show/hide
Query:  YKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQC
        YKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQC
Subjt:  YKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQC

Query:  PQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS
        PQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS
Subjt:  PQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSS

Query:  VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCC
        VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKL SRVDSLCC
Subjt:  VMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCC

Query:  LLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDE
        LLKEPAAVQSFQTSSKSCVG P+HKED QLKNAAEL+DDKNMGGHIKVHLHP EETKDVSVSKQTPAA+SRKDSFGDLLLHLP+I SLPKFLFNISDGDE
Subjt:  LLKEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHP-EETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDE

Query:  GQD
        GQD
Subjt:  GQD

A0A6J1JMN7 uncharacterized protein LOC111485770 isoform X29.2e-21072.73Show/hide
Query:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ--------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFDF
        VNS M L PEK A KR+C D F+E H  LDKR KPDFH+              NNP DEPSPLGL LRKSPSLLDLIQMKLS  SAST  GASNSE F+F
Subjt:  VNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQ--------------NNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSAST-AGASNSENFDF

Query:  LVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFF
        +VKK+ ++AT+PG  EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKAN PDDGP  LNVVL+R PLFF
Subjt:  LVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFF

Query:  RETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQ
        RETNPQPRKHTLWQA ADFTDGEAS++RQHFLQCP G+LNKHFEKLIQCDSHLNFLS+QPEIVLGSPYFE  AS FTT+EQA N      +    L TFQ
Subjt:  RETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQ

Query:  GVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS-EY
         V S SV SSL+IEQ+SPQMV +P T+EVPSPSSVMD HEIEENRST+V+ +PRNWEQIKV GLHPSMSM DLV+HIGHHITEQMAST  PFVD GS EY
Subjt:  GVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGS-EY

Query:  QNMLDEIALYLLNDNQ-SSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQ
        Q+ML+EIA YLL+D+Q SSAA  EV +MSRV+SLCCLL KEPAAVQS QTS ++CV   NH+E+ +LK+A E +D  + G HI  ++HPE  K+V  S Q
Subjt:  QNMLDEIALYLLNDNQ-SSAASDEVKLMSRVDSLCCLL-KEPAAVQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQ

Query:  TPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD
          +AMSRKDSFGDL LHLP+IASLPKF F+IS+GDEGQD
Subjt:  TPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein8.7e-2743.67Show/hide
Query:  NFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPGTLNVVLSRWPLFFRETNPQPRKHTLW
        NFP S ++IG W   ++   D+VAK Y+AK KL+WE L G        LK KIEIQW+D+ + + +    D  G L + L + P FF ETNPQ  KHT W
Subjt:  NFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPGTLNVVLSRWPLFFRETNPQPRKHTLW

Query:  -QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFE
         Q   DFT   AS  R+H L  P G+L K+ EKL+  DS  + L + P  V  S YF+
Subjt:  -QATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFE

AT2G24100.1 unknown protein1.6e-11347.83Show/hide
Query:  DDSFEEGHGRLDKRFK-----PDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPGTTEKLKASNFPASLL
        +DS EE H  L+KR K         + +  +EPSPLGL+L+KSPS  +LI+MKLS     +   SNS      VKKE       GT EKLKASNFPA++L
Subjt:  DDSFEEGHGRLDKRFK-----PDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPGTTEKLKASNFPASLL

Query:  KIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRR
        +IG+WEYKSRYEGDLVAKCY+AKHKLVWE+LE GLKSKIEIQWSDIMALKAN P+D PGTL +VL+R PLFFRETNPQPRKHTLWQAT+DFTDG+ASM R
Subjt:  KIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRR

Query:  QHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLE
        QHFLQCP G++NKHFEKL+QCD  L  LS+QPEI L +P+F+ + S+F     + +H                    ++AS +  + +S  + L    L 
Subjt:  QHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLE

Query:  VPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSR
          SPSSVMD   IE    +  +     W QIK+PGLH S+SM+D +      +++Q    N  F           +E+   LL+DN  +  SDE  +MS+
Subjt:  VPSPSSVMDTHEIEENRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSR

Query:  VDSLCCLLKEPAAVQ-SFQTS-SKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLF
        V+S C LL+  A  Q + +T+ ++  VG  N++                 GG        +   D + S +    MSRKDSF DLL+HLP+I SLPKFLF
Subjt:  VDSLCCLLKEPAAVQ-SFQTS-SKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLF

Query:  NISDGD
        NIS+ D
Subjt:  NISDGD

AT3G05770.1 unknown protein3.6e-3340.35Show/hide
Query:  HQNNPFDEPSPLGLTLRKSPSLLDLIQMKLS-HGSASTAGASNSENFDFLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAK
        + N   DE   L L L K+P L++ I+  L  H +       NS           K +T+P + EKLKA NFP S +KIG   + ++   D+VAK Y+AK
Subjt:  HQNNPFDEPSPLGLTLRKSPSLLDLIQMKLS-HGSASTAGASNSENFDFLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAK

Query:  HKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPGTLNVVLSRWPLFFRETNPQPRKHTLW-QATADFTDGEASMRRQHFLQCPQGLLNKH
         KL+WE L G        LKSKIEIQW+D+ + + +    D  G L + L + P FF ETNPQ  KHT W Q   DFT  +AS  R+H L  P G+L K+
Subjt:  HKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPGTLNVVLSRWPLFFRETNPQPRKHTLW-QATADFTDGEASMRRQHFLQCPQGLLNKH

Query:  FEKLIQCDSHLNFLSQQPEIVLGSPYFE
         EKL+  DS  + L + P  V  S YF+
Subjt:  FEKLIQCDSHLNFLSQQPEIVLGSPYFE

AT4G30780.1 unknown protein5.8e-11644.84Show/hide
Query:  NSNMALCPEKSAVKRD-CDDSFEEGHGRLDKR---FKPDFHQN-------NPFDEPSPLGLTLRKSPSLLDLIQMKLSH------GSASTAGASNS----
        + + A C ++  VK +  +D  EE HG L+KR   + P    +       NP DEPSPLGL+L+KSPSLL+LIQMK++H           AGA  S    
Subjt:  NSNMALCPEKSAVKRD-CDDSFEEGHGRLDKR---FKPDFHQN-------NPFDEPSPLGLTLRKSPSLLDLIQMKLSH------GSASTAGASNS----

Query:  ENFDFLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSR
        E+        +     PG+ EKLKASNFPASLLKIG+WEYKSRYEGDLVAKCY+AKHKLVWE+LE GLKSKIEIQWSDIMALKANCP+DGPGTL +VL+R
Subjt:  ENFDFLVKKEIKDATMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSR

Query:  WPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR
         PLFFRETNPQPRKHTLWQAT+DFTDG+ASM RQHFLQC QG++NKHFEKL+QCD  L  LS+QPEI + SPYF+ + S+F    ++  H        + 
Subjt:  WPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQCPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIR

Query:  LPTFQGVA-SPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIE----------ENRS-------------------TKVTCEPRNWEQIK----V
        L T   ++ + ++AS +  + +S  M L   + E PSPSSV+D    E           NR+                     V C+ +N   +     V
Subjt:  LPTFQGVA-SPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIE----------ENRS-------------------TKVTCEPRNWEQIK----V

Query:  PGLHPSMSMSDLVNHI--GHHITE----------QMASTN-----LPFVDGGS--EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAA
         GLH SMS+SD V ++    +IT+          Q  S +     L    GGS  E+    + +   LL+DN    A DE  LM RV+SL  LL K+P  
Subjt:  PGLHPSMSMSDLVNHI--GHHITE----------QMASTN-----LPFVDGGS--EYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLL-KEPAA

Query:  VQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGD
          + Q +++  VG  +  +         + D+ N   +    L        + S   P  M RKDSF DLLLHLP+I SLPKFL NIS+ D
Subjt:  VQSFQTSSKSCVGAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGTGAATTCTAACATGGCGTTGTGCCCCGAAAAATCGGCGGTGAAGCGGGATTGTGATGATTCTTTTGAAGAAGGGCACGGTCGCTTGGACAAGCGATTCAAGCC
GGATTTTCATCAGAACAATCCATTTGATGAACCTAGTCCTCTGGGTTTGACGCTAAGGAAGAGCCCGTCCCTGTTGGATCTAATTCAAATGAAACTCTCACATGGGAGTG
CGTCCACAGCTGGAGCTTCAAACAGTGAAAATTTTGACTTTCTGGTTAAAAAAGAAATCAAGGACGCTACCATGCCGGGTACCACTGAGAAACTAAAGGCTTCAAACTTC
CCAGCTTCACTTTTAAAGATTGGACGTTGGGAGTACAAATCAAGGTATGAAGGTGATTTAGTGGCAAAGTGTTACTATGCTAAGCATAAGCTTGTCTGGGAAATCCTCGA
GGGTGGACTCAAAAGTAAAATAGAAATACAATGGTCAGATATTATGGCTTTAAAAGCAAATTGTCCTGATGATGGACCTGGTACACTAAATGTTGTGCTGTCTAGATGGC
CTCTTTTCTTCAGGGAGACCAACCCTCAACCAAGGAAGCACACTTTGTGGCAGGCAACAGCTGACTTTACGGATGGTGAGGCCAGCATGCGGAGGCAGCATTTTCTGCAA
TGTCCACAAGGGCTGCTAAACAAGCATTTTGAAAAGCTCATCCAATGTGACTCGCACCTTAACTTCTTAAGCCAACAGCCGGAAATTGTGTTAGGTTCTCCATATTTTGA
ACCAAAAGCCTCTGTTTTTACTACCTTGGAGCAAGCCAGCAACCATGGTTTGGAGCGAGCAGAGTCTGTAATTCGACTTCCTACTTTCCAGGGTGTTGCTTCACCATCTG
TTGCATCATCATTGAAGATAGAACAGACTTCTCCCCAGATGGTGCTCGAACCGTGCACCTTGGAGGTTCCTTCCCCTAGTTCAGTGATGGATACACATGAGATTGAAGAG
AATAGAAGTACGAAAGTTACCTGCGAGCCAAGAAACTGGGAACAGATTAAGGTTCCAGGGCTCCATCCATCTATGTCGATGAGTGACCTCGTAAACCATATCGGACATCA
TATTACAGAACAAATGGCTTCCACAAATTTGCCGTTTGTTGATGGTGGTTCGGAATATCAAAACATGCTGGATGAAATTGCGCTTTACTTGCTCAACGACAATCAATCAT
CAGCAGCTTCTGATGAGGTAAAACTTATGTCGAGAGTCGATTCTCTCTGTTGCCTTTTGAAAGAACCTGCTGCAGTTCAAAGTTTTCAAACCAGTAGCAAAAGCTGTGTC
GGAGCACCCAACCACAAAGAAGATTTTCAGCTTAAGAATGCTGCGGAGTTGAAGGATGACAAGAATATGGGAGGTCATATTAAGGTTCATCTTCACCCTGAGGAAACAAA
GGATGTCTCCGTAAGCAAACAAACACCAGCAGCTATGTCAAGGAAAGATTCATTTGGGGATTTGCTGTTGCATCTTCCTCAAATTGCGTCGCTCCCAAAGTTCTTGTTTA
ACATTTCAGATGGTGATGAAGGTCAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGTGAATTCTAACATGGCGTTGTGCCCCGAAAAATCGGCGGTGAAGCGGGATTGTGATGATTCTTTTGAAGAAGGGCACGGTCGCTTGGACAAGCGATTCAAGCC
GGATTTTCATCAGAACAATCCATTTGATGAACCTAGTCCTCTGGGTTTGACGCTAAGGAAGAGCCCGTCCCTGTTGGATCTAATTCAAATGAAACTCTCACATGGGAGTG
CGTCCACAGCTGGAGCTTCAAACAGTGAAAATTTTGACTTTCTGGTTAAAAAAGAAATCAAGGACGCTACCATGCCGGGTACCACTGAGAAACTAAAGGCTTCAAACTTC
CCAGCTTCACTTTTAAAGATTGGACGTTGGGAGTACAAATCAAGGTATGAAGGTGATTTAGTGGCAAAGTGTTACTATGCTAAGCATAAGCTTGTCTGGGAAATCCTCGA
GGGTGGACTCAAAAGTAAAATAGAAATACAATGGTCAGATATTATGGCTTTAAAAGCAAATTGTCCTGATGATGGACCTGGTACACTAAATGTTGTGCTGTCTAGATGGC
CTCTTTTCTTCAGGGAGACCAACCCTCAACCAAGGAAGCACACTTTGTGGCAGGCAACAGCTGACTTTACGGATGGTGAGGCCAGCATGCGGAGGCAGCATTTTCTGCAA
TGTCCACAAGGGCTGCTAAACAAGCATTTTGAAAAGCTCATCCAATGTGACTCGCACCTTAACTTCTTAAGCCAACAGCCGGAAATTGTGTTAGGTTCTCCATATTTTGA
ACCAAAAGCCTCTGTTTTTACTACCTTGGAGCAAGCCAGCAACCATGGTTTGGAGCGAGCAGAGTCTGTAATTCGACTTCCTACTTTCCAGGGTGTTGCTTCACCATCTG
TTGCATCATCATTGAAGATAGAACAGACTTCTCCCCAGATGGTGCTCGAACCGTGCACCTTGGAGGTTCCTTCCCCTAGTTCAGTGATGGATACACATGAGATTGAAGAG
AATAGAAGTACGAAAGTTACCTGCGAGCCAAGAAACTGGGAACAGATTAAGGTTCCAGGGCTCCATCCATCTATGTCGATGAGTGACCTCGTAAACCATATCGGACATCA
TATTACAGAACAAATGGCTTCCACAAATTTGCCGTTTGTTGATGGTGGTTCGGAATATCAAAACATGCTGGATGAAATTGCGCTTTACTTGCTCAACGACAATCAATCAT
CAGCAGCTTCTGATGAGGTAAAACTTATGTCGAGAGTCGATTCTCTCTGTTGCCTTTTGAAAGAACCTGCTGCAGTTCAAAGTTTTCAAACCAGTAGCAAAAGCTGTGTC
GGAGCACCCAACCACAAAGAAGATTTTCAGCTTAAGAATGCTGCGGAGTTGAAGGATGACAAGAATATGGGAGGTCATATTAAGGTTCATCTTCACCCTGAGGAAACAAA
GGATGTCTCCGTAAGCAAACAAACACCAGCAGCTATGTCAAGGAAAGATTCATTTGGGGATTTGCTGTTGCATCTTCCTCAAATTGCGTCGCTCCCAAAGTTCTTGTTTA
ACATTTCAGATGGTGATGAAGGTCAAGATTAA
Protein sequenceShow/hide protein sequence
MVVNSNMALCPEKSAVKRDCDDSFEEGHGRLDKRFKPDFHQNNPFDEPSPLGLTLRKSPSLLDLIQMKLSHGSASTAGASNSENFDFLVKKEIKDATMPGTTEKLKASNF
PASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPGTLNVVLSRWPLFFRETNPQPRKHTLWQATADFTDGEASMRRQHFLQ
CPQGLLNKHFEKLIQCDSHLNFLSQQPEIVLGSPYFEPKASVFTTLEQASNHGLERAESVIRLPTFQGVASPSVASSLKIEQTSPQMVLEPCTLEVPSPSSVMDTHEIEE
NRSTKVTCEPRNWEQIKVPGLHPSMSMSDLVNHIGHHITEQMASTNLPFVDGGSEYQNMLDEIALYLLNDNQSSAASDEVKLMSRVDSLCCLLKEPAAVQSFQTSSKSCV
GAPNHKEDFQLKNAAELKDDKNMGGHIKVHLHPEETKDVSVSKQTPAAMSRKDSFGDLLLHLPQIASLPKFLFNISDGDEGQD