; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015399 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015399
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold586:83352..84056
RNA-Seq ExpressionMS015399
SyntenyMS015399
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-7466.39Show/hide
Query:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC
        MEIK KGKVHPSPS   PSSSSSVFKLLPAAILAL+SVLSL+EREVLAYMIARSIQSSA  ST DSRKKSA+KASI GG + +++  HKTP+FSCDCFYC
Subjt:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  + KRR RI  QA DKS+PVVQ P    D+C  V+P SPER           +GS   E GE
Subjt:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE

Query:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
        SG  ++V        DHQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

XP_022145193.1 uncharacterized protein LOC111014698 [Momordica charantia]1.3e-12299.15Show/hide
Query:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA
        MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA
Subjt:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA

Query:  YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVSTA
        YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVST 
Subjt:  YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVSTA

Query:  AAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPNT
         AAAADHQKGLATIVVPDVLRYFKSRFSSLWNPNT
Subjt:  AAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPNT

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]4.7e-7566.8Show/hide
Query:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC
        MEIK KGKVHPSPS   PSSSSSVFKLLPAAILAL+SVLSL+EREVLAYMIARSIQSSA  ST DSRKKSA+KASINGG + +++  HKTP+FSCDCFYC
Subjt:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  + KRR RI  QA DKS+PVVQ P    D+C  V+P SPER           +GS   E GE
Subjt:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE

Query:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
        SG  ++V        DHQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]7.2e-7667.62Show/hide
Query:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC
        MEIK KGKVHPSPS   PSSSSSVFKLLPAAILAL+SVLSL+EREVLAYMIARSIQSSA  ST DSRKKSA+KASINGG + +++G HKTPMFSCDCFYC
Subjt:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERD-------DGGEGSLAAEDGE
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  + KRR RI  QA DKS+PVVQ PP    +C  V+P SPER        +  +GS   E GE
Subjt:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERD-------DGGEGSLAAEDGE

Query:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
        SG  ++V        DHQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

XP_023528171.1 uncharacterized protein LOC111791162 [Cucurbita pepo subsp. pepo]1.4e-7466.94Show/hide
Query:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC
        MEIK KGKVHPSPS   PSSSSSVFKLLPAAILAL++VLSL+EREVLAYMIARSIQSSA  ST DSRKKSA+KASI+GG + +++G HKTPMFSCDCFYC
Subjt:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  + KRR RI  QA DKS+PVVQ PP   D+C  V+P SPER           +GS   E GE
Subjt:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE

Query:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWN
        SG  ++V        DHQKGLAT V+PDVL +F SR  SLW+
Subjt:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWN

TrEMBL top hitse value%identityAlignment
A0A0A0KTW2 Uncharacterized protein2.8e-6563.41Show/hide
Query:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFAS-TRDSRKKSARKASINGG-----IATITSGN---HKTPMFS
        MEIK K K+HPSP PSSSSSVFKLLPAAILAL S+LSL++REVLAYMIARSIQSSAF S TR SRKKS +K  IN G       T T+ N   HKTP+FS
Subjt:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFAS-TRDSRKKSARKASINGG-----IATITSGN---HKTPMFS

Query:  CDCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAI--DKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDG
        CDCFYCYTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  R KRR RI  Q    +K++PVV CP    D+C  V P SP  +D  EGS+  E  
Subjt:  CDCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAI--DKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDG

Query:  ESG-AAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
        ESG   EDV        +HQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  ESG-AAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

A0A5A7TRT9 Uncharacterized protein4.0e-6461.29Show/hide
Query:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSA-FASTRDSRKKSARKASINGG-------IATITSGNHKTPMFSC
        MEIK K K+HPSP PSSSSSVFKLLPAAILAL S+LSL++REVLAYMIARSIQSSA   STR SRKKS +K SIN G         T T+  HKTP+FSC
Subjt:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSA-FASTRDSRKKSARKASINGG-------IATITSGNHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAI-----DKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAE
        DCFYCYTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  R KRR RI  Q       +K++PV+ CP    D+C  V   SP  +D  EGS+  E
Subjt:  DCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAI-----DKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAE

Query:  DGESG-AAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
          E+G   EDV        +HQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  DGESG-AAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

A0A6J1CVA9 uncharacterized protein LOC1110146986.5e-12399.15Show/hide
Query:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA
        MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA
Subjt:  MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA

Query:  YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVSTA
        YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVST 
Subjt:  YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVSTA

Query:  AAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPNT
         AAAADHQKGLATIVVPDVLRYFKSRFSSLWNPNT
Subjt:  AAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPNT

A0A6J1GV32 uncharacterized protein LOC1114577372.3e-7566.8Show/hide
Query:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC
        MEIK KGKVHPSPS   PSSSSSVFKLLPAAILAL+SVLSL+EREVLAYMIARSIQSSA  ST DSRKKSA+KASINGG + +++  HKTP+FSCDCFYC
Subjt:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  + KRR RI  QA DKS+PVVQ P    D+C  V+P SPER           +GS   E GE
Subjt:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGG-------EGSLAAEDGE

Query:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
        SG  ++V        DHQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

A0A6J1IQ09 uncharacterized protein LOC1114794673.5e-7667.62Show/hide
Query:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC
        MEIK KGKVHPSPS   PSSSSSVFKLLPAAILAL+SVLSL+EREVLAYMIARSIQSSA  ST DSRKKSA+KASINGG + +++G HKTPMFSCDCFYC
Subjt:  MEIKRKGKVHPSPS---PSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERD-------DGGEGSLAAEDGE
        YTAYWCRWDSSPNR+LIH AI+AFEDHL   EKPK N  + KRR RI  QA DKS+PVVQ PP    +C  V+P SPER        +  +GS   E GE
Subjt:  YTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERD-------DGGEGSLAAEDGE

Query:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
        SG  ++V        DHQKGLAT V+PDVL +F SR  SLW+PN
Subjt:  SGAAEDVSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein1.5e-2345.45Show/hide
Query:  EIKRKGKVHPSPSP-SSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA
        ++ RKG VHPSP    S+  +  LLP AI +L +VLS E+REVLAY+I+ +  S     T    K  A K ++          NH +P+F CDCF CYT+
Subjt:  EIKRKGKVHPSPSP-SSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTA

Query:  YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHR
        YW RWDSSP+R LIH  IDAFED L   +  K N    K R +
Subjt:  YWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHR

AT1G24270.1 unknown protein1.1e-2643.98Show/hide
Query:  MEIKRKGKVHPSPS-PSSSS-------SVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSC
        M++ +KGKVHPSP  PSSSS       SVFKLL +AIL L+SVLS E+ EVLAY+I RS+ ++   S +  R                   +HK P+  C
Subjt:  MEIKRKGKVHPSPS-PSSSS-------SVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHL-----PIAEKPKNNAARAKRRHRIRNQAIDKSV
         CF CYT+YW +WDSS NR+LI+  I+AFEDHL       +   K N  RAK+      Q  +KS+
Subjt:  DCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHL-----PIAEKPKNNAARAKRRHRIRNQAIDKSV

AT1G62422.1 unknown protein3.6e-2546.32Show/hide
Query:  RKGKVHPSPSPS--SSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTAYW
        RKG VHPSP P+  +      LLP AIL+L++ LS+E+REVLAY+I+ S  S+     R SR K  ++             NH +P+F CDCF CYT+YW
Subjt:  RKGKVHPSPSPS--SSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTAYW

Query:  CRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARA
         RWD+SP R LIH  IDA+ED L + +K K+   R+
Subjt:  CRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARA

AT5G13090.1 unknown protein4.7e-3339.58Show/hide
Query:  MEIKRKGKVHPSPSP-------SSSS-----------SVFKLLPAAILALISVLSLEEREVLAYMIAR--SIQSSAFASTRDSRKKSARKASINGGIATI
        M++K+KGKV+PSP P       SSSS           SV KLLPA IL L+SVLS EEREVLAY+I R  +I     +S+++  KK + K          
Subjt:  MEIKRKGKVHPSPSP-------SSSS-----------SVFKLLPAAILALISVLSLEEREVLAYMIAR--SIQSSAFASTRDSRKKSARKASINGGIATI

Query:  TSGNHKTPMFSCDCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRH--------------------RIRNQAIDKSVPVVQCPP
        +S NHK P+F C+CF CYT YW RWDSSPNR+LIH  I+AFE+H       +N+A+R+K +                     R+ +     S PVV+ P 
Subjt:  TSGNHKTPMFSCDCFYCYTAYWCRWDSSPNRDLIHHAIDAFEDHLPIAEKPKNNAARAKRRH--------------------RIRNQAIDKSVPVVQCPP

Query:  LEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAED---------VSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN
         E     +    SP R    E +    + E    ED         V  AAA+     KGLA  V+PDVL  F S F  LWNPN
Subjt:  LEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAED---------VSTAAAAAADHQKGLATIVVPDVLRYFKSRFSSLWNPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGCGGAAAGGCAAAGTACACCCGTCACCGTCGCCATCGTCTTCCTCCTCCGTCTTCAAGCTACTTCCGGCCGCTATTTTGGCCCTAATCTCCGTCCTCTC
CCTCGAGGAACGCGAAGTTTTGGCTTACATGATTGCCAGGTCCATCCAATCCTCCGCATTCGCTTCCACTCGCGATTCCAGGAAGAAATCGGCCAGAAAAGCTTCAATCA
ATGGCGGAATCGCTACCATTACTTCCGGCAACCACAAAACTCCGATGTTCTCTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAAT
CGCGACCTCATCCACCACGCCATTGACGCCTTCGAAGATCACTTGCCGATCGCCGAGAAGCCGAAGAACAACGCCGCCCGAGCCAAGCGGCGACACAGAATCCGCAATCA
AGCCATCGACAAGTCCGTTCCAGTAGTTCAATGTCCGCCGCTGGAGCCCGACCAGTGCTCCGCCGTCCTTCCGCCTTCGCCGGAGCGCGACGACGGAGGCGAAGGAAGCC
TGGCGGCGGAGGATGGGGAGAGCGGTGCAGCGGAGGACGTGAGCACCGCCGCCGCCGCCGCCGCCGACCATCAGAAGGGTTTGGCGACGATTGTGGTGCCGGACGTGTTG
CGGTATTTCAAATCCCGTTTTAGTAGTCTGTGGAATCCGAATACG
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTAAGCGGAAAGGCAAAGTACACCCGTCACCGTCGCCATCGTCTTCCTCCTCCGTCTTCAAGCTACTTCCGGCCGCTATTTTGGCCCTAATCTCCGTCCTCTC
CCTCGAGGAACGCGAAGTTTTGGCTTACATGATTGCCAGGTCCATCCAATCCTCCGCATTCGCTTCCACTCGCGATTCCAGGAAGAAATCGGCCAGAAAAGCTTCAATCA
ATGGCGGAATCGCTACCATTACTTCCGGCAACCACAAAACTCCGATGTTCTCTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAAT
CGCGACCTCATCCACCACGCCATTGACGCCTTCGAAGATCACTTGCCGATCGCCGAGAAGCCGAAGAACAACGCCGCCCGAGCCAAGCGGCGACACAGAATCCGCAATCA
AGCCATCGACAAGTCCGTTCCAGTAGTTCAATGTCCGCCGCTGGAGCCCGACCAGTGCTCCGCCGTCCTTCCGCCTTCGCCGGAGCGCGACGACGGAGGCGAAGGAAGCC
TGGCGGCGGAGGATGGGGAGAGCGGTGCAGCGGAGGACGTGAGCACCGCCGCCGCCGCCGCCGCCGACCATCAGAAGGGTTTGGCGACGATTGTGGTGCCGGACGTGTTG
CGGTATTTCAAATCCCGTTTTAGTAGTCTGTGGAATCCGAATACG
Protein sequenceShow/hide protein sequence
MEIKRKGKVHPSPSPSSSSSVFKLLPAAILALISVLSLEEREVLAYMIARSIQSSAFASTRDSRKKSARKASINGGIATITSGNHKTPMFSCDCFYCYTAYWCRWDSSPN
RDLIHHAIDAFEDHLPIAEKPKNNAARAKRRHRIRNQAIDKSVPVVQCPPLEPDQCSAVLPPSPERDDGGEGSLAAEDGESGAAEDVSTAAAAAADHQKGLATIVVPDVL
RYFKSRFSSLWNPNT