; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:13968135..13970009
RNA-Seq ExpressionMoc08g18440
SyntenyMoc08g18440
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042614.1 Ulp1-like peptidase [Cucumis melo var. makuwa]5.2e-2827.27Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR              + ++ +F         
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------

Query:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
                   R  + + L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE
         TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++     V  LN+  VPR   +SC   +   +L+  +F+S  +V+   L  
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE

Query:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS--RMVEEAGANT
         DAE+ +R++    R       A    S            D  G  + HD  ++ + +  +  L GP+ +         H   D+ N+  R + + G + 
Subjt:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS--RMVEEAGANT

Query:  YTGAPVRPELDAERLTMSGDMNIENTDGV
               PE+  +R    G    E+   V
Subjt:  YTGAPVRPELDAERLTMSGDMNIENTDGV

KGN49944.1 hypothetical protein Csa_000148 [Cucumis sativus]4.2e-3028.64Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR                             +
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S

Query:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
         L PS  +  PR      L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY
        +TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++    V  LN+  VPRI  +SC   +   +L+  VF+S  +V+   L   
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY

Query:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG
        DAE+ +R+S    R       A    S            D  G+ + HD  ++ + +  +  L GP+ +         H+  D+ N+   E    +    
Subjt:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG

Query:  APVRPELDAERLTMSGDMNIENTDGV
            PE+  +R    G  + E+   V
Subjt:  APVRPELDAERLTMSGDMNIENTDGV

XP_011654656.1 uncharacterized protein LOC105435430 isoform X1 [Cucumis sativus]4.2e-3028.64Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR                             +
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S

Query:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
         L PS  +  PR      L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY
        +TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++    V  LN+  VPRI  +SC   +   +L+  VF+S  +V+   L   
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY

Query:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG
        DAE+ +R+S    R       A    S            D  G+ + HD  ++ + +  +  L GP+ +         H+  D+ N+   E    +    
Subjt:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG

Query:  APVRPELDAERLTMSGDMNIENTDGV
            PE+  +R    G  + E+   V
Subjt:  APVRPELDAERLTMSGDMNIENTDGV

XP_022157199.1 uncharacterized protein LOC111023969 [Momordica charantia]3.1e-5740.69Show/hide
Query:  MAHAFKIAEGDRFPAQATSLSHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLRSHL--------------VPSF----------
        M H  K++E DRFPAQ TSLSHLS  NK I QKLTP QL+MFRK T+FGRFVDLDMMFCSALVHY LLR  +              + +F          
Subjt:  MAHAFKIAEGDRFPAQATSLSHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLRSHL--------------VPSF----------

Query:  ----------RRRSPRELRDRYFKDEKGDISLVDFERIYKATVFDNDE---------------------------------------------------T
                  ++ S   LR  YFKD   D+ L +FE+ YK  VF ND+                                                   T
Subjt:  ----------RRRSPRELRDRYFKDEKGDISLVDFERIYKATVFDNDE---------------------------------------------------T

Query:  LNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEYDA
        L GLQSAMKDKV +YK+K      F+V+YSLTGFP AFQ WAYEI+P+L R+GV+ L+D  +PRIF YSC + +T  VLE  VF+S EL +  PL E +A
Subjt:  LNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEYDA

Query:  ERLYRESTFAVRVGSYGTTAAATASTRTDGVGNTELHDGSQSVDDVEFAKLLG-PYTQLIPIPEGTVHHPSDVHNS
        ERLYRE+ F  R      T     S   D   + E  D     +D +F + +G P+        G   H  DV NS
Subjt:  ERLYRESTFAVRVGSYGTTAAATASTRTDGVGNTELHDGSQSVDDVEFAKLLG-PYTQLIPIPEGTVHHPSDVHNS

XP_031741885.1 uncharacterized protein LOC105435430 isoform X2 [Cucumis sativus]4.2e-3028.64Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR                             +
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S

Query:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
         L PS  +  PR      L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY
        +TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++    V  LN+  VPRI  +SC   +   +L+  VF+S  +V+   L   
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY

Query:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG
        DAE+ +R+S    R       A    S            D  G+ + HD  ++ + +  +  L GP+ +         H+  D+ N+   E    +    
Subjt:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG

Query:  APVRPELDAERLTMSGDMNIENTDGV
            PE+  +R    G  + E+   V
Subjt:  APVRPELDAERLTMSGDMNIENTDGV

TrEMBL top hitse value%identityAlignment
A0A0A0KM59 DUF1985 domain-containing protein2.0e-3028.64Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR                             +
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR-----------------------------S

Query:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
         L PS  +  PR      L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  HLVPSFRRRSPR-----ELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY
        +TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++    V  LN+  VPRI  +SC   +   +L+  VF+S  +V+   L   
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEY

Query:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG
        DAE+ +R+S    R       A    S            D  G+ + HD  ++ + +  +  L GP+ +         H+  D+ N+   E    +    
Subjt:  DAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTG

Query:  APVRPELDAERLTMSGDMNIENTDGV
            PE+  +R    G  + E+   V
Subjt:  APVRPELDAERLTMSGDMNIENTDGV

A0A1S3ATU8 uncharacterized protein LOC103482899 isoform X15.6e-2828.09Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR              + ++ +F         
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------

Query:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
                   R  + + L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE
         TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++     V  LN+  VPR   +SC   +   +L+  +F+S  +V+   L  
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE

Query:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS
         DAE+ +R++    R       A    S            D  G  + HD  ++ + +  +  L GP+ +         H   D+ N+
Subjt:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS

A0A1S3AUB0 uncharacterized protein LOC103482899 isoform X25.6e-2828.09Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR              + ++ +F         
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------

Query:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
                   R  + + L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE
         TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++     V  LN+  VPR   +SC   +   +L+  +F+S  +V+   L  
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE

Query:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS
         DAE+ +R++    R       A    S            D  G  + HD  ++ + +  +  L GP+ +         H   D+ N+
Subjt:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS

A0A5A7TGU0 Ulp1-like peptidase2.5e-2827.27Show/hide
Query:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------
        M    KI+  +RFP QA+S+ SH+   NK +++KLTP+QL++F++ TVFGRFVD+D++F S LVH+ILLR              + ++ +F         
Subjt:  MAHAFKIAEGDRFPAQATSL-SHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLR--------------SHLVPSF---------

Query:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------
                   R  + + L  +YF +E   D++ + FE  YK   F +D                                                   
Subjt:  -----------RRRSPRELRDRYFKDE-KGDISLVDFERIYKATVFDND---------------------------------------------------

Query:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE
         TL GLQ+A+K+KV  YK K     ++ VKYSL GFP AFQ WAYEI+ ++     V  LN+  VPR   +SC   +   +L+  +F+S  +V+   L  
Subjt:  ETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRR-HGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAE

Query:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS--RMVEEAGANT
         DAE+ +R++    R       A    S            D  G  + HD  ++ + +  +  L GP+ +         H   D+ N+  R + + G + 
Subjt:  YDAERLYRESTFAVRVGSYGTTAAATAST---------RTDGVGNTELHD-GSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNS--RMVEEAGANT

Query:  YTGAPVRPELDAERLTMSGDMNIENTDGV
               PE+  +R    G    E+   V
Subjt:  YTGAPVRPELDAERLTMSGDMNIENTDGV

A0A6J1DSS5 uncharacterized protein LOC1110239691.5e-5740.69Show/hide
Query:  MAHAFKIAEGDRFPAQATSLSHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLRSHL--------------VPSF----------
        M H  K++E DRFPAQ TSLSHLS  NK I QKLTP QL+MFRK T+FGRFVDLDMMFCSALVHY LLR  +              + +F          
Subjt:  MAHAFKIAEGDRFPAQATSLSHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLRSHL--------------VPSF----------

Query:  ----------RRRSPRELRDRYFKDEKGDISLVDFERIYKATVFDNDE---------------------------------------------------T
                  ++ S   LR  YFKD   D+ L +FE+ YK  VF ND+                                                   T
Subjt:  ----------RRRSPRELRDRYFKDEKGDISLVDFERIYKATVFDNDE---------------------------------------------------T

Query:  LNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEYDA
        L GLQSAMKDKV +YK+K      F+V+YSLTGFP AFQ WAYEI+P+L R+GV+ L+D  +PRIF YSC + +T  VLE  VF+S EL +  PL E +A
Subjt:  LNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEYDA

Query:  ERLYRESTFAVRVGSYGTTAAATASTRTDGVGNTELHDGSQSVDDVEFAKLLG-PYTQLIPIPEGTVHHPSDVHNS
        ERLYRE+ F  R      T     S   D   + E  D     +D +F + +G P+        G   H  DV NS
Subjt:  ERLYRESTFAVRVGSYGTTAAATASTRTDGVGNTELHDGSQSVDDVEFAKLLG-PYTQLIPIPEGTVHHPSDVHNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCATGCATTTAAGATTGCTGAGGGGGATAGATTTCCAGCCCAAGCTACTAGCTTGTCTCACCTGAGCAATGTGAACAAGTATATCAGGCAGAAGCTGACTCCTGA
ACAGTTGGAAATGTTCAGGAAAGGAACAGTATTTGGCCGATTTGTCGACCTTGACATGATGTTCTGCAGTGCATTGGTTCATTACATTCTGTTAAGGAGCCATCTGGTAC
CGTCGTTCAGAAGAAGGTCTCCAAGAGAACTGCGCGACCGATATTTCAAGGACGAGAAGGGCGACATATCTCTCGTTGATTTCGAACGAATTTACAAGGCGACTGTATTC
GACAATGACGAGACACTTAATGGCCTCCAGAGTGCGATGAAAGACAAGGTGTCACACTACAAGTCAAAGTCTGGTGCCAAAAGCAGTTTCAAGGTGAAGTATAGTTTGAC
GGGTTTTCCCCCAGCATTCCAAGCTTGGGCATACGAGATCGTACCAACTCTTCGCAGACACGGCGTTGATGAGCTAAATGATATAGTAGTGCCTCGCATATTTCATTACT
CGTGTCTAAAAATCGTCACCAAAGCTGTTCTCGAATGCGGGGTGTTCGATTCATTTGAGTTGGTTGTTATCCAGCCTCTAGCAGAGTATGATGCAGAAAGGCTGTATCGA
GAGTCGACCTTCGCAGTTAGGGTTGGTAGCTATGGCACTACTGCAGCTGCTACCGCATCAACCCGCACCGATGGAGTTGGAAATACTGAGCTCCACGACGGTTCACAGTC
TGTTGATGATGTTGAGTTCGCCAAGTTACTGGGGCCGTACACTCAATTGATTCCCATACCTGAAGGGACAGTACACCACCCATCAGACGTACACAATTCAAGAATGGTGG
AAGAAGCTGGTGCAAACACGTACACTGGTGCACCTGTTCGTCCTGAACTCGATGCTGAGAGGTTGACCATGTCGGGTGACATGAACATTGAGAACACTGATGGTGTTCGA
GAGAATCAGGACATGCAAGGCGCTGAAGAGTTTGATAGGGGCGAAGACGGACTAGGATTGTCAGTGGAGAAAGACAGATTGGAGATGTGCTTGACGTTGATGTCAGTGGA
GTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCATGCATTTAAGATTGCTGAGGGGGATAGATTTCCAGCCCAAGCTACTAGCTTGTCTCACCTGAGCAATGTGAACAAGTATATCAGGCAGAAGCTGACTCCTGA
ACAGTTGGAAATGTTCAGGAAAGGAACAGTATTTGGCCGATTTGTCGACCTTGACATGATGTTCTGCAGTGCATTGGTTCATTACATTCTGTTAAGGAGCCATCTGGTAC
CGTCGTTCAGAAGAAGGTCTCCAAGAGAACTGCGCGACCGATATTTCAAGGACGAGAAGGGCGACATATCTCTCGTTGATTTCGAACGAATTTACAAGGCGACTGTATTC
GACAATGACGAGACACTTAATGGCCTCCAGAGTGCGATGAAAGACAAGGTGTCACACTACAAGTCAAAGTCTGGTGCCAAAAGCAGTTTCAAGGTGAAGTATAGTTTGAC
GGGTTTTCCCCCAGCATTCCAAGCTTGGGCATACGAGATCGTACCAACTCTTCGCAGACACGGCGTTGATGAGCTAAATGATATAGTAGTGCCTCGCATATTTCATTACT
CGTGTCTAAAAATCGTCACCAAAGCTGTTCTCGAATGCGGGGTGTTCGATTCATTTGAGTTGGTTGTTATCCAGCCTCTAGCAGAGTATGATGCAGAAAGGCTGTATCGA
GAGTCGACCTTCGCAGTTAGGGTTGGTAGCTATGGCACTACTGCAGCTGCTACCGCATCAACCCGCACCGATGGAGTTGGAAATACTGAGCTCCACGACGGTTCACAGTC
TGTTGATGATGTTGAGTTCGCCAAGTTACTGGGGCCGTACACTCAATTGATTCCCATACCTGAAGGGACAGTACACCACCCATCAGACGTACACAATTCAAGAATGGTGG
AAGAAGCTGGTGCAAACACGTACACTGGTGCACCTGTTCGTCCTGAACTCGATGCTGAGAGGTTGACCATGTCGGGTGACATGAACATTGAGAACACTGATGGTGTTCGA
GAGAATCAGGACATGCAAGGCGCTGAAGAGTTTGATAGGGGCGAAGACGGACTAGGATTGTCAGTGGAGAAAGACAGATTGGAGATGTGCTTGACGTTGATGTCAGTGGA
GTGCTGA
Protein sequenceShow/hide protein sequence
MAHAFKIAEGDRFPAQATSLSHLSNVNKYIRQKLTPEQLEMFRKGTVFGRFVDLDMMFCSALVHYILLRSHLVPSFRRRSPRELRDRYFKDEKGDISLVDFERIYKATVF
DNDETLNGLQSAMKDKVSHYKSKSGAKSSFKVKYSLTGFPPAFQAWAYEIVPTLRRHGVDELNDIVVPRIFHYSCLKIVTKAVLECGVFDSFELVVIQPLAEYDAERLYR
ESTFAVRVGSYGTTAAATASTRTDGVGNTELHDGSQSVDDVEFAKLLGPYTQLIPIPEGTVHHPSDVHNSRMVEEAGANTYTGAPVRPELDAERLTMSGDMNIENTDGVR
ENQDMQGAEEFDRGEDGLGLSVEKDRLEMCLTLMSVEC