; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g15880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g15880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr5:11925366..11932827
RNA-Seq ExpressionMoc05g15880
SyntenyMoc05g15880
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040934.1 uncharacterized protein E6C27_scaffold125G001250 [Cucumis melo var. makuwa]4.7e-3030.88Show/hide
Query:  GETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDAS
        G  +K +   GP+G   ITR+S +GH+RV++Y   GQPIG + TKLK+FIG +VR H+PI Y +W  VP+E K+KIY+LI+K              ID  
Subjt:  GETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDAS

Query:  H----------------ERSS------------------------------DRCILGKKTRTNRERKI-------DDLMIMQDLEEENVDKDGDLLSIAL
        H                ER                                DR ++ KK RT ++ +I       D+L++ +       +   D+LS A+
Subjt:  H----------------ERSS------------------------------DRCILGKKTRTNRERKI-------DDLMIMQDLEEENVDKDGDLLSIAL

Query:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK
        G  D P  +R VG+ +T ++YFH   +     VG+++   EE  +M A + EL+V+L KH             K+ P+ + K + T  SK   + +S+S 
Subjt:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK

Query:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV
         T+   N              +  ++E   +L D+T+    K+ +         ETL K K+GT CRLA+G  DNVV   TIF+ + +  NVKVS+D+V 
Subjt:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV

Query:  DDDSRLPI
        D +  +P+
Subjt:  DDDSRLPI

KAA0066349.1 uncharacterized protein E6C27_scaffold21G004140 [Cucumis melo var. makuwa]7.6e-2829.41Show/hide
Query:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------
        GP+G+ EI R+S +GH+RV++Y   GQPI  + TKLK+FIG +VR H+PI Y++WK VP+E K+KIY+LI+K              ID  H         
Subjt:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------

Query:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL
                    ER                                D  ++ KK RT ++             KID+L++ +       +   D+LS A+
Subjt:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL

Query:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK
        G  D PR +R VG+ +T ++YFH   +     VG+ +   EE  +M A + EL+ +L K+             K+ P+ + K + T  SK   + +S+S 
Subjt:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK

Query:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV
         T+   N              N  ++E   +L D+ +    K+ +         ETL K K+ T C LA+G  +NVV   TIF+ + +  NVKVS+D+V 
Subjt:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV

Query:  DDDSRLPI
        D +  +P+
Subjt:  DDDSRLPI

TYK00930.1 uncharacterized protein E5676_scaffold602G001010 [Cucumis melo var. makuwa]7.6e-2829.41Show/hide
Query:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------
        GP+G+ EI R+S +GH+RV++Y   GQPI  + TKLK+FIG +VR H+PI Y++WK VP+E K+KIY+LI+K              ID  H         
Subjt:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------

Query:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL
                    ER                                D  ++ KK RT ++             KID+L++ +       +   D+LS A+
Subjt:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL

Query:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK
        G  D PR +R VG+ +T ++YFH   +     VG+ +   EE  +M A + EL+ +L K+             K+ P+ + K + T  SK   + +S+S 
Subjt:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK

Query:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV
         T+   N              N  ++E   +L D+ +    K+ +         ETL K K+ T C LA+G  +NVV   TIF+ + +  NVKVS+D+V 
Subjt:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV

Query:  DDDSRLPI
        D +  +P+
Subjt:  DDDSRLPI

TYK22243.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]4.6e-3329.06Show/hide
Query:  DVVGLGSQNNVEESLRDRPLLAGSYVKSSVDHLKEAHIYVLENTEE---------------------VEPYRSAI-------------------------
        D +GLGS N+ E+   DRPL AG+YV   +  LK+AH+++L+NT+E                     V+P  S I                         
Subjt:  DVVGLGSQNNVEESLRDRPLLAGSYVKSSVDHLKEAHIYVLENTEE---------------------VEPYRSAI-------------------------

Query:  ----------------------FHVTKENLYQNDRMTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPII
                              FH TK  L++  +   V +    ++K G + + EITR  SEG + VI+Y   GQ IG N TKLK+FIG +VR H+PI 
Subjt:  ----------------------FHVTKENLYQNDRMTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPII

Query:  YETWKSVPSETKEKIYDLIQKIDASHERSSDRCIL--GKKTRTNRER--------------------KIDDLMIMQDLEEENVDKDGDLLSIALGSRDRP
        +  W +VP E K+KI++LI+       RS    IL  GKK+   +E+                    ++D+L+  Q  +  N   + D+L+ ALG +D  
Subjt:  YETWKSVPSETKEKIYDLIQKIDASHERSSDRCIL--GKKTRTNRER--------------------KIDDLMIMQDLEEENVDKDGDLLSIALGSRDRP

Query:  RLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEK-----GSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSKR
         ++R VG  +TK +YFH  T+ +T    ++K   +E+ +M   +KEL+ KL K ++     G  K + G  +K+                  K S +   
Subjt:  RLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEK-----GSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSKR

Query:  TTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDREETLKFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVVDDDSRLPI
                 K  +S++      E +E +     V +    K ++     K K  T C+LA    D+VVA  TI +S+    NVKV++DVVVD+D  +PI
Subjt:  TTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDREETLKFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVVDDDSRLPI

XP_022156873.1 uncharacterized protein LOC111023710 [Momordica charantia]4.0e-6954.13Show/hide
Query:  MTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQ---------------
        MTDVGETSKTRKK GP+GLH+ITRISSEGHRRVIKY VKGQPIG+N TKLKTFIGISV+HHIP IYETWKSVPSETKEKIYDLIQ               
Subjt:  MTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQ---------------

Query:  ---------------------------------------------------------------------------------------KIDASHERSSDRC
                                                                                               KIDASHER SDRC
Subjt:  ---------------------------------------------------------------------------------------KIDASHERSSDRC

Query:  ILGKKTRTNRERK------------IDDLMIMQDLEEENVDKDGDLLSIALGSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTAL
        IL KK RTNRE K            IDDLMI QDL+EENVDK GDLLSIALGSRDRP LV+AVG+ ITKTQYF+NPTKHSTP VGEKKVDI+EYD+M AL
Subjt:  ILGKKTRTNRERK------------IDDLMIMQDLEEENVDKDGDLLSIALGSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTAL

Query:  VKE
        VK+
Subjt:  VKE

TrEMBL top hitse value%identityAlignment
A0A5A7TBV0 Uncharacterized protein2.3e-3030.88Show/hide
Query:  GETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDAS
        G  +K +   GP+G   ITR+S +GH+RV++Y   GQPIG + TKLK+FIG +VR H+PI Y +W  VP+E K+KIY+LI+K              ID  
Subjt:  GETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDAS

Query:  H----------------ERSS------------------------------DRCILGKKTRTNRERKI-------DDLMIMQDLEEENVDKDGDLLSIAL
        H                ER                                DR ++ KK RT ++ +I       D+L++ +       +   D+LS A+
Subjt:  H----------------ERSS------------------------------DRCILGKKTRTNRERKI-------DDLMIMQDLEEENVDKDGDLLSIAL

Query:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK
        G  D P  +R VG+ +T ++YFH   +     VG+++   EE  +M A + EL+V+L KH             K+ P+ + K + T  SK   + +S+S 
Subjt:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK

Query:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV
         T+   N              +  ++E   +L D+T+    K+ +         ETL K K+GT CRLA+G  DNVV   TIF+ + +  NVKVS+D+V 
Subjt:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV

Query:  DDDSRLPI
        D +  +P+
Subjt:  DDDSRLPI

A0A5A7VM08 Uncharacterized protein3.7e-2829.41Show/hide
Query:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------
        GP+G+ EI R+S +GH+RV++Y   GQPI  + TKLK+FIG +VR H+PI Y++WK VP+E K+KIY+LI+K              ID  H         
Subjt:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------

Query:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL
                    ER                                D  ++ KK RT ++             KID+L++ +       +   D+LS A+
Subjt:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL

Query:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK
        G  D PR +R VG+ +T ++YFH   +     VG+ +   EE  +M A + EL+ +L K+             K+ P+ + K + T  SK   + +S+S 
Subjt:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK

Query:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV
         T+   N              N  ++E   +L D+ +    K+ +         ETL K K+ T C LA+G  +NVV   TIF+ + +  NVKVS+D+V 
Subjt:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV

Query:  DDDSRLPI
        D +  +P+
Subjt:  DDDSRLPI

A0A5D3BPG6 Uncharacterized protein3.7e-2829.41Show/hide
Query:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------
        GP+G+ EI R+S +GH+RV++Y   GQPI  + TKLK+FIG +VR H+PI Y++WK VP+E K+KIY+LI+K              ID  H         
Subjt:  GPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQK--------------IDASH---------

Query:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL
                    ER                                D  ++ KK RT ++             KID+L++ +       +   D+LS A+
Subjt:  ------------ERSS------------------------------DRCILGKKTRTNRE------------RKIDDLMIMQDLEEENVDKDGDLLSIAL

Query:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK
        G  D PR +R VG+ +T ++YFH   +     VG+ +   EE  +M A + EL+ +L K+             K+ P+ + K + T  SK   + +S+S 
Subjt:  GSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSK

Query:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV
         T+   N              N  ++E   +L D+ +    K+ +         ETL K K+ T C LA+G  +NVV   TIF+ + +  NVKVS+D+V 
Subjt:  RTTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDRE-------ETL-KFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVV

Query:  DDDSRLPI
        D +  +P+
Subjt:  DDDSRLPI

A0A5D3DFC9 CACTA en-spm transposon protein2.2e-3329.06Show/hide
Query:  DVVGLGSQNNVEESLRDRPLLAGSYVKSSVDHLKEAHIYVLENTEE---------------------VEPYRSAI-------------------------
        D +GLGS N+ E+   DRPL AG+YV   +  LK+AH+++L+NT+E                     V+P  S I                         
Subjt:  DVVGLGSQNNVEESLRDRPLLAGSYVKSSVDHLKEAHIYVLENTEE---------------------VEPYRSAI-------------------------

Query:  ----------------------FHVTKENLYQNDRMTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPII
                              FH TK  L++  +   V +    ++K G + + EITR  SEG + VI+Y   GQ IG N TKLK+FIG +VR H+PI 
Subjt:  ----------------------FHVTKENLYQNDRMTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPII

Query:  YETWKSVPSETKEKIYDLIQKIDASHERSSDRCIL--GKKTRTNRER--------------------KIDDLMIMQDLEEENVDKDGDLLSIALGSRDRP
        +  W +VP E K+KI++LI+       RS    IL  GKK+   +E+                    ++D+L+  Q  +  N   + D+L+ ALG +D  
Subjt:  YETWKSVPSETKEKIYDLIQKIDASHERSSDRCIL--GKKTRTNRER--------------------KIDDLMIMQDLEEENVDKDGDLLSIALGSRDRP

Query:  RLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEK-----GSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSKR
         ++R VG  +TK +YFH  T+ +T    ++K   +E+ +M   +KEL+ KL K ++     G  K + G  +K+                  K S +   
Subjt:  RLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEK-----GSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSKR

Query:  TTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDREETLKFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVVDDDSRLPI
                 K  +S++      E +E +     V +    K ++     K K  T C+LA    D+VVA  TI +S+    NVKV++DVVVD+D  +PI
Subjt:  TTPSKNASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDREETLKFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVVDDDSRLPI

A0A6J1DRS8 uncharacterized protein LOC1110237101.9e-6954.13Show/hide
Query:  MTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQ---------------
        MTDVGETSKTRKK GP+GLH+ITRISSEGHRRVIKY VKGQPIG+N TKLKTFIGISV+HHIP IYETWKSVPSETKEKIYDLIQ               
Subjt:  MTDVGETSKTRKKCGPSGLHEITRISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQ---------------

Query:  ---------------------------------------------------------------------------------------KIDASHERSSDRC
                                                                                               KIDASHER SDRC
Subjt:  ---------------------------------------------------------------------------------------KIDASHERSSDRC

Query:  ILGKKTRTNRERK------------IDDLMIMQDLEEENVDKDGDLLSIALGSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTAL
        IL KK RTNRE K            IDDLMI QDL+EENVDK GDLLSIALGSRDRP LV+AVG+ ITKTQYF+NPTKHSTP VGEKKVDI+EYD+M AL
Subjt:  ILGKKTRTNRERK------------IDDLMIMQDLEEENVDKDGDLLSIALGSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTAL

Query:  VKE
        VK+
Subjt:  VKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTAATCCTCTTTTCTTCTCCCTCCTCGTCTCCTTCTCTTTCGAAGCCGCCAAGTCGAATCGTCTGTCGTCCGCCGCTTTTGGAAACTATTTGCTGCCGTCACGCTT
TCATTTGCCGTCCACTGCCATCGCTCCGTTGTCCGCCGTCCACCGCCATTGCTCTGTCGTACTGCCCATTTCTCACGCTCTTTCCTTCGCCTGTTTCTTCCTTCCTAGGA
CAATGGACAAATCATTGATGAAAAGAAATAGATTGTCTGAGGATTATGAGTTAGGAGTCGAGTCATTCGTTGAATTTGGGTTCCATCATGCAAATGGTTCTAAGAGCTAT
AGGGTATGGTTTTGGCATGGTGAAGAACATGCACCTAGAACAGATGAGGATAAGTTGAATGATGTAGTTGGACTGGGCTCTCAAAACAATGTAGAAGAGAGTCTCCGTGA
TAGGCCTTTATTAGCTGGATCATACGTAAAGTCGAGTGTTGATCATCTAAAAGAAGCTCATATTTATGTATTGGAGAACACTGAAGAAGTGGAACCATATCGAAGTGCAA
TTTTCCACGTAACTAAAGAGAATTTGTACCAGAACGACAGAATGACAGATGTAGGTGAAACTAGCAAGACGAGGAAGAAGTGTGGACCTTCGGGGTTGCACGAGATTACA
CGAATTAGCAGCGAAGGTCACCGGCGGGTTATTAAGTATAAGGTGAAAGGGCAACCGATTGGATATAATGTGACGAAATTGAAGACTTTTATTGGAATAAGTGTACGACA
CCATATTCCCATAATATACGAAACATGGAAATCGGTTCCATCTGAGACGAAGGAGAAAATCTATGACTTGATACAGAAAATTGATGCATCACATGAGAGATCATCAGATC
GTTGCATCTTAGGGAAGAAGACCCGAACAAATCGCGAGAGGAAGATTGATGATCTCATGATAATGCAAGACTTGGAAGAAGAGAATGTGGACAAAGATGGGGATTTGTTA
TCCATCGCACTAGGCAGTCGGGATCGGCCCAGACTTGTTAGAGCAGTAGGTCGGGACATAACAAAAACCCAATACTTCCATAATCCAACCAAACACTCAACGCCCCCAGT
AGGAGAGAAAAAGGTGGACATCGAAGAGTATGATCAAATGACAGCTCTTGTGAAAGAACTTCAAGTGAAACTGAAGAAGCATGAAAAAGGTTCCCCAAAATCAAAACATG
GCACGTCGGCTAAGAAGACTCCAAAAAAATCTCCTAAACCGAAGCGTACCACACCATCTAAAAACGCTCCAAAGAAATCTTCTCAGTCAAAGCGTACCACACCATCTAAA
AATGCTTCAAAAAAATCTCCTCAATCAAAGCGTAGTACTTTATCGAACCATGAGCGAGATGAAAAGTTACCAATACTTACTGATGTTACTTTATGTGTTCACTTGAAGAT
ATTGGACAGGGAAGAAACTTTGAAGTTCAAGGAGGGAACTTATTGTCGTCTGGCACTTGGGTTCATTGATAATGTTGTCGCTGTAAGCACTATATTTGAATCTGAGAGGA
ATGATGGAAACGTGAAAGTGTCCATGGACGTGGTGGTTGATGACGACTCTCGACTTCCAATTTCCGACGCATGGAGGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTAATCCTCTTTTCTTCTCCCTCCTCGTCTCCTTCTCTTTCGAAGCCGCCAAGTCGAATCGTCTGTCGTCCGCCGCTTTTGGAAACTATTTGCTGCCGTCACGCTT
TCATTTGCCGTCCACTGCCATCGCTCCGTTGTCCGCCGTCCACCGCCATTGCTCTGTCGTACTGCCCATTTCTCACGCTCTTTCCTTCGCCTGTTTCTTCCTTCCTAGGA
CAATGGACAAATCATTGATGAAAAGAAATAGATTGTCTGAGGATTATGAGTTAGGAGTCGAGTCATTCGTTGAATTTGGGTTCCATCATGCAAATGGTTCTAAGAGCTAT
AGGGTATGGTTTTGGCATGGTGAAGAACATGCACCTAGAACAGATGAGGATAAGTTGAATGATGTAGTTGGACTGGGCTCTCAAAACAATGTAGAAGAGAGTCTCCGTGA
TAGGCCTTTATTAGCTGGATCATACGTAAAGTCGAGTGTTGATCATCTAAAAGAAGCTCATATTTATGTATTGGAGAACACTGAAGAAGTGGAACCATATCGAAGTGCAA
TTTTCCACGTAACTAAAGAGAATTTGTACCAGAACGACAGAATGACAGATGTAGGTGAAACTAGCAAGACGAGGAAGAAGTGTGGACCTTCGGGGTTGCACGAGATTACA
CGAATTAGCAGCGAAGGTCACCGGCGGGTTATTAAGTATAAGGTGAAAGGGCAACCGATTGGATATAATGTGACGAAATTGAAGACTTTTATTGGAATAAGTGTACGACA
CCATATTCCCATAATATACGAAACATGGAAATCGGTTCCATCTGAGACGAAGGAGAAAATCTATGACTTGATACAGAAAATTGATGCATCACATGAGAGATCATCAGATC
GTTGCATCTTAGGGAAGAAGACCCGAACAAATCGCGAGAGGAAGATTGATGATCTCATGATAATGCAAGACTTGGAAGAAGAGAATGTGGACAAAGATGGGGATTTGTTA
TCCATCGCACTAGGCAGTCGGGATCGGCCCAGACTTGTTAGAGCAGTAGGTCGGGACATAACAAAAACCCAATACTTCCATAATCCAACCAAACACTCAACGCCCCCAGT
AGGAGAGAAAAAGGTGGACATCGAAGAGTATGATCAAATGACAGCTCTTGTGAAAGAACTTCAAGTGAAACTGAAGAAGCATGAAAAAGGTTCCCCAAAATCAAAACATG
GCACGTCGGCTAAGAAGACTCCAAAAAAATCTCCTAAACCGAAGCGTACCACACCATCTAAAAACGCTCCAAAGAAATCTTCTCAGTCAAAGCGTACCACACCATCTAAA
AATGCTTCAAAAAAATCTCCTCAATCAAAGCGTAGTACTTTATCGAACCATGAGCGAGATGAAAAGTTACCAATACTTACTGATGTTACTTTATGTGTTCACTTGAAGAT
ATTGGACAGGGAAGAAACTTTGAAGTTCAAGGAGGGAACTTATTGTCGTCTGGCACTTGGGTTCATTGATAATGTTGTCGCTGTAAGCACTATATTTGAATCTGAGAGGA
ATGATGGAAACGTGAAAGTGTCCATGGACGTGGTGGTTGATGACGACTCTCGACTTCCAATTTCCGACGCATGGAGGAAATGA
Protein sequenceShow/hide protein sequence
MPNPLFFSLLVSFSFEAAKSNRLSSAAFGNYLLPSRFHLPSTAIAPLSAVHRHCSVVLPISHALSFACFFLPRTMDKSLMKRNRLSEDYELGVESFVEFGFHHANGSKSY
RVWFWHGEEHAPRTDEDKLNDVVGLGSQNNVEESLRDRPLLAGSYVKSSVDHLKEAHIYVLENTEEVEPYRSAIFHVTKENLYQNDRMTDVGETSKTRKKCGPSGLHEIT
RISSEGHRRVIKYKVKGQPIGYNVTKLKTFIGISVRHHIPIIYETWKSVPSETKEKIYDLIQKIDASHERSSDRCILGKKTRTNRERKIDDLMIMQDLEEENVDKDGDLL
SIALGSRDRPRLVRAVGRDITKTQYFHNPTKHSTPPVGEKKVDIEEYDQMTALVKELQVKLKKHEKGSPKSKHGTSAKKTPKKSPKPKRTTPSKNAPKKSSQSKRTTPSK
NASKKSPQSKRSTLSNHERDEKLPILTDVTLCVHLKILDREETLKFKEGTYCRLALGFIDNVVAVSTIFESERNDGNVKVSMDVVVDDDSRLPISDAWRK