; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:14651633..14654593
RNA-Seq ExpressionMoc02g19760
SyntenyMoc02g19760
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152029.1 uncharacterized protein LOC111019838 [Momordica charantia]9.5e-14086.78Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRGDPEKRNKDKYC FHRDHGHNTS+CWELKRQIE+LIQDGYFKKFVGK RSNSVEK EERKRSRTP RR+DRPAVINTIFGG SGGQS 
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        NKR ELAR ARR+VCIIREQ+PT  ITF  A+LE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSL TYLTLGWTRSQLKKS TPLVGFSGES++
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTN
        PEGCIDLPVT GQD TQV +MAEFVVIDGRSAYNAIFGRPIIHSF  VPSTLHQVLKYST +GVG VRGEQ  SRECYASALKGS+VCALE+Q +
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]2.1e-14781.44Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRG PE+R+KDKYC FHR+HGHNTS+ WELK QIE+LIQDGYFKKFVGK R++S EKKEERKRSRTPPRR DRPAVINTIFGG SGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        +KRK+LAR ARREVCIIREQ+PTC ITF  A+L  VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYL LGWTRSQLKKSPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
        PEGCIDLPVT+GQD T+V QMAEFVV+DGRSAYNAIFGRPIIHSF  +PSTLHQVLKYST NGVGTVRGEQ  SRECYAS LKG++VCALE  T+     
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVR
        E +ADLP    R+F+ P EELELVPLLS EKQV+
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVR

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]2.9e-15285.59Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRGDPEK NKD              NCWELKRQIE LIQDGYFKKFVGK RSNSVEKKEERKRSRTPPRRDDRPAVINTIFGG SGGQ G
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        NKR +LAR  RREVCIIREQKPTC ITFGDA+LEGVHLPHNDALVIAPLIDH+LVRRVL+DGGASANI SLPTYL LGWTRSQLKKSPTPLVGFSGESVS
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
        PEGCIDL VTIGQDATQV QMAEFVVID +SAYNAIFGRPIIHSF  V STLHQVLKYST NGVGTVRGEQKTSR+CYAS LKG AVC LEEQTN GKLQ
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQV
         S+ADLPK+ KRQFSPPTEELELVPLLS EK V
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQV

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]7.1e-14379.1Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRG PE+R+KDKYC FHR+HGHNTS+CWELKRQIE+LIQDGYFKKFVGK  ++S EKKEERKRSRTPPRR DRPAVINTIFGG SGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ PTC ITF  A+LE VHLPHNDAL+IA LIDHV+VRRVLV+GGASANILSLPTYL LGWTRSQL++SPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
        PEGCIDLPVT+GQ+ T++ QMAEFVV+DGRS YNAIFGRPIIHSF  +PSTLHQVLKY T NGVGTVRGEQ  SRECYA+ALKG +VCALE   + G L 
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVRS
        E +A+LP   +++F+ PTEELELVPLLS EKQ+ +
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVRS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.6e-13980.12Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRGD EKRNK+KYC FHRDHGHNT++CWELKRQIE+LIQDGYFKKFVGK RSNSVEKKEERKRSRTPPRR+DRPAVINTIFGG +GGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        NKRKELAREARREVCIIRE KPTCSITFGDA+LEGVHLPHNDALVIA LIDH LVRRVL+DG                                      
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
          GCIDLPVTIGQDATQV QMAEFVVIDGRSAYNAIFGRPIIHSF  VPSTLHQVLKYST N VG VRGEQKTSRECYASALKGSAVCALEEQTN GKLQ
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQ
        ES+ADLPKEGKRQF PPTEELELVPLLS E+Q
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1DD03 uncharacterized protein LOC1110198991.0e-14781.44Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRG PE+R+KDKYC FHR+HGHNTS+ WELK QIE+LIQDGYFKKFVGK R++S EKKEERKRSRTPPRR DRPAVINTIFGG SGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        +KRK+LAR ARREVCIIREQ+PTC ITF  A+L  VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYL LGWTRSQLKKSPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
        PEGCIDLPVT+GQD T+V QMAEFVV+DGRSAYNAIFGRPIIHSF  +PSTLHQVLKYST NGVGTVRGEQ  SRECYAS LKG++VCALE  T+     
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVR
        E +ADLP    R+F+ P EELELVPLLS EKQV+
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVR

A0A6J1DET8 uncharacterized protein LOC1110198384.6e-14086.78Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRGDPEKRNKDKYC FHRDHGHNTS+CWELKRQIE+LIQDGYFKKFVGK RSNSVEK EERKRSRTP RR+DRPAVINTIFGG SGGQS 
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        NKR ELAR ARR+VCIIREQ+PT  ITF  A+LE VHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSL TYLTLGWTRSQLKKS TPLVGFSGES++
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTN
        PEGCIDLPVT GQD TQV +MAEFVVIDGRSAYNAIFGRPIIHSF  VPSTLHQVLKYST +GVG VRGEQ  SRECYASALKGS+VCALE+Q +
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTN

A0A6J1DPX9 uncharacterized protein LOC1110220061.4e-15285.59Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRGDPEK NKD              NCWELKRQIE LIQDGYFKKFVGK RSNSVEKKEERKRSRTPPRRDDRPAVINTIFGG SGGQ G
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        NKR +LAR  RREVCIIREQKPTC ITFGDA+LEGVHLPHNDALVIAPLIDH+LVRRVL+DGGASANI SLPTYL LGWTRSQLKKSPTPLVGFSGESVS
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
        PEGCIDL VTIGQDATQV QMAEFVVID +SAYNAIFGRPIIHSF  V STLHQVLKYST NGVGTVRGEQKTSR+CYAS LKG AVC LEEQTN GKLQ
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQV
         S+ADLPK+ KRQFSPPTEELELVPLLS EK V
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQV

A0A6J1DT04 uncharacterized protein LOC1110228803.4e-14379.1Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRG PE+R+KDKYC FHR+HGHNTS+CWELKRQIE+LIQDGYFKKFVGK  ++S EKKEERKRSRTPPRR DRPAVINTIFGG SGGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ PTC ITF  A+LE VHLPHNDAL+IA LIDHV+VRRVLV+GGASANILSLPTYL LGWTRSQL++SPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
        PEGCIDLPVT+GQ+ T++ QMAEFVV+DGRS YNAIFGRPIIHSF  +PSTLHQVLKY T NGVGTVRGEQ  SRECYA+ALKG +VCALE   + G L 
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVRS
        E +A+LP   +++F+ PTEELELVPLLS EKQ+ +
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQVRS

A0A6J1DZB9 uncharacterized protein LOC1110249047.9e-14080.12Show/hide
Query:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG
        MEKLLKRPEKLRGD EKRNK+KYC FHRDHGHNT++CWELKRQIE+LIQDGYFKKFVGK RSNSVEKKEERKRSRTPPRR+DRPAVINTIFGG +GGQSG
Subjt:  MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS
        NKRKELAREARREVCIIRE KPTCSITFGDA+LEGVHLPHNDALVIA LIDH LVRRVL+DG                                      
Subjt:  NKRKELAREARREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVS

Query:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ
          GCIDLPVTIGQDATQV QMAEFVVIDGRSAYNAIFGRPIIHSF  VPSTLHQVLKYST N VG VRGEQKTSRECYASALKGSAVCALEEQTN GKLQ
Subjt:  PEGCIDLPVTIGQDATQVMQMAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQ

Query:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQ
        ES+ADLPKEGKRQF PPTEELELVPLLS E+Q
Subjt:  ESDADLPKEGKRQFSPPTEELELVPLLSLEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAACAAAGATAAGTACTGCTGTTTTCATCGCGATCACGGCCACAATACGTCAAATTG
CTGGGAGTTAAAACGCCAGATTGAAAACCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAAACGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTT
CAAGAACGCCACCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGAGGCTCGAGCGGGGGCCAGTCTGGAAACAAGAGGAAAGAGCTAGCTCGCGAGGCT
AGGCGCGAGGTGTGCATCATCAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGACGCCAACCTGGAGGGGGTCCACTTGCCTCACAATGACGCGCTCGTGATCGC
CCCTCTCATTGATCACGTCCTGGTCCGAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCCCTCCCAACATATCTAACATTGGGATGGACCAGGTCACAAT
TGAAGAAAAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATCGACCTGCCGGTAACAATTGGGCAAGATGCTACCCAAGTAATGCAG
ATGGCCGAGTTCGTAGTGATCGACGGCAGATCGGCCTATAATGCCATTTTCGGGAGACCCATCATCCACTCATTTCTGGTCGTCCCCTCCACACTGCATCAAGTCTTGAA
GTACTCAACCCTGAATGGGGTGGGTACGGTCCGAGGTGAGCAAAAAACCTCACGGGAGTGTTATGCATCCGCGCTTAAAGGGTCGGCGGTATGCGCCCTGGAAGAACAAA
CCAATTGTGGCAAGCTGCAGGAGTCAGACGCCGACCTGCCAAAGGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCTCGAA
AAACAAGTCAGATCGCACCTCGCCCAGTTCAGGACTTACGAGGTGAGTCAAGTTCGAAGATCTGAGAACTCTAATGCAGATGCCTTAGCCAAATTGGCATCAGCATACGA
GACCGACCTGGCTAGGTCGGTCCCGGTCGAAATCCTAGACACTCCTTCAATCTTGGAGCCAGATGTAATGGAGGTTGATACTGCATCACTCACTTGGATGGACCCAATCG
TGGAGTTCATCAAAGGAAACCCACCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAACAAAGATAAGTACTGCTGTTTTCATCGCGATCACGGCCACAATACGTCAAATTG
CTGGGAGTTAAAACGCCAGATTGAAAACCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAAACGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTT
CAAGAACGCCACCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGAGGCTCGAGCGGGGGCCAGTCTGGAAACAAGAGGAAAGAGCTAGCTCGCGAGGCT
AGGCGCGAGGTGTGCATCATCAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGACGCCAACCTGGAGGGGGTCCACTTGCCTCACAATGACGCGCTCGTGATCGC
CCCTCTCATTGATCACGTCCTGGTCCGAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCCCTCCCAACATATCTAACATTGGGATGGACCAGGTCACAAT
TGAAGAAAAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATCGACCTGCCGGTAACAATTGGGCAAGATGCTACCCAAGTAATGCAG
ATGGCCGAGTTCGTAGTGATCGACGGCAGATCGGCCTATAATGCCATTTTCGGGAGACCCATCATCCACTCATTTCTGGTCGTCCCCTCCACACTGCATCAAGTCTTGAA
GTACTCAACCCTGAATGGGGTGGGTACGGTCCGAGGTGAGCAAAAAACCTCACGGGAGTGTTATGCATCCGCGCTTAAAGGGTCGGCGGTATGCGCCCTGGAAGAACAAA
CCAATTGTGGCAAGCTGCAGGAGTCAGACGCCGACCTGCCAAAGGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCTCGAA
AAACAAGTCAGATCGCACCTCGCCCAGTTCAGGACTTACGAGGTGAGTCAAGTTCGAAGATCTGAGAACTCTAATGCAGATGCCTTAGCCAAATTGGCATCAGCATACGA
GACCGACCTGGCTAGGTCGGTCCCGGTCGAAATCCTAGACACTCCTTCAATCTTGGAGCCAGATGTAATGGAGGTTGATACTGCATCACTCACTTGGATGGACCCAATCG
TGGAGTTCATCAAAGGAAACCCACCGTAA
Protein sequenceShow/hide protein sequence
MEKLLKRPEKLRGDPEKRNKDKYCCFHRDHGHNTSNCWELKRQIENLIQDGYFKKFVGKTRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGSSGGQSGNKRKELAREA
RREVCIIREQKPTCSITFGDANLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLTLGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQVMQ
MAEFVVIDGRSAYNAIFGRPIIHSFLVVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSAVCALEEQTNCGKLQESDADLPKEGKRQFSPPTEELELVPLLSLE
KQVRSHLAQFRTYEVSQVRRSENSNADALAKLASAYETDLARSVPVEILDTPSILEPDVMEVDTASLTWMDPIVEFIKGNPP