; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g04800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g04800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:3570905..3573972
RNA-Seq ExpressionMoc03g04800
SyntenyMoc03g04800
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]8.9e-14381.65Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRG PE+RNKDKYCRFHR+HDHNTS+ WELKRQIEDLIQD YFKKFVGKPR +S EKKEERK SRTP RR DRPAVINTIFGGPSGGQSG
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ+PTC ITFD  D+E VHLPHND LVIAPLIDHVVVRRVLVD G  ANI+SLLTYLALGWTR+QLKKS TPLVGFS ESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDLPVT+G D TQVTQMA+FVVID RSAYNAIFGRPIIHSFR +PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGS VC LE   + D   
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLL
        + +A+LP   +REF+APTEELELVPLL
Subjt:  KLEADLPKSGKREFSAPTEELELVPLL

XP_022152029.1 uncharacterized protein LOC111019838 [Momordica charantia]2.2e-14186.87Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRGDPEKRNKDKYCRFHRDH HNTS+CWELKRQIEDLIQDGYFKKFVGKPR NSVEK EERKRSRTP RREDRPAVINTIFGGPSGGQS 
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        NKR ELAR ARR+VCIIREQ+PT  ITFD  D+E VHLPHND LVIAPLIDHVVVRRVLVDGGA ANILSLLTYL LGWTR+QLKKS TPLVGFSGES++
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGD
         EGCIDLPVT GQD TQVT+MA+FVVID RSAYNAIFGRPIIHSFR VPSTLHQVLKYSTP+GVG VRGEQ ASRECYASALKGS VC LE QA+GD
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGD

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.0e-14681.63Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRG PE+R+KDKYCRFHR+H HNTS+ WELK QIEDLIQDGYFKKFVGKPR +S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        +KRK+LAR ARREVCIIREQ+PTC ITFD  D+  VHLPHND LVIAPLIDHVVVRRVLVDGGA ANILSL TYLALGWTR+QLKKSPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDLPVT+GQD T+VTQMA+FVV+D RSAYNAIFGRPIIHSFR +PSTLHQVLKYSTPNGVGTVRGEQ ASRECYAS LKG+ VC LE   + D   
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ
        + EADLP    REF+AP EELELVPLL  EKQ
Subjt:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]2.2e-14179.4Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRGDPEK NKD              NCWELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ G
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        NKR +LAR  RREVCIIREQKPTC ITF D D+EGVHLPHND LVIAPLIDH++VRRVL+DGGA ANI SL TYLALGWTR+QLKKSPTPLVGFSGESVS
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDL VTIGQD TQVTQMA+FVVID +SAYNAIFGRPIIHSF  V STLHQVLKYST NGVGTVRGEQK SR+CYAS LKG  VCTLE Q N  +L 
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQPDV
          EADLPK  KR+FS PTEELELVPLL PEK  ++
Subjt:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQPDV

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]6.2e-14479.82Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRG PE+R+KDKYCRFHR+H HNTS+CWELKRQIEDLIQDGYFKKFVGKP  +S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ PTC ITFD  D+E VHLPHND L+IA LIDHVVVRRVLV+GGA ANILSL TYLALGWTR+QL++SPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDLPVT+GQ+ T++TQMA+FVV+D RS YNAIFGRPIIHSFR +PSTLHQVLKY TPNGVGTVRGEQ ASRECYA+ALKG  VC LE   +G    
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ
        + EA+LP   ++EF+APTEELELVPLL PEKQ
Subjt:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1D9E1 uncharacterized protein LOC1110188234.3e-14381.65Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRG PE+RNKDKYCRFHR+HDHNTS+ WELKRQIEDLIQD YFKKFVGKPR +S EKKEERK SRTP RR DRPAVINTIFGGPSGGQSG
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ+PTC ITFD  D+E VHLPHND LVIAPLIDHVVVRRVLVD G  ANI+SLLTYLALGWTR+QLKKS TPLVGFS ESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDLPVT+G D TQVTQMA+FVVID RSAYNAIFGRPIIHSFR +PSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGS VC LE   + D   
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLL
        + +A+LP   +REF+APTEELELVPLL
Subjt:  KLEADLPKSGKREFSAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198994.9e-14781.63Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRG PE+R+KDKYCRFHR+H HNTS+ WELK QIEDLIQDGYFKKFVGKPR +S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        +KRK+LAR ARREVCIIREQ+PTC ITFD  D+  VHLPHND LVIAPLIDHVVVRRVLVDGGA ANILSL TYLALGWTR+QLKKSPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDLPVT+GQD T+VTQMA+FVV+D RSAYNAIFGRPIIHSFR +PSTLHQVLKYSTPNGVGTVRGEQ ASRECYAS LKG+ VC LE   + D   
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ
        + EADLP    REF+AP EELELVPLL  EKQ
Subjt:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ

A0A6J1DET8 uncharacterized protein LOC1110198381.1e-14186.87Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRGDPEKRNKDKYCRFHRDH HNTS+CWELKRQIEDLIQDGYFKKFVGKPR NSVEK EERKRSRTP RREDRPAVINTIFGGPSGGQS 
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        NKR ELAR ARR+VCIIREQ+PT  ITFD  D+E VHLPHND LVIAPLIDHVVVRRVLVDGGA ANILSLLTYL LGWTR+QLKKS TPLVGFSGES++
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGD
         EGCIDLPVT GQD TQVT+MA+FVVID RSAYNAIFGRPIIHSFR VPSTLHQVLKYSTP+GVG VRGEQ ASRECYASALKGS VC LE QA+GD
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGD

A0A6J1DPX9 uncharacterized protein LOC1110220061.1e-14179.4Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRGDPEK NKD              NCWELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ G
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        NKR +LAR  RREVCIIREQKPTC ITF D D+EGVHLPHND LVIAPLIDH++VRRVL+DGGA ANI SL TYLALGWTR+QLKKSPTPLVGFSGESVS
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDL VTIGQD TQVTQMA+FVVID +SAYNAIFGRPIIHSF  V STLHQVLKYST NGVGTVRGEQK SR+CYAS LKG  VCTLE Q N  +L 
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQPDV
          EADLPK  KR+FS PTEELELVPLL PEK  ++
Subjt:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQPDV

A0A6J1DT04 uncharacterized protein LOC1110228803.0e-14479.82Show/hide
Query:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG
        MEKLLKR E LRG PE+R+KDKYCRFHR+H HNTS+CWELKRQIEDLIQDGYFKKFVGKP  +S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG
Subjt:  MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSG

Query:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS
        +KRKELAR ARREVCIIREQ PTC ITFD  D+E VHLPHND L+IA LIDHVVVRRVLV+GGA ANILSL TYLALGWTR+QL++SPTPLVGFSGESV 
Subjt:  NKRKELAREARREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVS

Query:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP
         EGCIDLPVT+GQ+ T++TQMA+FVV+D RS YNAIFGRPIIHSFR +PSTLHQVLKY TPNGVGTVRGEQ ASRECYA+ALKG  VC LE   +G    
Subjt:  REGCIDLPVTIGQDDTQVTQMAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELP

Query:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ
        + EA+LP   ++EF+APTEELELVPLL PEKQ
Subjt:  KLEADLPKSGKREFSAPTEELELVPLLIPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGCTCCTCAAGCGGCTTGAGAACCTCCGAGGAGACCCAGAAAAACGCAACAAAGATAAATATTGCCGTTTTCATCGCGATCACGACCATAATACGTCAAATTG
CTGGGAACTGAAACGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAGTTTGTGGGAAAACCGAGGTTTAACTCGGTAGAAAAGAAAGAAGAGAGGAAGCGTT
CAAGGACGCCGCCTCGACGAGAGGACCGACCAGCGGTCATCAATACTATTTTCGGAGGCCCAAGTGGGGGCCAATCTGGAAATAAAAGAAAAGAACTAGCTCGCGAAGCC
AGGCGCGAGGTGTGCATCATCAGGGAGCAGAAACCGACGTGCTCCATTACATTTGACGATACTGACATGGAAGGGGTCCACTTGCCCCATAATGACACGCTTGTGATTGC
TCCTCTGATTGATCACGTCGTGGTCAGAAGAGTGCTGGTAGATGGAGGCGCATTTGCCAACATCTTGTCCCTTCTAACATATCTTGCCTTGGGATGGACCAGAGCGCAGT
TGAAGAAGAGTCCAACGCCCTTGGTTGGATTTTCTGGAGAATCGGTCTCCCGAGAAGGGTGTATTGACTTGCCGGTCACGATTGGTCAAGATGATACACAGGTAACCCAG
ATGGCCAAGTTCGTCGTGATCGACTGCAGGTCGGCCTACAATGCCATCTTCGGGAGACCCATCATCCATTCGTTCCGGACCGTTCCTTCCACACTTCATCAAGTCCTGAA
GTACTCAACCCCTAATGGAGTGGGCACGGTCCGAGGAGAGCAGAAAGCTTCAAGGGAATGCTATGCCTCCGCGCTTAAAGGGTCAATAGTGTGCACCTTGGAAGGACAAG
CTAACGGGGACGAGTTGCCGAAGCTCGAGGCCGACCTGCCGAAGTCAGGTAAAAGGGAGTTTTCTGCACCGACAGAGGAGCTTGAGCTTGTTCCTTTACTTATTCCTGAG
AAGCAACCAGACGTGATGGAGGTTGACACTCCAGTACCCTTGTGGATGGACCCAATCGTGGAGTTCATCAAAGGAACTCCGCTGCTAGATTCGAAGGAGCAAAAGAAGAT
GGCGCGGAAAGTAGCTAGATTTATACTCCGAGATGGAGCGTTGTACCGACGTGGCTTCTCCCTGCCCCTGCTTAAGTGTGTAACTCCTGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGCTCCTCAAGCGGCTTGAGAACCTCCGAGGAGACCCAGAAAAACGCAACAAAGATAAATATTGCCGTTTTCATCGCGATCACGACCATAATACGTCAAATTG
CTGGGAACTGAAACGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAGTTTGTGGGAAAACCGAGGTTTAACTCGGTAGAAAAGAAAGAAGAGAGGAAGCGTT
CAAGGACGCCGCCTCGACGAGAGGACCGACCAGCGGTCATCAATACTATTTTCGGAGGCCCAAGTGGGGGCCAATCTGGAAATAAAAGAAAAGAACTAGCTCGCGAAGCC
AGGCGCGAGGTGTGCATCATCAGGGAGCAGAAACCGACGTGCTCCATTACATTTGACGATACTGACATGGAAGGGGTCCACTTGCCCCATAATGACACGCTTGTGATTGC
TCCTCTGATTGATCACGTCGTGGTCAGAAGAGTGCTGGTAGATGGAGGCGCATTTGCCAACATCTTGTCCCTTCTAACATATCTTGCCTTGGGATGGACCAGAGCGCAGT
TGAAGAAGAGTCCAACGCCCTTGGTTGGATTTTCTGGAGAATCGGTCTCCCGAGAAGGGTGTATTGACTTGCCGGTCACGATTGGTCAAGATGATACACAGGTAACCCAG
ATGGCCAAGTTCGTCGTGATCGACTGCAGGTCGGCCTACAATGCCATCTTCGGGAGACCCATCATCCATTCGTTCCGGACCGTTCCTTCCACACTTCATCAAGTCCTGAA
GTACTCAACCCCTAATGGAGTGGGCACGGTCCGAGGAGAGCAGAAAGCTTCAAGGGAATGCTATGCCTCCGCGCTTAAAGGGTCAATAGTGTGCACCTTGGAAGGACAAG
CTAACGGGGACGAGTTGCCGAAGCTCGAGGCCGACCTGCCGAAGTCAGGTAAAAGGGAGTTTTCTGCACCGACAGAGGAGCTTGAGCTTGTTCCTTTACTTATTCCTGAG
AAGCAACCAGACGTGATGGAGGTTGACACTCCAGTACCCTTGTGGATGGACCCAATCGTGGAGTTCATCAAAGGAACTCCGCTGCTAGATTCGAAGGAGCAAAAGAAGAT
GGCGCGGAAAGTAGCTAGATTTATACTCCGAGATGGAGCGTTGTACCGACGTGGCTTCTCCCTGCCCCTGCTTAAGTGTGTAACTCCTGAATAA
Protein sequenceShow/hide protein sequence
MEKLLKRLENLRGDPEKRNKDKYCRFHRDHDHNTSNCWELKRQIEDLIQDGYFKKFVGKPRFNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQSGNKRKELAREA
RREVCIIREQKPTCSITFDDTDMEGVHLPHNDTLVIAPLIDHVVVRRVLVDGGAFANILSLLTYLALGWTRAQLKKSPTPLVGFSGESVSREGCIDLPVTIGQDDTQVTQ
MAKFVVIDCRSAYNAIFGRPIIHSFRTVPSTLHQVLKYSTPNGVGTVRGEQKASRECYASALKGSIVCTLEGQANGDELPKLEADLPKSGKREFSAPTEELELVPLLIPE
KQPDVMEVDTPVPLWMDPIVEFIKGTPLLDSKEQKKMARKVARFILRDGALYRRGFSLPLLKCVTPE