CuGenDBv2

Moc04g05770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g05770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr4:3926606..3929015
RNA-Seq Expression: Moc04g05770
Synteny: Moc04g05770
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant
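
The genome location above is written as `chr:start..end`. As a minimal sketch, the string can be split into its parts and the span length computed. This assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split 'chr:start..end' into (chromosome, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr4:3926606..3929015")
print(chrom, start, end, length)
```

Under that assumption the gene spans 2410 bp on chr4.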


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]  e value: 9.0e-104  % identity: 50.81
Query:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI
        F    G WNE+  +YEGG+MGGL+VD+ ITY +L+SA+F +TRI+PD F I++ C+YKF  QY VP +YIFDD SL F+L GPPHPS+VPLYVSVVPKE 
Subjt:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI

Query:  HGSGSSSMNHNI---PEAETFQSFPHQLGQTVPYYAPSFPFDSML--PGPSCFVPSMTPLTDNVI-----------------------------------
          SGS+S  H++   P+ ETF SFP Q+ Q VP  AP     S +    P   V  MTPLTDNV+                                   
Subjt:  HGSGSSSMNHNI---PEAETFQSFPHQLGQTVPYYAPSFPFDSML--PGPSCFVPSMTPLTDNVI-----------------------------------

Query:  ---------------SYPVDVAGPS-SDPSTEVHVVSTNAPCA-IGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKS
                         PV    PS   P  EV  VS NAPCA +     S E + T        +DIA+G+ FRSK++L+F L+V+A++ NFE++VKKS
Subjt:  ---------------SYPVDVAGPS-SDPSTEVHVVSTNAPCA-IGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKS

Query:  TKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARER
        T+SL +V C E+GC+W+LR+RKIKGSDTFLISTF E H   RE ++HDH+QA S VVGQ+IK+  ED+SRRYRP+DI+ DMR+NYGVN RYEK WRARE 
Subjt:  TKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARER

Query:  ALELLMGSPKKSYTLLRKYGEALKSVNPGTM
        AL LLMGSPK+SYT L KYG ALK+ N GT+
Subjt:  ALELLMGSPKKSYTLLRKYGEALKSVNPGTM

XP_022156802.1 uncharacterized protein LOC111023635 [Momordica charantia]  e value: 1.6e-76  % identity: 46.32
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV
        M  LFV YGG WNE+  +YEGG MGGLDVD++ITY  L+SA+  LTRID D+F +++ CVY  +L +++                        PL  +V+
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV

Query:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLPGPSCFVPSMTPLTDNVISYPVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSRE
           +    +   N  + E E    F          Y      +                T++V+    +  G       E H VS NAP    +    R 
Subjt:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLPGPSCFVPSMTPLTDNVISYPVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSRE

Query:  I--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQ
        +  + TG+      E IAV   FRSKE+L+FKLSV AMK+NF+++VKKSTK+L+TVGCTE GCKW LR++ I+G D+F+IS F + H C REV+ HDHRQ
Subjt:  I--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQ

Query:  ARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM
        ARS VVGQ++KS  EDVSR+YRPKDI+NDMR+NYGVNIRYEK WRA+  AL LLMG PK SYTLLRKYGEALK+VN  T+
Subjt:  ARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM

XP_022156834.1 uncharacterized protein LOC111023667 [Momordica charantia]  e value: 1.3e-83  % identity: 43.89
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV
        M  LFV YGG WNE+  +YEGG+MGGLDVD++ITY  L+SA+  LTRIDPD+F +++ CVY+F+ +Y+VP Y IFDD SL+F+L GPP PS+VPLYV+V+
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV

Query:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISYPVD-----------------------------
        PK  +GSGS   N N  E +T  SFP+   Q  P +  +   DS L   G   F+P +TPL DNVI   +D                             
Subjt:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISYPVD-----------------------------

Query:  ------VAGPSSDPST-------------------------EVHVVSTNAPCAIGQASCSREI--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAM
              V G   +                            E H +S NAP    +   SR +  + TG+      E IAV   F SK +L+FKL     
Subjt:  ------VAGPSSDPST-------------------------EVHVVSTNAPCAIGQASCSREI--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAM

Query:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNI
                                    LR++ I+G D+F+IS F +VH C REV+ HDHRQARS VVGQ++KS  EDVSR+Y+PKDI+NDMRKNYGVNI
Subjt:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNI

Query:  RYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM
        RYEK W A+  AL LL+GSPK SYTLL KYGEALK VN GT+
Subjt:  RYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM

XP_022158743.1 PKS-NRPS hybrid synthetase CHGG_01239-like [Momordica charantia]  e value: 2.8e-89  % identity: 55.88
Query:  VVPKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISY------------------------------
        ++PK+ HG GSSS N   P  + F SFP+QLGQ VP      P  S LP  G SC V S+TPLTDNV+SY                              
Subjt:  VVPKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISY------------------------------

Query:  -----------------------------PVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMK
                                     P+DV   SS P  E++ VS NAPCA  QASCSR+I +T D++ L++E I V + F+S  +LQF  SV+AMK
Subjt:  -----------------------------PVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMK

Query:  MNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIR
        +NFEYRVKKSTKSL TVGC  DGCKW + +R+I+GSDTFLIS F+ VH+C  EVMKHDHRQARSR+VGQIIK+ FED SRRYRPKDIVN+MRKNYGVNI+
Subjt:  MNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIR

Query:  YEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT
        YEK WRARE AL+LLMGSPKKSYTLLRKYGEALKSVNPGT
Subjt:  YEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT

XP_022159183.1 uncharacterized protein LOC111025603 [Momordica charantia]  e value: 1.5e-71  % identity: 75.26
Query:  VVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY
        +VSTNA CA  QAS SRE+ RT D+V LS +DIA+G+TFRSKE+LQFKLSV+AM++NFEY VKKSTKSLY +GC+EDGCKWS   RKI+GSD FLISTFY
Subjt:  VVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY

Query:  EVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPG
        EVHSC REVMKHDHRQA+SRVVGQIIKS FEDVS RYRPKDIVNDM+KNY VNIRY             LMGSPKKSYTLLRKY EALKSVN G
Subjt:  EVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DLB0 uncharacterized protein LOC111021969  e value: 4.3e-104  % identity: 50.81
Query:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI
        F    G WNE+  +YEGG+MGGL+VD+ ITY +L+SA+F +TRI+PD F I++ C+YKF  QY VP +YIFDD SL F+L GPPHPS+VPLYVSVVPKE 
Subjt:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI

Query:  HGSGSSSMNHNI---PEAETFQSFPHQLGQTVPYYAPSFPFDSML--PGPSCFVPSMTPLTDNVI-----------------------------------
          SGS+S  H++   P+ ETF SFP Q+ Q VP  AP     S +    P   V  MTPLTDNV+                                   
Subjt:  HGSGSSSMNHNI---PEAETFQSFPHQLGQTVPYYAPSFPFDSML--PGPSCFVPSMTPLTDNVI-----------------------------------

Query:  ---------------SYPVDVAGPS-SDPSTEVHVVSTNAPCA-IGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKS
                         PV    PS   P  EV  VS NAPCA +     S E + T        +DIA+G+ FRSK++L+F L+V+A++ NFE++VKKS
Subjt:  ---------------SYPVDVAGPS-SDPSTEVHVVSTNAPCA-IGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKS

Query:  TKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARER
        T+SL +V C E+GC+W+LR+RKIKGSDTFLISTF E H   RE ++HDH+QA S VVGQ+IK+  ED+SRRYRP+DI+ DMR+NYGVN RYEK WRARE 
Subjt:  TKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARER

Query:  ALELLMGSPKKSYTLLRKYGEALKSVNPGTM
        AL LLMGSPK+SYT L KYG ALK+ N GT+
Subjt:  ALELLMGSPKKSYTLLRKYGEALKSVNPGTM

A0A6J1DSY0 uncharacterized protein LOC111023635  e value: 7.7e-77  % identity: 46.32
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV
        M  LFV YGG WNE+  +YEGG MGGLDVD++ITY  L+SA+  LTRID D+F +++ CVY  +L +++                        PL  +V+
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV

Query:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLPGPSCFVPSMTPLTDNVISYPVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSRE
           +    +   N  + E E    F          Y      +                T++V+    +  G       E H VS NAP    +    R 
Subjt:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLPGPSCFVPSMTPLTDNVISYPVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSRE

Query:  I--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQ
        +  + TG+      E IAV   FRSKE+L+FKLSV AMK+NF+++VKKSTK+L+TVGCTE GCKW LR++ I+G D+F+IS F + H C REV+ HDHRQ
Subjt:  I--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQ

Query:  ARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM
        ARS VVGQ++KS  EDVSR+YRPKDI+NDMR+NYGVNIRYEK WRA+  AL LLMG PK SYTLLRKYGEALK+VN  T+
Subjt:  ARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM

A0A6J1DUS4 uncharacterized protein LOC111023667  e value: 6.5e-84  % identity: 43.89
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV
        M  LFV YGG WNE+  +YEGG+MGGLDVD++ITY  L+SA+  LTRIDPD+F +++ CVY+F+ +Y+VP Y IFDD SL+F+L GPP PS+VPLYV+V+
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVV

Query:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISYPVD-----------------------------
        PK  +GSGS   N N  E +T  SFP+   Q  P +  +   DS L   G   F+P +TPL DNVI   +D                             
Subjt:  PKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISYPVD-----------------------------

Query:  ------VAGPSSDPST-------------------------EVHVVSTNAPCAIGQASCSREI--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAM
              V G   +                            E H +S NAP    +   SR +  + TG+      E IAV   F SK +L+FKL     
Subjt:  ------VAGPSSDPST-------------------------EVHVVSTNAPCAIGQASCSREI--VRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAM

Query:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNI
                                    LR++ I+G D+F+IS F +VH C REV+ HDHRQARS VVGQ++KS  EDVSR+Y+PKDI+NDMRKNYGVNI
Subjt:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNI

Query:  RYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM
        RYEK W A+  AL LL+GSPK SYTLL KYGEALK VN GT+
Subjt:  RYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTM

A0A6J1DWY9 PKS-NRPS hybrid synthetase CHGG_01239-like  e value: 1.4e-89  % identity: 55.88
Query:  VVPKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISY------------------------------
        ++PK+ HG GSSS N   P  + F SFP+QLGQ VP      P  S LP  G SC V S+TPLTDNV+SY                              
Subjt:  VVPKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLP--GPSCFVPSMTPLTDNVISY------------------------------

Query:  -----------------------------PVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMK
                                     P+DV   SS P  E++ VS NAPCA  QASCSR+I +T D++ L++E I V + F+S  +LQF  SV+AMK
Subjt:  -----------------------------PVDVAGPSSDPSTEVHVVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMK

Query:  MNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIR
        +NFEYRVKKSTKSL TVGC  DGCKW + +R+I+GSDTFLIS F+ VH+C  EVMKHDHRQARSR+VGQIIK+ FED SRRYRPKDIVN+MRKNYGVNI+
Subjt:  MNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIR

Query:  YEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT
        YEK WRARE AL+LLMGSPKKSYTLLRKYGEALKSVNPGT
Subjt:  YEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT

A0A6J1DY41 uncharacterized protein LOC111025603  e value: 7.5e-72  % identity: 75.26
Query:  VVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY
        +VSTNA CA  QAS SRE+ RT D+V LS +DIA+G+TFRSKE+LQFKLSV+AM++NFEY VKKSTKSLY +GC+EDGCKWS   RKI+GSD FLISTFY
Subjt:  VVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY

Query:  EVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPG
        EVHSC REVMKHDHRQA+SRVVGQIIKS FEDVS RYRPKDIVNDM+KNY VNIRY             LMGSPKKSYTLLRKY EALKSVN G
Subjt:  EVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
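
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives can be counted for one block from the query line and that midline. The function name `midline_stats` is hypothetical, and the % identity reported for each hit covers the whole alignment and may be computed differently, so a single block will not necessarily reproduce it.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    query   -- the 'Query:' residues for the block (may contain '-')
    midline -- the unlabeled match line printed under the query
    Returns (identities, positives, block_length).
    """
    identities = sum(ch.isalpha() for ch in midline)  # letters = exact matches
    positives = identities + midline.count("+")       # '+' = similar residues
    return identities, positives, len(query)

# Final block of the first GenBank hit (XP_022154803.1)
query   = "ALELLMGSPKKSYTLLRKYGEALKSVNPGTM"
midline = "AL LLMGSPK+SYT L KYG ALK+ N GT+"
ident, pos, length = midline_stats(query, midline)
print(f"{ident}/{length} identities, {pos}/{length} positives")
```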

Sequences
CDS sequence
ATGATAACTAATTCTCGGGCAGAATCGCATGTTTCCCGCCCGAGAATTGGCCAAAATGCTAATTTTGGGTTGTTTATGGGCGAGAATTGGCGAGAAGTTAGGTTGACTTA
CAATGAATTCTCGGCCATTAACTGGATGCCTCACCTATTTGTTAGCTATGGTGGTAGTTGGAATGAGTCACAATTTCTATATGAAGGTGGAATTATGGGAGGTTTGGATG
TGGACGATTCTATAACTTATGAGGAGCTCCTTAGTGCTATGTTCAGCCTTACCCGAATAGATCCGGATCGGTTCAAAATCTTGATACACTGTGTATATAAGTTCAATCTG
CAGTACCAGGTTCCGAAGTATTACATCTTTGATGACCATAGCCTTAGATTTTTTTTAAGAGGCCCTCCACATCCCTCCGAAGTTCCATTGTATGTATCTGTCGTACCGAA
GGAAATACATGGCAGTGGAAGCAGTTCAATGAATCATAACATTCCAGAAGCAGAAACATTCCAATCATTTCCCCACCAGTTAGGGCAGACCGTTCCGTATTATGCTCCAT
CGTTTCCTTTTGATTCCATGCTCCCAGGCCCATCATGTTTTGTCCCATCAATGACGCCGCTGACGGACAATGTAATCTCATACCCAGTGGATGTAGCCGGTCCATCATCG
GACCCCTCGACCGAAGTGCACGTGGTCAGTACGAATGCACCGTGCGCAATCGGTCAAGCTTCTTGCTCAAGGGAAATTGTTAGGACAGGTGATAAAGTTTGTTTGTCAAC
GGAGGACATTGCGGTAGGGAATACTTTTCGATCGAAAGAAGATTTGCAGTTCAAACTCTCGGTGTACGCAATGAAGATGAATTTTGAATATCGCGTGAAGAAGTCGACAA
AAAGTTTGTACACTGTCGGATGCACCGAGGATGGGTGCAAATGGAGCCTACGTTCAAGGAAAATTAAAGGTTCAGATACTTTTCTTATCTCTACATTCTATGAGGTTCAC
AGTTGCACTCGTGAGGTAATGAAACATGACCACCGGCAAGCTCGAAGTCGTGTGGTGGGTCAGATTATAAAGTCCACATTTGAGGATGTAAGTCGACGTTATAGACCGAA
GGATATTGTTAATGACATGAGGAAAAATTACGGTGTTAACATTCGATATGAAAAGGTGTGGCGTGCGAGAGAGAGGGCTTTGGAACTACTAATGGGATCGCCAAAAAAGT
CGTACACTCTTTTGCGTAAATACGGTGAGGCATTGAAATCGGTGAACCCGGGCACGATGAACTTGAATGATAAGTTCAAGATTCAGAGCGAAGGCGTGGAATGA
mRNA sequence
ATGATAACTAATTCTCGGGCAGAATCGCATGTTTCCCGCCCGAGAATTGGCCAAAATGCTAATTTTGGGTTGTTTATGGGCGAGAATTGGCGAGAAGTTAGGTTGACTTA
CAATGAATTCTCGGCCATTAACTGGATGCCTCACCTATTTGTTAGCTATGGTGGTAGTTGGAATGAGTCACAATTTCTATATGAAGGTGGAATTATGGGAGGTTTGGATG
TGGACGATTCTATAACTTATGAGGAGCTCCTTAGTGCTATGTTCAGCCTTACCCGAATAGATCCGGATCGGTTCAAAATCTTGATACACTGTGTATATAAGTTCAATCTG
CAGTACCAGGTTCCGAAGTATTACATCTTTGATGACCATAGCCTTAGATTTTTTTTAAGAGGCCCTCCACATCCCTCCGAAGTTCCATTGTATGTATCTGTCGTACCGAA
GGAAATACATGGCAGTGGAAGCAGTTCAATGAATCATAACATTCCAGAAGCAGAAACATTCCAATCATTTCCCCACCAGTTAGGGCAGACCGTTCCGTATTATGCTCCAT
CGTTTCCTTTTGATTCCATGCTCCCAGGCCCATCATGTTTTGTCCCATCAATGACGCCGCTGACGGACAATGTAATCTCATACCCAGTGGATGTAGCCGGTCCATCATCG
GACCCCTCGACCGAAGTGCACGTGGTCAGTACGAATGCACCGTGCGCAATCGGTCAAGCTTCTTGCTCAAGGGAAATTGTTAGGACAGGTGATAAAGTTTGTTTGTCAAC
GGAGGACATTGCGGTAGGGAATACTTTTCGATCGAAAGAAGATTTGCAGTTCAAACTCTCGGTGTACGCAATGAAGATGAATTTTGAATATCGCGTGAAGAAGTCGACAA
AAAGTTTGTACACTGTCGGATGCACCGAGGATGGGTGCAAATGGAGCCTACGTTCAAGGAAAATTAAAGGTTCAGATACTTTTCTTATCTCTACATTCTATGAGGTTCAC
AGTTGCACTCGTGAGGTAATGAAACATGACCACCGGCAAGCTCGAAGTCGTGTGGTGGGTCAGATTATAAAGTCCACATTTGAGGATGTAAGTCGACGTTATAGACCGAA
GGATATTGTTAATGACATGAGGAAAAATTACGGTGTTAACATTCGATATGAAAAGGTGTGGCGTGCGAGAGAGAGGGCTTTGGAACTACTAATGGGATCGCCAAAAAAGT
CGTACACTCTTTTGCGTAAATACGGTGAGGCATTGAAATCGGTGAACCCGGGCACGATGAACTTGAATGATAAGTTCAAGATTCAGAGCGAAGGCGTGGAATGA
Protein sequence
MITNSRAESHVSRPRIGQNANFGLFMGENWREVRLTYNEFSAINWMPHLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDRFKILIHCVYKFNL
QYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEIHGSGSSSMNHNIPEAETFQSFPHQLGQTVPYYAPSFPFDSMLPGPSCFVPSMTPLTDNVISYPVDVAGPSS
DPSTEVHVVSTNAPCAIGQASCSREIVRTGDKVCLSTEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVH
SCTREVMKHDHRQARSRVVGQIIKSTFEDVSRRYRPKDIVNDMRKNYGVNIRYEKVWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGTMNLNDKFKIQSEGVE
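
The protein sequence above should correspond to a translation of the CDS. As a minimal sketch, the check below translates a nucleotide string codon by codon and stops at the first stop codon. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on this page; `translate` is a hypothetical helper, not part of any CuGenDB tool.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS (whitespace and line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nucleotides of the CDS listed above
print(translate("ATGATAACTAATTCTCGGGCAGAATCGCATGTTTCC"))  # MITNSRAESHVS
```

Passing the full CDS, with its line breaks left in, and comparing the result against the protein sequence is a quick consistency check on the record.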