CuGenDBv2

Moc04g29570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g29570
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr4:21949599..21950675
RNA-Seq Expression: Moc04g29570
Synteny: Moc04g29570
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant
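
The genome location above can be turned into a span length for a quick sanity check against the sequences listed further down. The snippet below is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("chr4:21949599..21950675")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 1077 bp, which can be compared with the length of the CDS sequence given in the Sequences section.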


Homology
GenBank top hits
XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia] | e value: 6.9e-78 | % identity: 50.56
Query:  FVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVVPKEI
        F    G WNE+  +YEGG+MGGL+VD  ITY +L+SA+F +TRI+   F I++ C+YKF  QY VP +YIFDD SL F+L GPPHPS+V LYVSVVPKE 
Subjt:  FVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVVPKEI

Query:  HGNGSSSMNRNI-PEAEAFQSFSHQLGQTVPYYASSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDD-NDENDVEYEYEAEDDDNHD
          +GS+S +  + P+ E F SF  Q+ Q VP  A      S++    P   V  MT LTDNV+PCNLGDDE  + GQWDD  D+ D EY    +D ++ D
Subjt:  HGNGSSSMNRNI-PEAEAFQSFSHQLGQTVPYYASSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDD-NDENDVEYEYEAEDDDNHD

Query:  TEFEDDVDFDNEEEVDPVDVAGPS-SDPSTEVHVVSTNAPCATGQ---ASCSR-EIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVK
           E + + DNE + +PV    PS   P  EV  VS NAPCAT +   AS  + + +  DD++   ++DIA+G+ FRSK++L+F L+V+A++ NFE++VK
Subjt:  TEFEDDVDFDNEEEVDPVDVAGPS-SDPSTEVHVVSTNAPCATGQ---ASCSR-EIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVK

Query:  KSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
        KST+SL +V C E+GC+W+LR+RKIKGSDTFLISTF E H   RE ++HDH+QA S
Subjt:  KSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

XP_022156802.1 uncharacterized protein LOC111023635 [Momordica charantia] | e value: 3.0e-41 | % identity: 37.23
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV
        M  LFV YGG WNE+  +YEGG MGGLDVD +ITY  L+SA+  LTRIDL +F +++ CVY            IF+                        
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV

Query:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLPGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDND---ENDVEYEYEAEDDD
                                                               +T L DNVI CNL DDE +     D N    EN+VEYE++++D+ 
Subjt:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLPGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDND---ENDVEYEYEAEDDD

Query:  NHDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSV
        ++D    ED V+      +DN  E+E    DV     +       E H VS NAP  T +    R +           + +IAV   FRSKE+L+FKLSV
Subjt:  NHDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSV

Query:  YAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
         AMK+NF+++VKKSTK+L+TVGCTE GCKW LR++ I+G D+F+IS F + H C REV+ HDHRQARS
Subjt:  YAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

XP_022156834.1 uncharacterized protein LOC111023667 [Momordica charantia] | e value: 4.8e-55 | % identity: 41.14
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV
        M  LFV YGG WNE+  +YEGG+MGGLDVD +ITY  L+SA+  LTRID  +F +++ CVY+F+ +Y+VP Y IFDD SL+F+L GPP PS+V LYV+V+
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV

Query:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDDDN
        PK  +G+GS   N N  E +   SF +   Q  P +  +   DS L   G   F+P +T L DNVIPCNL DDE+ Y GQ     EN+VEY+++++D+ +
Subjt:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDDDN

Query:  HDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSVY
        +D    ED V+      +DN  E+E    DV     +       E H +S NAP  T +   SR +           + +IAV   F SK +L+FKL   
Subjt:  HDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSVY

Query:  AMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
                                      LR++ I+G D+F+IS F +VH C REV+ HDHRQARS
Subjt:  AMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

XP_022158743.1 PKS-NRPS hybrid synthetase CHGG_01239-like [Momordica charantia] | e value: 1.5e-64 | % identity: 57.36
Query:  VVPKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDD
        ++PK+ HG GSSS N   P  +AF SF +QLGQ VP      P  STLP  G SC V S+T LTDNV+  NLGDDE NY  QWD++D+ND  Y  EAED+
Subjt:  VVPKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDD

Query:  D----------NHDTEFEDDVDFDNEEEVDPVDVAGPSSDPSTEVHVVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAM
        D            D EF +DV  DNE EV+P+DV   SS P  E++ VS NAPCAT QASCSR+I +T DE+  + E I V + F+S  +LQF  SV+AM
Subjt:  D----------NHDTEFEDDVDFDNEEEVDPVDVAGPSSDPSTEVHVVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAM

Query:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
        K+NFEYRVKKSTKSL TVGC  DGCKW + +R+I+GSDTFLIS F+ VH+C  EVMKHDHRQARS
Subjt:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

XP_022159183.1 uncharacterized protein LOC111025603 [Momordica charantia] | e value: 6.5e-44 | % identity: 78.15
Query:  VVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY
        +VSTNA CAT QAS SRE+ RTDDEV  SM+DIA+G+TFRSKE+LQFKLSV+AM++NFEY VKKSTKSLY +GC+EDGCKWS   RKI+GSD FLISTFY
Subjt:  VVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY

Query:  EVHSCTREVMKHDHRQARS
        EVHSC REVMKHDHRQA+S
Subjt:  EVHSCTREVMKHDHRQARS

TrEMBL top hits
A0A6J1DLB0 uncharacterized protein LOC111021969 | e value: 3.3e-78 | % identity: 50.56
Query:  FVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVVPKEI
        F    G WNE+  +YEGG+MGGL+VD  ITY +L+SA+F +TRI+   F I++ C+YKF  QY VP +YIFDD SL F+L GPPHPS+V LYVSVVPKE 
Subjt:  FVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVVPKEI

Query:  HGNGSSSMNRNI-PEAEAFQSFSHQLGQTVPYYASSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDD-NDENDVEYEYEAEDDDNHD
          +GS+S +  + P+ E F SF  Q+ Q VP  A      S++    P   V  MT LTDNV+PCNLGDDE  + GQWDD  D+ D EY    +D ++ D
Subjt:  HGNGSSSMNRNI-PEAEAFQSFSHQLGQTVPYYASSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDD-NDENDVEYEYEAEDDDNHD

Query:  TEFEDDVDFDNEEEVDPVDVAGPS-SDPSTEVHVVSTNAPCATGQ---ASCSR-EIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVK
           E + + DNE + +PV    PS   P  EV  VS NAPCAT +   AS  + + +  DD++   ++DIA+G+ FRSK++L+F L+V+A++ NFE++VK
Subjt:  TEFEDDVDFDNEEEVDPVDVAGPS-SDPSTEVHVVSTNAPCATGQ---ASCSR-EIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVK

Query:  KSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
        KST+SL +V C E+GC+W+LR+RKIKGSDTFLISTF E H   RE ++HDH+QA S
Subjt:  KSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

A0A6J1DSY0 uncharacterized protein LOC111023635 | e value: 1.5e-41 | % identity: 37.23
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV
        M  LFV YGG WNE+  +YEGG MGGLDVD +ITY  L+SA+  LTRIDL +F +++ CVY            IF+                        
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV

Query:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLPGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDND---ENDVEYEYEAEDDD
                                                               +T L DNVI CNL DDE +     D N    EN+VEYE++++D+ 
Subjt:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLPGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDND---ENDVEYEYEAEDDD

Query:  NHDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSV
        ++D    ED V+      +DN  E+E    DV     +       E H VS NAP  T +    R +           + +IAV   FRSKE+L+FKLSV
Subjt:  NHDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSV

Query:  YAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
         AMK+NF+++VKKSTK+L+TVGCTE GCKW LR++ I+G D+F+IS F + H C REV+ HDHRQARS
Subjt:  YAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

A0A6J1DUS4 uncharacterized protein LOC111023667 | e value: 2.3e-55 | % identity: 41.14
Query:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV
        M  LFV YGG WNE+  +YEGG+MGGLDVD +ITY  L+SA+  LTRID  +F +++ CVY+F+ +Y+VP Y IFDD SL+F+L GPP PS+V LYV+V+
Subjt:  MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVV

Query:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDDDN
        PK  +G+GS   N N  E +   SF +   Q  P +  +   DS L   G   F+P +T L DNVIPCNL DDE+ Y GQ     EN+VEY+++++D+ +
Subjt:  PKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDDDN

Query:  HDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSVY
        +D    ED V+      +DN  E+E    DV     +       E H +S NAP  T +   SR +           + +IAV   F SK +L+FKL   
Subjt:  HDT-EFEDDVD------FDN--EEEVDPVDVAGPSSDPS----TEVHVVSTNAPCATGQASCSREIVR-TDDEVCSSMEDIAVGNTFRSKEDLQFKLSVY

Query:  AMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
                                      LR++ I+G D+F+IS F +VH C REV+ HDHRQARS
Subjt:  AMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

A0A6J1DWY9 PKS-NRPS hybrid synthetase CHGG_01239-like | e value: 7.2e-65 | % identity: 57.36
Query:  VVPKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDD
        ++PK+ HG GSSS N   P  +AF SF +QLGQ VP      P  STLP  G SC V S+T LTDNV+  NLGDDE NY  QWD++D+ND  Y  EAED+
Subjt:  VVPKEIHGNGSSSMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLP--GPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDD

Query:  D----------NHDTEFEDDVDFDNEEEVDPVDVAGPSSDPSTEVHVVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAM
        D            D EF +DV  DNE EV+P+DV   SS P  E++ VS NAPCAT QASCSR+I +T DE+  + E I V + F+S  +LQF  SV+AM
Subjt:  D----------NHDTEFEDDVDFDNEEEVDPVDVAGPSSDPSTEVHVVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAM

Query:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS
        K+NFEYRVKKSTKSL TVGC  DGCKW + +R+I+GSDTFLIS F+ VH+C  EVMKHDHRQARS
Subjt:  KMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFYEVHSCTREVMKHDHRQARS

A0A6J1DY41 uncharacterized protein LOC111025603 | e value: 3.2e-44 | % identity: 78.15
Query:  VVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY
        +VSTNA CAT QAS SRE+ RTDDEV  SM+DIA+G+TFRSKE+LQFKLSV+AM++NFEY VKKSTKSLY +GC+EDGCKWS   RKI+GSD FLISTFY
Subjt:  VVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLISTFY

Query:  EVHSCTREVMKHDHRQARS
        EVHSC REVMKHDHRQA+S
Subjt:  EVHSCTREVMKHDHRQARS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCCTCACCTATTTGTTAGCTATGGTGGTAGTTGGAATGAGTCACAATTTCTATATGAAGGTGGAATTATGGGAGGTTTGGATGTGGACAATTCTATAACTTATGAGGA
GCTCCTTAGTGCTATGTTCAGCCTTACCCGAATAGATCTGGTTCGGTTCAAAATCTTGATACACTGTGTATATAAGTTCAATCTGCAGTACCAGGTTCCGAAATATTACA
TCTTTGATGACCATAGCCTTAGATTTTTTTTAAGAGGCCCTCCACATCCCTCCGAAGTCTCATTGTATGTATCTGTCGTACCGAAGGAAATACATGGCAATGGAAGCAGT
TCAATGAATCGTAACATTCCAGAAGCAGAAGCATTCCAATCATTTTCCCACCAGTTAGGGCAGACCGTTCCGTATTATGCTTCATCGTTTCCTTTTGATTCCACGCTCCC
AGGTCCATCATGTTTTGTCCCATCAATGACGTCGCTGACGGACAATGTAATCCCATGTAACTTGGGTGACGATGAAACAAACTATTGCGGTCAATGGGACGATAATGATG
AGAACGACGTGGAGTACGAGTATGAGGCCGAGGATGATGACAACCACGATACTGAATTCGAGGATGATGTTGATTTTGACAACGAGGAGGAAGTAGACCCAGTGGATGTA
GCCGGTCCATCATCGGACCCCTCGACCGAAGTACACGTGGTCAGTACGAATGCACCGTGCGCAACCGGTCAAGCTTCTTGCTCAAGGGAAATTGTTAGGACAGATGATGA
AGTTTGTTCGTCAATGGAGGACATTGCGGTAGGGAATACTTTTCGATCGAAAGAAGATTTGCAGTTCAAACTCTCGGTGTACGCAATGAAGATGAATTTTGAATATCGCG
TGAAGAAGTCGACAAAAAGTTTGTACACTGTCGGATGCACCGAGGATGGGTGCAAATGGAGTCTACGTTCAAGGAAAATTAAAGGGTCAGATACTTTTCTTATCTCTACA
TTCTATGAGGTTCACAGTTGCACTCGTGAGGTAATGAAACATGACCACCGGCAAGCTCGAAGTCATAGGATGAGGAAGGGATTTTAG
mRNA sequence
ATGCCTCACCTATTTGTTAGCTATGGTGGTAGTTGGAATGAGTCACAATTTCTATATGAAGGTGGAATTATGGGAGGTTTGGATGTGGACAATTCTATAACTTATGAGGA
GCTCCTTAGTGCTATGTTCAGCCTTACCCGAATAGATCTGGTTCGGTTCAAAATCTTGATACACTGTGTATATAAGTTCAATCTGCAGTACCAGGTTCCGAAATATTACA
TCTTTGATGACCATAGCCTTAGATTTTTTTTAAGAGGCCCTCCACATCCCTCCGAAGTCTCATTGTATGTATCTGTCGTACCGAAGGAAATACATGGCAATGGAAGCAGT
TCAATGAATCGTAACATTCCAGAAGCAGAAGCATTCCAATCATTTTCCCACCAGTTAGGGCAGACCGTTCCGTATTATGCTTCATCGTTTCCTTTTGATTCCACGCTCCC
AGGTCCATCATGTTTTGTCCCATCAATGACGTCGCTGACGGACAATGTAATCCCATGTAACTTGGGTGACGATGAAACAAACTATTGCGGTCAATGGGACGATAATGATG
AGAACGACGTGGAGTACGAGTATGAGGCCGAGGATGATGACAACCACGATACTGAATTCGAGGATGATGTTGATTTTGACAACGAGGAGGAAGTAGACCCAGTGGATGTA
GCCGGTCCATCATCGGACCCCTCGACCGAAGTACACGTGGTCAGTACGAATGCACCGTGCGCAACCGGTCAAGCTTCTTGCTCAAGGGAAATTGTTAGGACAGATGATGA
AGTTTGTTCGTCAATGGAGGACATTGCGGTAGGGAATACTTTTCGATCGAAAGAAGATTTGCAGTTCAAACTCTCGGTGTACGCAATGAAGATGAATTTTGAATATCGCG
TGAAGAAGTCGACAAAAAGTTTGTACACTGTCGGATGCACCGAGGATGGGTGCAAATGGAGTCTACGTTCAAGGAAAATTAAAGGGTCAGATACTTTTCTTATCTCTACA
TTCTATGAGGTTCACAGTTGCACTCGTGAGGTAATGAAACATGACCACCGGCAAGCTCGAAGTCATAGGATGAGGAAGGGATTTTAG
Protein sequence
MPHLFVSYGGSWNESQFLYEGGIMGGLDVDNSITYEELLSAMFSLTRIDLVRFKILIHCVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVSLYVSVVPKEIHGNGSS
SMNRNIPEAEAFQSFSHQLGQTVPYYASSFPFDSTLPGPSCFVPSMTSLTDNVIPCNLGDDETNYCGQWDDNDENDVEYEYEAEDDDNHDTEFEDDVDFDNEEEVDPVDV
AGPSSDPSTEVHVVSTNAPCATGQASCSREIVRTDDEVCSSMEDIAVGNTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSDTFLIST
FYEVHSCTREVMKHDHRQARSHRMRKGF