CuGenDBv2

Moc02g12350 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g12350
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr2:8966877..8970933
RNA-Seq Expression: Moc02g12350
Synteny: Moc02g12350
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
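
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the length of the locus. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval in bp.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr2:8966877..8970933")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the locus spans 4,057 bp on chr2.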


Homology
GenBank top hits (e value, % identity, alignment)
RVX16334.1 hypothetical protein CK203_014466 [Vitis vinifera] (e value: 1.2e-19; % identity: 34.27)
Query:  DLLSVKDVISHVCRELAQKVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMR------------LLDSTVR--------------
        DL+ V D  + +  ++ +KVF  A H +C YH+ + LK K+K     KLF   A  Y + +F    R            ++D+ V               
Subjt:  DLLSVKDVISHVCRELAQKVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMR------------LLDSTVR--------------

Query:  ---GIREELSSI------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCS
           GI E L+ +                     S YY+ N L S+Y++SI+P GH   W I +DV    +LP   +RPAGRP K RIPS  E K+R +C 
Subjt:  ---GIREELSSI------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCS

Query:  RCGRYGHNKKSCK
        RCG YGHN+KSCK
Subjt:  RCGRYGHNKKSCK

TYK07957.1 hypothetical protein E5676_scaffold265G00780 [Cucumis melo var. makuwa] (e value: 4.6e-22; % identity: 33.49)
Query:  KVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIG----------------------------------
        +VF    +C+C  HL   LKL +K+ + DK FF CA AY VE+FE++MR ++S    I++ LSS+G                                  
Subjt:  KVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIG----------------------------------

Query:  ---------------------------YVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGR
                                   YVS +Y    L  TY+  I P+G H  W +++ V  +K+LPP  KR AGRP K RIPS  EF    KCS C R
Subjt:  ---------------------------YVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGR

Query:  YGHNKKSCK
         GHN ++CK
Subjt:  YGHNKKSCK

XP_022134813.1 uncharacterized protein LOC111006994 [Momordica charantia] (e value: 9.3e-92; % identity: 39.16)
Query:  MSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHS
        MSYANL + IM+ L LVG SDYPDV +C+GT QF+KKDMKISDDKDVHWLYNII NGA+QCCSLVVDCRNCL N+LD MPINTSSS+DN I S GQF+  
Subjt:  MSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHS

Query:  IDVANNSGNSHIEVNDKFCGKKNLQNAL-------------------------------------------------------------------RQATF
        IDVAN S   HI VNDKFCGK+NLQNAL                                                                   RQATF
Subjt:  IDVANNSGNSHIEVNDKFCGKKNLQNAL-------------------------------------------------------------------RQATF

Query:  SVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------------------------------------------------------------
        SVIKEFIK RIN  GTDL SVKD ISHV REL                                                                    
Subjt:  SVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------AQKVFTTACHCICAYHL
                                                                                           A+KVFTTACHCICAYHL
Subjt:  -----------------------------------------------------------------------------------AQKVFTTACHCICAYHL

Query:  FKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------------------------------------
        FK LKLKYK+KVSD LFFPCAKAYNVEDFE NMRLLDSTVRGIREELS I                                                  
Subjt:  FKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------------------------------------

Query:  -------------------------------------------------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIK
                                                                     GYVS YYSNNYLRSTYS SIHPLGH SSWNI EDVKIIK
Subjt:  -------------------------------------------------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIK

Query:  MLPPNVKRPAGRPMKLRI
        MLPPNVKRPAGRP KLRI
Subjt:  MLPPNVKRPAGRPMKLRI

XP_022155207.1 uncharacterized protein LOC111022347 [Momordica charantia] (e value: 1.1e-116; % identity: 47.4)
Query:  VSMFHGGFWNDRDNYVEYKVSELIIHQEMSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCL
        +SMFHGGFWNDRDNYVEYKVSELIIHQEMSYA L+K IMEQL LVG SDYPD+ SCIGTTQF+ KDMKISDDKDVHWLYN+I NG  QCCSLVVDCRNCL
Subjt:  VSMFHGGFWNDRDNYVEYKVSELIIHQEMSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCL

Query:  RNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNSHIEVNDKFCGKKNLQNAL--------------------------------------------
         NVLDRMPINTSSSIDN IPSIGQFH+SIDV N   N HIEVNDKFCGKKNLQNAL                                            
Subjt:  RNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNSHIEVNDKFCGKKNLQNAL--------------------------------------------

Query:  -----------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHV--------------------------------------------
                               RQATFSVIKEFIKDR NM GTDLLSVKDVISHV                                            
Subjt:  -----------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHV--------------------------------------------

Query:  ---------------------------------CREL-----------------------AQKVFTTACHCICAYHL-----------FKKLKLKYKEK-
                                         CR L                       A+KVFTTACHCICA++            F K    Y    
Subjt:  ---------------------------------CREL-----------------------AQKVFTTACHCICAYHL-----------FKKLKLKYKEK-

Query:  ----------------VSDKLFFPCA---------------KAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------GYVSSYYSNN
                        + D +  P                 K  N  DF+   +   S    + E++S+                     GYVSSYYSNN
Subjt:  ----------------VSDKLFFPCA---------------KAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------GYVSSYYSNN

Query:  YLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGRYGHNKKSCKFSLTQ
        YL STYS SIHPLGH SSWNI EDVKIIKML PNVKRPAGRP KLRIPSALEFKKRVKCSRCGRYGHN+KSCKFSLTQ
Subjt:  YLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGRYGHNKKSCKFSLTQ

XP_022159005.1 uncharacterized protein LOC111025451 [Momordica charantia] (e value: 4.5e-86; % identity: 50.64)
Query:  MEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNS
        ME+LRLVG SDYP+V SCIGTT+F+KKDMKI DDKDV+WL+N+  NGAVQCCSLVVDCRNCL NVLDRMPIN SSSIDN IPSIGQFHHSIDVAN S N 
Subjt:  MEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNS

Query:  HIEVNDKFCGKKNLQNAL-------------------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------
        HIEVNDKFCG +NLQNAL                               RQATFSVIKEFIKDRIN+ GTDLLSVKDVISHV REL              
Subjt:  HIEVNDKFCGKKNLQNAL-------------------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------

Query:  -------------------------------------------------------------------------------------------------AQK
                                                                                                          +K
Subjt:  -------------------------------------------------------------------------------------------------AQK

Query:  VFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIGYV--SSYYSN----NYLRSTYSESIH
        VFTTA HCICAYHLFK LKLKYKEK+SDKLFF CAKAYN+EDFEHNMRLLDSTVRG+REELS IG+   S  YS     N++ +  SES++
Subjt:  VFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIGYV--SSYYSN----NYLRSTYSESIH

TrEMBL top hits (e value, % identity, alignment)
A0A438K565 CCHC-type domain-containing protein (e value: 6.0e-20; % identity: 34.27)
Query:  DLLSVKDVISHVCRELAQKVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMR------------LLDSTVR--------------
        DL+ V D  + +  ++ +KVF  A H +C YH+ + LK K+K     KLF   A  Y + +F    R            ++D+ V               
Subjt:  DLLSVKDVISHVCRELAQKVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMR------------LLDSTVR--------------

Query:  ---GIREELSSI------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCS
           GI E L+ +                     S YY+ N L S+Y++SI+P GH   W I +DV    +LP   +RPAGRP K RIPS  E K+R +C 
Subjt:  ---GIREELSSI------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCS

Query:  RCGRYGHNKKSCK
        RCG YGHN+KSCK
Subjt:  RCGRYGHNKKSCK

A0A5D3C7L7 Uncharacterized protein (e value: 2.2e-22; % identity: 33.49)
Query:  KVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIG----------------------------------
        +VF    +C+C  HL   LKL +K+ + DK FF CA AY VE+FE++MR ++S    I++ LSS+G                                  
Subjt:  KVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIG----------------------------------

Query:  ---------------------------YVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGR
                                   YVS +Y    L  TY+  I P+G H  W +++ V  +K+LPP  KR AGRP K RIPS  EF    KCS C R
Subjt:  ---------------------------YVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGR

Query:  YGHNKKSCK
         GHN ++CK
Subjt:  YGHNKKSCK

A0A6J1C328 uncharacterized protein LOC111006994 (e value: 4.5e-92; % identity: 39.16)
Query:  MSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHS
        MSYANL + IM+ L LVG SDYPDV +C+GT QF+KKDMKISDDKDVHWLYNII NGA+QCCSLVVDCRNCL N+LD MPINTSSS+DN I S GQF+  
Subjt:  MSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHS

Query:  IDVANNSGNSHIEVNDKFCGKKNLQNAL-------------------------------------------------------------------RQATF
        IDVAN S   HI VNDKFCGK+NLQNAL                                                                   RQATF
Subjt:  IDVANNSGNSHIEVNDKFCGKKNLQNAL-------------------------------------------------------------------RQATF

Query:  SVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------------------------------------------------------------
        SVIKEFIK RIN  GTDL SVKD ISHV REL                                                                    
Subjt:  SVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------AQKVFTTACHCICAYHL
                                                                                           A+KVFTTACHCICAYHL
Subjt:  -----------------------------------------------------------------------------------AQKVFTTACHCICAYHL

Query:  FKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------------------------------------
        FK LKLKYK+KVSD LFFPCAKAYNVEDFE NMRLLDSTVRGIREELS I                                                  
Subjt:  FKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------------------------------------

Query:  -------------------------------------------------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIK
                                                                     GYVS YYSNNYLRSTYS SIHPLGH SSWNI EDVKIIK
Subjt:  -------------------------------------------------------------GYVSSYYSNNYLRSTYSESIHPLGHHSSWNILEDVKIIK

Query:  MLPPNVKRPAGRPMKLRI
        MLPPNVKRPAGRP KLRI
Subjt:  MLPPNVKRPAGRPMKLRI

A0A6J1DNQ8 uncharacterized protein LOC111022347 (e value: 5.3e-117; % identity: 47.4)
Query:  VSMFHGGFWNDRDNYVEYKVSELIIHQEMSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCL
        +SMFHGGFWNDRDNYVEYKVSELIIHQEMSYA L+K IMEQL LVG SDYPD+ SCIGTTQF+ KDMKISDDKDVHWLYN+I NG  QCCSLVVDCRNCL
Subjt:  VSMFHGGFWNDRDNYVEYKVSELIIHQEMSYANLMKVIMEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCL

Query:  RNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNSHIEVNDKFCGKKNLQNAL--------------------------------------------
         NVLDRMPINTSSSIDN IPSIGQFH+SIDV N   N HIEVNDKFCGKKNLQNAL                                            
Subjt:  RNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNSHIEVNDKFCGKKNLQNAL--------------------------------------------

Query:  -----------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHV--------------------------------------------
                               RQATFSVIKEFIKDR NM GTDLLSVKDVISHV                                            
Subjt:  -----------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHV--------------------------------------------

Query:  ---------------------------------CREL-----------------------AQKVFTTACHCICAYHL-----------FKKLKLKYKEK-
                                         CR L                       A+KVFTTACHCICA++            F K    Y    
Subjt:  ---------------------------------CREL-----------------------AQKVFTTACHCICAYHL-----------FKKLKLKYKEK-

Query:  ----------------VSDKLFFPCA---------------KAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------GYVSSYYSNN
                        + D +  P                 K  N  DF+   +   S    + E++S+                     GYVSSYYSNN
Subjt:  ----------------VSDKLFFPCA---------------KAYNVEDFEHNMRLLDSTVRGIREELSSI--------------------GYVSSYYSNN

Query:  YLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGRYGHNKKSCKFSLTQ
        YL STYS SIHPLGH SSWNI EDVKIIKML PNVKRPAGRP KLRIPSALEFKKRVKCSRCGRYGHN+KSCKFSLTQ
Subjt:  YLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGRYGHNKKSCKFSLTQ

A0A6J1DXF3 uncharacterized protein LOC111025451 (e value: 2.2e-86; % identity: 50.64)
Query:  MEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNS
        ME+LRLVG SDYP+V SCIGTT+F+KKDMKI DDKDV+WL+N+  NGAVQCCSLVVDCRNCL NVLDRMPIN SSSIDN IPSIGQFHHSIDVAN S N 
Subjt:  MEQLRLVGGSDYPDVCSCIGTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNS

Query:  HIEVNDKFCGKKNLQNAL-------------------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------
        HIEVNDKFCG +NLQNAL                               RQATFSVIKEFIKDRIN+ GTDLLSVKDVISHV REL              
Subjt:  HIEVNDKFCGKKNLQNAL-------------------------------RQATFSVIKEFIKDRINMSGTDLLSVKDVISHVCREL--------------

Query:  -------------------------------------------------------------------------------------------------AQK
                                                                                                          +K
Subjt:  -------------------------------------------------------------------------------------------------AQK

Query:  VFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIGYV--SSYYSN----NYLRSTYSESIH
        VFTTA HCICAYHLFK LKLKYKEK+SDKLFF CAKAYN+EDFEHNMRLLDSTVRG+REELS IG+   S  YS     N++ +  SES++
Subjt:  VFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIGYV--SSYYSN----NYLRSTYSESIH

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAACGGGCCTATCCAACAAACTAGAAGAGTCGAGTTCGTTTTCTTTGTGAGGCCAAGGGATTCTCCACAGGGATTTGACTATGTAAATGGAGATTCACGGACA
AATGGGAACTCCTTCAGCTCAGGTTGCACTTCTATTACCGTGAGCATGTTTCATGGTGGATTTTGGAACGACAGAGATAATTACGTGGAATACAAAGTGTCAGAA
CTAATTATACATCAAGAAATGTCATATGCCAATTTGATGAAAGTTATAATGGAACAATTGAGATTGGTAGGTGGTTCTGATTATCCTGATGTCTGTTCTTGCATT
GGAACCACTCAGTTTGTTAAGAAGGACATGAAAATATCAGACGACAAGGATGTGCATTGGTTATACAACATAATTTGTAATGGTGCCGTTCAATGTTGTAGTTTA
GTTGTTGATTGTAGAAATTGTCTTCGTAACGTATTAGATCGCATGCCGATTAATACTTCAAGTTCGATAGACAATGATATACCATCAATTGGTCAATTTCATCAC
TCAATTGATGTTGCTAATAATTCAGGGAATTCCCACATTGAAGTGAATGACAAGTTTTGTGGAAAGAAAAATTTACAGAATGCTTTGCGGCAAGCAACTTTTTCT
GTTATCAAGGAGTTTATTAAGGATAGGATTAACATGTCCGGCACAGATTTGCTTAGTGTTAAAGATGTTATATCCCATGTGTGTAGAGAACTCGCTCAGAAGGTT
TTCACCACTGCTTGCCACTGCATATGCGCCTATCATCTCTTCAAGAAGTTAAAGTTGAAGTACAAGGAGAAAGTTTCTGATAAATTATTCTTCCCATGTGCCAAA
GCATACAATGTTGAGGATTTTGAACATAACATGCGCCTTTTAGATTCTACTGTACGAGGCATACGCGAGGAGTTGTCTAGCATAGGTTACGTATCCTCCTACTAT
TCGAACAACTATTTACGTTCCACGTACAGTGAAAGTATCCACCCATTAGGTCATCATTCAAGTTGGAATATCCTAGAAGATGTGAAGATTATAAAGATGCTGCCG
CCCAATGTGAAACGTCCTGCTGGTAGACCAATGAAGTTGAGGATTCCGTCAGCATTAGAATTTAAAAAACGTGTTAAATGTAGCCGTTGTGGAAGATATGGTCAC
AACAAGAAGTCGTGCAAATTTTCCCTTACGCAATAG
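
The CDS above can be checked against the listed protein sequence by translating it codon by codon. The following is a minimal sketch using the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers, and the sketch assumes the sequence is in frame, contains only A, C, G and T, and uses the standard nuclear code. Only the final line of the CDS is translated here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Final line of the CDS as listed on this page.
last_cds_line = "AACAAGAAGTCGTGCAAATTTTCCCTTACGCAATAG"
print(translate(last_cds_line))
```

The translated fragment ends in a stop codon and otherwise matches the last residues of the protein sequence (`NKKSCKFSLTQ`). Ambiguous bases such as `N` would raise a `KeyError` in this sketch.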
mRNA sequence
ATGAACGGGCCTATCCAACAAACTAGAAGAGTCGAGTTCGTTTTCTTTGTGAGGCCAAGGGATTCTCCACAGGGATTTGACTATGTAAATGGAGATTCACGGACA
AATGGGAACTCCTTCAGCTCAGGTTGCACTTCTATTACCGTGAGCATGTTTCATGGTGGATTTTGGAACGACAGAGATAATTACGTGGAATACAAAGTGTCAGAA
CTAATTATACATCAAGAAATGTCATATGCCAATTTGATGAAAGTTATAATGGAACAATTGAGATTGGTAGGTGGTTCTGATTATCCTGATGTCTGTTCTTGCATT
GGAACCACTCAGTTTGTTAAGAAGGACATGAAAATATCAGACGACAAGGATGTGCATTGGTTATACAACATAATTTGTAATGGTGCCGTTCAATGTTGTAGTTTA
GTTGTTGATTGTAGAAATTGTCTTCGTAACGTATTAGATCGCATGCCGATTAATACTTCAAGTTCGATAGACAATGATATACCATCAATTGGTCAATTTCATCAC
TCAATTGATGTTGCTAATAATTCAGGGAATTCCCACATTGAAGTGAATGACAAGTTTTGTGGAAAGAAAAATTTACAGAATGCTTTGCGGCAAGCAACTTTTTCT
GTTATCAAGGAGTTTATTAAGGATAGGATTAACATGTCCGGCACAGATTTGCTTAGTGTTAAAGATGTTATATCCCATGTGTGTAGAGAACTCGCTCAGAAGGTT
TTCACCACTGCTTGCCACTGCATATGCGCCTATCATCTCTTCAAGAAGTTAAAGTTGAAGTACAAGGAGAAAGTTTCTGATAAATTATTCTTCCCATGTGCCAAA
GCATACAATGTTGAGGATTTTGAACATAACATGCGCCTTTTAGATTCTACTGTACGAGGCATACGCGAGGAGTTGTCTAGCATAGGTTACGTATCCTCCTACTAT
TCGAACAACTATTTACGTTCCACGTACAGTGAAAGTATCCACCCATTAGGTCATCATTCAAGTTGGAATATCCTAGAAGATGTGAAGATTATAAAGATGCTGCCG
CCCAATGTGAAACGTCCTGCTGGTAGACCAATGAAGTTGAGGATTCCGTCAGCATTAGAATTTAAAAAACGTGTTAAATGTAGCCGTTGTGGAAGATATGGTCAC
AACAAGAAGTCGTGCAAATTTTCCCTTACGCAATAG
Protein sequence
MNGPIQQTRRVEFVFFVRPRDSPQGFDYVNGDSRTNGNSFSSGCTSITVSMFHGGFWNDRDNYVEYKVSELIIHQEMSYANLMKVIMEQLRLVGGSDYPDVCSCI
GTTQFVKKDMKISDDKDVHWLYNIICNGAVQCCSLVVDCRNCLRNVLDRMPINTSSSIDNDIPSIGQFHHSIDVANNSGNSHIEVNDKFCGKKNLQNALRQATFS
VIKEFIKDRINMSGTDLLSVKDVISHVCRELAQKVFTTACHCICAYHLFKKLKLKYKEKVSDKLFFPCAKAYNVEDFEHNMRLLDSTVRGIREELSSIGYVSSYY
SNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKKRVKCSRCGRYGHNKKSCKFSLTQ
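
The page annotates this gene with InterPro domain IPR001878 (Zinc finger, CCHC-type). The classical CCHC "zinc knuckle" follows the spacing C-X2-C-X4-H-X4-C, so a quick way to see where such a motif might sit is a regular-expression scan of the protein. This is only a rough sketch: the function name `find_cchc_knuckles` is hypothetical, and a plain pattern match is a heuristic, not the profile-based method InterPro uses to assign domains. The example scans the final line of the protein sequence shown above.

```python
import re

# Classical zinc knuckle spacing: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")


def find_cchc_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [
        (m.start() + 1, m.group())
        for m in CCHC_PATTERN.finditer(protein.upper())
    ]


# Final line of the protein sequence as listed on this page.
c_terminal = (
    "SNNYLRSTYSESIHPLGHHSSWNILEDVKIIKMLPPNVKRPAGRPMKLRIPSALEFKK"
    "RVKCSRCGRYGHNKKSCKFSLTQ"
)
for start, motif in find_cchc_knuckles(c_terminal):
    print(start, motif)
```

On this fragment the scan finds a single candidate, `CSRCGRYGHNKKSC`, near the C-terminus, which is consistent with the CCHC-type annotation.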