CuGenDBv2

PI0007343 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0007343
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Zf-CCHC domain-containing protein/UBN2 domain-containing protein
Genome location: chr11:11698717..11699984
RNA-Seq Expression: PI0007343
Synteny: PI0007343
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa] | e-value: 9.6e-99 | % identity: 63.1
Query:  RKILRSLHKTWEAKVTGNP----------RKLIGSLITHEIIMKEPLGGWVQK--------------EEEHCIKDHLL-------------GIFQNHLST
        RKIL  L KTW+AKVT              +LIGSL+THEIIM+E L    +K              +E+   +D ++               F+ +LST
Subjt:  RKILRSLHKTWEAKVTGNP----------RKLIGSLITHEIIMKEPLGGWVQK--------------EEEHCIKDHLL-------------GIFQNHLST

Query:  QKESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQN
        QK SKGEK+KKDEVI YE KK   IR DCP LKSSKKSK+KA+KATWDDSSE E  SEVEE A+LGL+  SDKEDEHDDEVTL+ PSI +LFENFEN+QN
Subjt:  QKESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQN

Query:  DLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY-------------------LDKAKET
        DLEKLSSKYVV KKKYNVL+SENKSL DKI CFKEN N  QIEELNV  DKH+ DCN+KDALLDKVRFLEHD                     LDKAKET
Subjt:  DLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY-------------------LDKAKET

Query:  IKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNM
        IKKLTIGAQRLDKIIE+GKSYGDKR LGY+DESST S SKTTFVKASPIVPK NM
Subjt:  IKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNM

TYK02592.1 zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa] | e-value: 2.2e-63 | % identity: 67.12
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAK-----VTGNPRKLIGSLITHEIIMKEPLG----GWVQKEEEHCIKDHLLGIFQNHLSTQKE
        MFTRFTNIINALK L K+YT SENVRKILRSL KTWEAK      +   + +  + I+ E+  ++ L      +  ++  + IK      F+ HLSTQKE
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAK-----VTGNPRKLIGSLITHEIIMKEPLG----GWVQKEEEHCIKDHLLGIFQNHLSTQKE

Query:  SKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLE
        SKGEKNKKDEVIYYE KK+G IR DCP LKSSKKSKKKA+KATWDDSS+ E  SEVEE+A+LGL+AHSDKEDEHDDEVTL+ PSI +LFENFE+MQNDLE
Subjt:  SKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLE

Query:  KLSSKYVVRKKKYNVLTSENKS
        KLSSK VV KKKYNVLTSENKS
Subjt:  KLSSKYVVRKKKYNVLTSENKS

TYK24240.1 UBN2 domain-containing protein [Cucumis melo var. makuwa] | e-value: 1.3e-63 | % identity: 50.9
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTG------NPR----KLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK
        +FTRFTNIINALKDL KIYT SEN RKILRSL KTWEAKV        +P+    +LIGSL+THEII+K+ L    +K++   +K          +S + 
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTG------NPR----KLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK

Query:  ESKGEKN-KKDEVIYYEYKKSGDIR-----------MDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAK
        + K E +  +D++ Y+  K    I+                LK  KKSKKKA+KATWDDSS  ESG EVE++A+LGL+AH                    
Subjt:  ESKGEKN-KKDEVIYYEYKKSGDIR-----------MDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAK

Query:  LFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY----------------
                    +KLSS+YVV KKKYNVLTSENKSL  K  CFKENENVVQIEELNV  DKHVCDCN+KDALLDKVRFL+HDG                 
Subjt:  LFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY----------------

Query:  ---LDKAKETIKKLTIGAQRLDKIIELGKSYG
           L+KAKETI+KLTI A+RLDKII +GKSYG
Subjt:  ---LDKAKETIKKLTIGAQRLDKIIELGKSYG

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia] | e-value: 2.7e-69 | % identity: 51.64
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKV----------TGNPRKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK
        MF RFTNI+NAL+ L K Y+  E V+K+L SL K WE KV          T +  +LIGSL+THEI +K+ +      E+E   KD  + +    ++ + 
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKV----------TGNPRKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK

Query:  ESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDL
        + +GE    ++ + Y  +K+     DCP+LKSSKKSKKKA+KATWDDS E  S SE EE+A+   +AHSDKEDE DDEV L   S  +LFE FENMQN+L
Subjt:  ESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDL

Query:  EKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSY
        EKL SKYV+ K K NV TSENKSL D I C K+NE+   ++ L     K     N+ DAL++          LDKAK+ IK+LTIGAQRLDKIIE GK Y
Subjt:  EKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSY

Query:  GDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMP
        GDKRGLGY++E +TPS+SKT FVKASP +PKL  P
Subjt:  GDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMP

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus] | e-value: 1.6e-141 | % identity: 67.83
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTGNP----------RKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGI---------
        MFTRFTNIINALK L K+YT SENVRKILRSL KTWEAKVT              +LIGSL+THEIIMKE L    +K++   +K   L +         
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTGNP----------RKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGI---------

Query:  --------------------FQNHLSTQKESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDK
                            F+ HLSTQKESKGEK+KKDEVI YE K+SG IR DCP+LKSSKKSKKKA+KATWDDSSE E  SEVEE+A+LGL+AHSDK
Subjt:  --------------------FQNHLSTQKESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDK

Query:  EDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDG
        +DEHDD+VTL+  SI +LFENFE+MQNDLEKLSSKYVV KKKYNVL SENKSL D I CFKENEN  QIEELNV  DKHV  C +KDALLDKVRFLEHD 
Subjt:  EDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDG

Query:  -------------------YLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMPNDVSNHIKFSFVPICHN
                            LDKAKETIKKLTIGAQRLDKIIE+GKSYGDKRGLGY+DESSTPS+SKTTFVKASPIVPK NM N VSNH+K SFVPICHN
Subjt:  -------------------YLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMPNDVSNHIKFSFVPICHN

Query:  CGVEGHIRPKCFKSKYAHTTSSRRIFSQR
        CGVEGHIRPKCFK KYA  T SRR FSQR
Subjt:  CGVEGHIRPKCFKSKYAHTTSSRRIFSQR

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein | e-value: 4.7e-99 | % identity: 63.1
Query:  RKILRSLHKTWEAKVTGNP----------RKLIGSLITHEIIMKEPLGGWVQK--------------EEEHCIKDHLL-------------GIFQNHLST
        RKIL  L KTW+AKVT              +LIGSL+THEIIM+E L    +K              +E+   +D ++               F+ +LST
Subjt:  RKILRSLHKTWEAKVTGNP----------RKLIGSLITHEIIMKEPLGGWVQK--------------EEEHCIKDHLL-------------GIFQNHLST

Query:  QKESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQN
        QK SKGEK+KKDEVI YE KK   IR DCP LKSSKKSK+KA+KATWDDSSE E  SEVEE A+LGL+  SDKEDEHDDEVTL+ PSI +LFENFEN+QN
Subjt:  QKESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQN

Query:  DLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY-------------------LDKAKET
        DLEKLSSKYVV KKKYNVL+SENKSL DKI CFKEN N  QIEELNV  DKH+ DCN+KDALLDKVRFLEHD                     LDKAKET
Subjt:  DLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY-------------------LDKAKET

Query:  IKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNM
        IKKLTIGAQRLDKIIE+GKSYGDKR LGY+DESST S SKTTFVKASPIVPK NM
Subjt:  IKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNM

A0A5D3BUV2 Zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein | e-value: 1.1e-63 | % identity: 67.12
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAK-----VTGNPRKLIGSLITHEIIMKEPLG----GWVQKEEEHCIKDHLLGIFQNHLSTQKE
        MFTRFTNIINALK L K+YT SENVRKILRSL KTWEAK      +   + +  + I+ E+  ++ L      +  ++  + IK      F+ HLSTQKE
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAK-----VTGNPRKLIGSLITHEIIMKEPLG----GWVQKEEEHCIKDHLLGIFQNHLSTQKE

Query:  SKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLE
        SKGEKNKKDEVIYYE KK+G IR DCP LKSSKKSKKKA+KATWDDSS+ E  SEVEE+A+LGL+AHSDKEDEHDDEVTL+ PSI +LFENFE+MQNDLE
Subjt:  SKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLE

Query:  KLSSKYVVRKKKYNVLTSENKS
        KLSSK VV KKKYNVLTSENKS
Subjt:  KLSSKYVVRKKKYNVLTSENKS

A0A5D3DLU8 UBN2 domain-containing protein | e-value: 6.4e-64 | % identity: 50.9
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTG------NPR----KLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK
        +FTRFTNIINALKDL KIYT SEN RKILRSL KTWEAKV        +P+    +LIGSL+THEII+K+ L    +K++   +K          +S + 
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTG------NPR----KLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK

Query:  ESKGEKN-KKDEVIYYEYKKSGDIR-----------MDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAK
        + K E +  +D++ Y+  K    I+                LK  KKSKKKA+KATWDDSS  ESG EVE++A+LGL+AH                    
Subjt:  ESKGEKN-KKDEVIYYEYKKSGDIR-----------MDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAK

Query:  LFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY----------------
                    +KLSS+YVV KKKYNVLTSENKSL  K  CFKENENVVQIEELNV  DKHVCDCN+KDALLDKVRFL+HDG                 
Subjt:  LFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGY----------------

Query:  ---LDKAKETIKKLTIGAQRLDKIIELGKSYG
           L+KAKETI+KLTI A+RLDKII +GKSYG
Subjt:  ---LDKAKETIKKLTIGAQRLDKIIELGKSYG

A0A6J1DS74 uncharacterized protein LOC111023806 | e-value: 1.3e-69 | % identity: 51.64
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKV----------TGNPRKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK
        MF RFTNI+NAL+ L K Y+  E V+K+L SL K WE KV          T +  +LIGSL+THEI +K+ +      E+E   KD  + +    ++ + 
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKV----------TGNPRKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQK

Query:  ESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDL
        + +GE    ++ + Y  +K+     DCP+LKSSKKSKKKA+KATWDDS E  S SE EE+A+   +AHSDKEDE DDEV L   S  +LFE FENMQN+L
Subjt:  ESKGEKNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDL

Query:  EKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSY
        EKL SKYV+ K K NV TSENKSL D I C K+NE+   ++ L     K     N+ DAL++          LDKAK+ IK+LTIGAQRLDKIIE GK Y
Subjt:  EKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSY

Query:  GDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMP
        GDKRGLGY++E +TPS+SKT FVKASP +PKL  P
Subjt:  GDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMP

A0A6J1DY46 uncharacterized protein LOC111025259 | e-value: 9.5e-60 | % identity: 47.45
Query:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTG-NPRKLIGSLITHEIIMKEPLG----GWVQKEEEHCIKDHLLGIFQNHLSTQKESKGE
        MF RFTNI+NAL+ L K Y+  E V+K+L SL K WE KVT     K + +L   E+I +  L      ++ ++ ++ IK      F+ + S  KE K E
Subjt:  MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTG-NPRKLIGSLITHEIIMKEPLG----GWVQKEEEHCIKDHLLGIFQNHLSTQKESKGE

Query:  KNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSS
         +KKDEVI YE KK G IR DCP LKSSKKSKKKA+KATWDDS E  + SE EE+A+   +AHSDKEDE DDE+TL   S  +LFE FENMQNDLEKL  
Subjt:  KNKKDEVIYYEYKKSGDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSS

Query:  KYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRG
                                                                           LDKAK++IKKLTIGAQRLDKIIELGK YGDKRG
Subjt:  KYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRG

Query:  LGYVDESSTPSTSKTTFVKASPIVPKLNMPNDV
        LGY+DE STPS+SK  FVKASP +PKL  P  V
Subjt:  LGYVDESSTPSTSKTTFVKASPIVPKLNMPNDV

SwissProt top hits (columns: hit, e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT4G05360.1 Zinc knuckle (CCHC-type) family protein | e-value: 6.6e-05 | % identity: 22.91
Query:  SDKEDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLE
        SD +   DD++++     A   EN++ +     K+        ++ +VLT E   L  K+V           + L    +K                  E
Subjt:  SDKEDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVCFKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLE

Query:  HDGYLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFV------KASPIVPKLNMPNDVSNHIKFS-----------------
            L++ ++ ++ L  G ++L  I+ +GK+  DK GLG+      PS S   FV       AS  V +     ++++  +                   
Subjt:  HDGYLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFV------KASPIVPKLNMPNDVSNHIKFS-----------------

Query:  -------FVPICHNCGVEGHIRPKCFK
               F P+CH+CGV GHIRP+CF+
Subjt:  -------FVPICHNCGVEGHIRPKCFK


Sequences
CDS sequence
ATGTTTACTAGATTTACTAACATTATAAATGCTTTGAAGGATCTTGATAAAATCTATACAATTTCCGAAAATGTTAGAAAAATTCTAAGGTCTCTACATAAGACTTGGGA
AGCTAAGGTGACAGGCAATCCAAGAAAGCTCATTGGCTCACTCATAACTCATGAGATCATTATGAAGGAGCCACTTGGAGGATGGGTTCAAAAAGAAGAAGAGCATTGTA
TTAAAGACCATCTCCTTGGAATATTTCAAAACCACCTATCAACCCAAAAAGAGTCAAAAGGTGAGAAAAACAAAAAGGATGAGGTGATTTATTATGAATACAAAAAGTCG
GGTGACATAAGAATGGATTGCCCTATCCTCAAGTCATCTAAAAAATCTAAGAAGAAGGCAATAAAGGCTACATGGGATGATAGTAGTGAAAGAGAAAGTGGAAGTGAAGT
TGAAGAAATAGCACACCTTGGTCTCATAGCTCATAGTGACAAAGAAGATGAACATGATGATGAGGTAACTCTAAAACGTCCTTCTATTGCTAAATTGTTTGAAAATTTTG
AAAATATGCAAAATGACCTAGAAAAACTTAGTTCTAAGTATGTTGTGCGTAAGAAGAAATACAATGTTTTAACTAGTGAAAATAAGTCTTTACACGATAAAATTGTTTGC
TTTAAAGAGAATGAAAATGTTGTGCAAATTGAAGAATTAAATGTCTATTGTGATAAGCATGTTTGTGATTGTAATGATAAAGATGCTTTGCTTGATAAAGTTAGATTTCT
TGAGCATGATGGTTATCTTGATAAAGCTAAGGAGACTATTAAAAAGCTGACAATAGGTGCTCAAAGATTGGATAAAATTATTGAATTAGGAAAGTCTTATGGTGATAAGA
GAGGTTTAGGCTATGTTGATGAATCATCTACTCCTTCAACTTCTAAAACTACATTTGTTAAAGCATCTCCTATTGTGCCTAAACTTAATATGCCTAATGATGTGTCTAAT
CATATTAAATTTAGTTTTGTACCTATATGTCATAATTGTGGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAATCGAAGTATGCTCACACTACTTCTTCAAGAAGAAT
CTTTTCACAAAGAACATAG
mRNA sequence
ATGTTTACTAGATTTACTAACATTATAAATGCTTTGAAGGATCTTGATAAAATCTATACAATTTCCGAAAATGTTAGAAAAATTCTAAGGTCTCTACATAAGACTTGGGA
AGCTAAGGTGACAGGCAATCCAAGAAAGCTCATTGGCTCACTCATAACTCATGAGATCATTATGAAGGAGCCACTTGGAGGATGGGTTCAAAAAGAAGAAGAGCATTGTA
TTAAAGACCATCTCCTTGGAATATTTCAAAACCACCTATCAACCCAAAAAGAGTCAAAAGGTGAGAAAAACAAAAAGGATGAGGTGATTTATTATGAATACAAAAAGTCG
GGTGACATAAGAATGGATTGCCCTATCCTCAAGTCATCTAAAAAATCTAAGAAGAAGGCAATAAAGGCTACATGGGATGATAGTAGTGAAAGAGAAAGTGGAAGTGAAGT
TGAAGAAATAGCACACCTTGGTCTCATAGCTCATAGTGACAAAGAAGATGAACATGATGATGAGGTAACTCTAAAACGTCCTTCTATTGCTAAATTGTTTGAAAATTTTG
AAAATATGCAAAATGACCTAGAAAAACTTAGTTCTAAGTATGTTGTGCGTAAGAAGAAATACAATGTTTTAACTAGTGAAAATAAGTCTTTACACGATAAAATTGTTTGC
TTTAAAGAGAATGAAAATGTTGTGCAAATTGAAGAATTAAATGTCTATTGTGATAAGCATGTTTGTGATTGTAATGATAAAGATGCTTTGCTTGATAAAGTTAGATTTCT
TGAGCATGATGGTTATCTTGATAAAGCTAAGGAGACTATTAAAAAGCTGACAATAGGTGCTCAAAGATTGGATAAAATTATTGAATTAGGAAAGTCTTATGGTGATAAGA
GAGGTTTAGGCTATGTTGATGAATCATCTACTCCTTCAACTTCTAAAACTACATTTGTTAAAGCATCTCCTATTGTGCCTAAACTTAATATGCCTAATGATGTGTCTAAT
CATATTAAATTTAGTTTTGTACCTATATGTCATAATTGTGGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAATCGAAGTATGCTCACACTACTTCTTCAAGAAGAAT
CTTTTCACAAAGAACATAG
Protein sequence
MFTRFTNIINALKDLDKIYTISENVRKILRSLHKTWEAKVTGNPRKLIGSLITHEIIMKEPLGGWVQKEEEHCIKDHLLGIFQNHLSTQKESKGEKNKKDEVIYYEYKKS
GDIRMDCPILKSSKKSKKKAIKATWDDSSERESGSEVEEIAHLGLIAHSDKEDEHDDEVTLKRPSIAKLFENFENMQNDLEKLSSKYVVRKKKYNVLTSENKSLHDKIVC
FKENENVVQIEELNVYCDKHVCDCNDKDALLDKVRFLEHDGYLDKAKETIKKLTIGAQRLDKIIELGKSYGDKRGLGYVDESSTPSTSKTTFVKASPIVPKLNMPNDVSN
HIKFSFVPICHNCGVEGHIRPKCFKSKYAHTTSSRRIFSQRT
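
The CDS and protein sequences above can be cross-checked against each other by translating the CDS in frame. The following is a minimal sketch, not part of the CuGenDB record: it assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Join wrapped lines and normalise case before reading codons.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTTTACTAGATTTACTAAC"))  # MFTRFTN
```

Passing the full CDS (with or without its line breaks) to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two entries agree.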