; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g10560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g10560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr5:8267729..8270214
RNA-Seq ExpressionMoc05g10560
SyntenyMoc05g10560
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004386 - helicase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]1.3e-15870.65Show/hide
Query:  QEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFV
        +EQLN DREDEDFGELPQEV GDEFEDEEDNDDI QYEV+VRTPVHE QQVDEEPPTKEQEGTSGPVDVPSEAM ESSSSSSQ                 
Subjt:  QEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFV

Query:  ANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRP
                                      GA+SRPRTR AVARLAA+KEAEAGPSKKAK ARVQR A+EPLEEANEEE DSTEQTPSRVKRVRLEVRRP
Subjt:  ANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRP

Query:  TFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVSEDLVKEFYIAINPHRGDVVRVRGKVVKFSPS------------------IINTHYG
        TFTTRDILLERGFDEAQEPVPEYVR+R+V+NGWETLFAPITRVSE LVKEFY AINP+RGD VRVRG  +   PS                   I+T   
Subjt:  TFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVSEDLVKEFYIAINPHRGDVVRVRGKVVKFSPS------------------IINTHYG

Query:  L----LDV-FNPIVWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM---------------------KSSGDSIVQEEDS
        L    LD+     VWMYVVKNRLIPTS+DSSIKRNRAM++YIL+KGVEFNFGELIRNEI+SCSEK+                     K  G SIV+EEDS
Subjt:  L----LDV-FNPIVWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM---------------------KSSGDSIVQEEDS

Query:  PITAADPETRGVVTREQYDELRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSDDE
        PITAADPETRGVVTREQYDELRHKYELLLVTQRAT AFLKKIYGDEAPSFPDELA DLPSSSRLPTDSNDDESSDDE
Subjt:  PITAADPETRGVVTREQYDELRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]3.0e-6768.02Show/hide
Query:  VHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVAR
        +HE QQ DEE   +EQEG SG VDVP+EA+ ESSSSSS+GK+PSLSSLNVSDPNFVA A TS+E+V LTKVVKK + KK + EI  GA SRP TRA +A 
Subjt:  VHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVAR

Query:  LAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVS
        LAA+KEAEAGP KKAKR +  R ++EPL+E N+EE DS EQTPS+ KRVR EV+R  FT R+IL+E+GFDEAQEPVP+Y++RRL++NGWETLFAP  RVS
Subjt:  LAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVS

Query:  EDLVKEFYIAINPHRGDVVRVR
        E LVKEFY  INP+RGD +  R
Subjt:  EDLVKEFYIAINPHRGDVVRVR

XP_022156935.1 uncharacterized protein LOC111023761 [Momordica charantia]6.5e-3852.68Show/hide
Query:  SGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGEDPLEDDGNSGAAQEQL
        SGDSEHD EPLEHSDSATV+I+CQIAP  IM ETPP TLQ                                E+LVALNEARGEDPL+DDGNSG      
Subjt:  SGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGEDPLEDDGNSGAAQEQL

Query:  NADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAE
                                                     Q DEEP  +EQEGTSGP+DV SEAM ESSSS SQ KT SLSSLNVSDPNFVA AE
Subjt:  NADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAE

Query:  TSDEEVRLTKVVKKTQKKKKVAEI
         SDEEV L KVVKKTQKKKKVAEI
Subjt:  TSDEEVRLTKVVKKTQKKKKVAEI

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]2.5e-4536.4Show/hide
Query:  MEGSSSTKPHDKEKETKRVLLPPPTKPGMIPLELPRISHEKLVFDPREQRRKYEEAIRMNPRRNQSLGGTNSEKINMESKDARVNKEGHSEKKLGGVNKV
        MEGSS +KP DKE E K+V+LPPP  P                                                  E   ARVN+ G+SEKKL G +KV
Subjt:  MEGSSSTKPHDKEKETKRVLLPPPTKPGMIPLELPRISHEKLVFDPREQRRKYEEAIRMNPRRNQSLGGTNSEKINMESKDARVNKEGHSEKKLGGVNKV

Query:  YLRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENDRVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNFSQANPISESLDLSIPSP
        YLRKNQS+ +K + LDE IAR+ E+ ++ +K  EI DK+N+ + AKI ELN KWQ FMENS+++SEEIQ+ELN                           
Subjt:  YLRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENDRVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNFSQANPISESLDLSIPSP

Query:  LSTTVTVHVEGQQQGSGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGED
                                                                                     EQERTTSKI +ILVALNEA GED
Subjt:  LSTTVTVHVEGQQQGSGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGED

Query:  PLEDDGNSGAAQEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSL
        PLEDDGNS  AQ +LN D EDED G+LPQEV GDE E+EE+NDDI QYEVR+   VHE Q+   E P +  EG S PVDVP+EA  +SSSSSS  K  S 
Subjt:  PLEDDGNSGAAQEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSL

Query:  SSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRG
          +N  +P       +++++    K V +   K+  A I        R R     + A  E    P K A   R  RG
Subjt:  SSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRG

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]1.4e-3556.35Show/hide
Query:  VWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM-----------------------------KSSGDS--------IVQE
        +W YVVKN LI TS+DSSI++ R M++YILMKG+EFNF ELIRNEI  C+EKM                             K S  S        IV+E
Subjt:  VWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM-----------------------------KSSGDS--------IVQE

Query:  EDSPITAADPETRGVVTREQYDE---LRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSD
        EDSPITAADP+TRGVVTREQYDE   LRH Y+LL  TQ AT  FLKK+YGD APS PDELA DLPSSSR PT   DD   D
Subjt:  EDSPITAADPETRGVVTREQYDE---LRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSD

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220076.1e-15970.65Show/hide
Query:  QEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFV
        +EQLN DREDEDFGELPQEV GDEFEDEEDNDDI QYEV+VRTPVHE QQVDEEPPTKEQEGTSGPVDVPSEAM ESSSSSSQ                 
Subjt:  QEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFV

Query:  ANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRP
                                      GA+SRPRTR AVARLAA+KEAEAGPSKKAK ARVQR A+EPLEEANEEE DSTEQTPSRVKRVRLEVRRP
Subjt:  ANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRP

Query:  TFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVSEDLVKEFYIAINPHRGDVVRVRGKVVKFSPS------------------IINTHYG
        TFTTRDILLERGFDEAQEPVPEYVR+R+V+NGWETLFAPITRVSE LVKEFY AINP+RGD VRVRG  +   PS                   I+T   
Subjt:  TFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVSEDLVKEFYIAINPHRGDVVRVRGKVVKFSPS------------------IINTHYG

Query:  L----LDV-FNPIVWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM---------------------KSSGDSIVQEEDS
        L    LD+     VWMYVVKNRLIPTS+DSSIKRNRAM++YIL+KGVEFNFGELIRNEI+SCSEK+                     K  G SIV+EEDS
Subjt:  L----LDV-FNPIVWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM---------------------KSSGDSIVQEEDS

Query:  PITAADPETRGVVTREQYDELRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSDDE
        PITAADPETRGVVTREQYDELRHKYELLLVTQRAT AFLKKIYGDEAPSFPDELA DLPSSSRLPTDSNDDESSDDE
Subjt:  PITAADPETRGVVTREQYDELRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSDDE

A0A6J1DRR9 uncharacterized protein LOC1110237613.2e-3852.68Show/hide
Query:  SGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGEDPLEDDGNSGAAQEQL
        SGDSEHD EPLEHSDSATV+I+CQIAP  IM ETPP TLQ                                E+LVALNEARGEDPL+DDGNSG      
Subjt:  SGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGEDPLEDDGNSGAAQEQL

Query:  NADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAE
                                                     Q DEEP  +EQEGTSGP+DV SEAM ESSSS SQ KT SLSSLNVSDPNFVA AE
Subjt:  NADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAE

Query:  TSDEEVRLTKVVKKTQKKKKVAEI
         SDEEV L KVVKKTQKKKKVAEI
Subjt:  TSDEEVRLTKVVKKTQKKKKVAEI

A0A6J1DW11 uncharacterized protein LOC1110236201.5e-6768.02Show/hide
Query:  VHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVAR
        +HE QQ DEE   +EQEG SG VDVP+EA+ ESSSSSS+GK+PSLSSLNVSDPNFVA A TS+E+V LTKVVKK + KK + EI  GA SRP TRA +A 
Subjt:  VHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVAR

Query:  LAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVS
        LAA+KEAEAGP KKAKR +  R ++EPL+E N+EE DS EQTPS+ KRVR EV+R  FT R+IL+E+GFDEAQEPVP+Y++RRL++NGWETLFAP  RVS
Subjt:  LAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAPITRVS

Query:  EDLVKEFYIAINPHRGDVVRVR
        E LVKEFY  INP+RGD +  R
Subjt:  EDLVKEFYIAINPHRGDVVRVR

A0A6J1DW79 uncharacterized protein LOC1110249641.2e-4536.4Show/hide
Query:  MEGSSSTKPHDKEKETKRVLLPPPTKPGMIPLELPRISHEKLVFDPREQRRKYEEAIRMNPRRNQSLGGTNSEKINMESKDARVNKEGHSEKKLGGVNKV
        MEGSS +KP DKE E K+V+LPPP  P                                                  E   ARVN+ G+SEKKL G +KV
Subjt:  MEGSSSTKPHDKEKETKRVLLPPPTKPGMIPLELPRISHEKLVFDPREQRRKYEEAIRMNPRRNQSLGGTNSEKINMESKDARVNKEGHSEKKLGGVNKV

Query:  YLRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENDRVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNFSQANPISESLDLSIPSP
        YLRKNQS+ +K + LDE IAR+ E+ ++ +K  EI DK+N+ + AKI ELN KWQ FMENS+++SEEIQ+ELN                           
Subjt:  YLRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENDRVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNFSQANPISESLDLSIPSP

Query:  LSTTVTVHVEGQQQGSGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGED
                                                                                     EQERTTSKI +ILVALNEA GED
Subjt:  LSTTVTVHVEGQQQGSGDSEHDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGED

Query:  PLEDDGNSGAAQEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSL
        PLEDDGNS  AQ +LN D EDED G+LPQEV GDE E+EE+NDDI QYEVR+   VHE Q+   E P +  EG S PVDVP+EA  +SSSSSS  K  S 
Subjt:  PLEDDGNSGAAQEQLNADREDEDFGELPQEVPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSL

Query:  SSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRG
          +N  +P       +++++    K V +   K+  A I        R R     + A  E    P K A   R  RG
Subjt:  SSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIALGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRG

A0A6J1E204 uncharacterized protein LOC1110257026.6e-3656.35Show/hide
Query:  VWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM-----------------------------KSSGDS--------IVQE
        +W YVVKN LI TS+DSSI++ R M++YILMKG+EFNF ELIRNEI  C+EKM                             K S  S        IV+E
Subjt:  VWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKM-----------------------------KSSGDS--------IVQE

Query:  EDSPITAADPETRGVVTREQYDE---LRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSD
        EDSPITAADP+TRGVVTREQYDE   LRH Y+LL  TQ AT  FLKK+YGD APS PDELA DLPSSSR PT   DD   D
Subjt:  EDSPITAADPETRGVVTREQYDE---LRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTTCATCTTCCACCAAGCCACACGACAAAGAGAAGGAAACGAAAAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACTTCCTAGGAT
TTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAGGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGGTGGTACAAATTCTGAAA
AAATTAATATGGAATCTAAGGATGCTAGGGTTAATAAAGAGGGTCATAGTGAGAAGAAATTAGGAGGAGTTAATAAAGTCTATCTTCGAAAAAATCAATCTTTAGAGGAA
AAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTCAAGAGAGGGCAGAGATGTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGATAGGGTTTACGCGAAAAT
TGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATTTTT
CTCAAGCTAACCCCATTTCCGAGTCTTTAGATTTGTCTATCCCTTCCCCTCTTTCCACTACTGTTACTGTGCATGTTGAAGGTCAACAACAGGGTAGTGGGGATTCAGAA
CACGATACGGAGCCCTTGGAGCACTCAGATTCGGCCACTGTCGAAATTCAATGCCAAATTGCGCCTGACGCAATTATGGATGAGACTCCACCGACCACTTTACAAGGTAT
TTTGTCTCCATCTTTTCCAGATGCTATCTTGACTAAAAAGCCCATAGTTTTTGATGATTTAGAACAGGAAAGGACAACGTCGAAAATTGCCGAAATTTTGGTGGCGTTGA
ATGAAGCACGGGGAGAGGATCCATTGGAGGATGATGGAAACAGTGGGGCAGCACAAGAACAATTGAATGCTGATAGAGAAGATGAAGATTTTGGAGAATTACCCCAAGAA
GTGCCTGGAGACGAGTTTGAGGACGAGGAAGACAATGACGATATCTTCCAATATGAAGTGAGAGTACGAACTCCGGTGCACGAATTTCAGCAAGTTGATGAGGAGCCCCC
TACTAAAGAACAAGAAGGAACATCCGGTCCTGTGGATGTCCCTAGCGAGGCCATGGCGGAATCATCTTCATCTTCTTCACAAGGTAAGACCCCTTCTTTGTCGAGTTTGA
ATGTTTCTGACCCAAACTTTGTTGCTAATGCAGAGACTTCAGATGAGGAGGTGAGATTGACCAAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCAGAAATTGCG
CTAGGCGCAATTTCTAGGCCCAGGACCCGAGCCGCTGTAGCACGTTTGGCTGCTAAAAAAGAAGCCGAGGCCGGTCCATCTAAAAAAGCCAAGAGGGCTAGGGTGCAAAG
AGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGAGACGGATTCTACCGAGCAGACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGAAGGCCCACCTTCA
CAACACGTGATATCCTCCTTGAGAGAGGTTTTGATGAGGCCCAAGAGCCGGTGCCAGAATATGTTAGGAGGAGGCTTGTGAAGAATGGTTGGGAGACATTGTTTGCCCCC
ATTACACGTGTATCAGAGGACTTGGTGAAAGAGTTTTACATTGCCATCAACCCACACCGAGGGGATGTAGTGAGAGTACGGGGTAAAGTGGTAAAATTCTCGCCTTCCAT
TATTAATACTCACTATGGTTTGTTGGATGTCTTTAATCCCATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATA
GGGCGATGATGATGTATATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCGGAGAAAATGAAGAGTTCGGGGGATTCC
ATTGTTCAAGAGGAAGATTCTCCCATTACCGCTGCGGATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGACGAGCTTAGGCACAAGTATGAGCTTTTGTTGGT
TACTCAACGTGCCACATCTGCTTTCCTCAAGAAGATATACGGTGATGAAGCACCTTCTTTCCCCGATGAACTGGCGGTCGATTTACCATCTTCTTCCCGTCTTCCTACCG
ATTCCAACGATGATGAGTCTTCCGATGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTTCATCTTCCACCAAGCCACACGACAAAGAGAAGGAAACGAAAAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACTTCCTAGGAT
TTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAGGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGGTGGTACAAATTCTGAAA
AAATTAATATGGAATCTAAGGATGCTAGGGTTAATAAAGAGGGTCATAGTGAGAAGAAATTAGGAGGAGTTAATAAAGTCTATCTTCGAAAAAATCAATCTTTAGAGGAA
AAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTCAAGAGAGGGCAGAGATGTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGATAGGGTTTACGCGAAAAT
TGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATTTTT
CTCAAGCTAACCCCATTTCCGAGTCTTTAGATTTGTCTATCCCTTCCCCTCTTTCCACTACTGTTACTGTGCATGTTGAAGGTCAACAACAGGGTAGTGGGGATTCAGAA
CACGATACGGAGCCCTTGGAGCACTCAGATTCGGCCACTGTCGAAATTCAATGCCAAATTGCGCCTGACGCAATTATGGATGAGACTCCACCGACCACTTTACAAGGTAT
TTTGTCTCCATCTTTTCCAGATGCTATCTTGACTAAAAAGCCCATAGTTTTTGATGATTTAGAACAGGAAAGGACAACGTCGAAAATTGCCGAAATTTTGGTGGCGTTGA
ATGAAGCACGGGGAGAGGATCCATTGGAGGATGATGGAAACAGTGGGGCAGCACAAGAACAATTGAATGCTGATAGAGAAGATGAAGATTTTGGAGAATTACCCCAAGAA
GTGCCTGGAGACGAGTTTGAGGACGAGGAAGACAATGACGATATCTTCCAATATGAAGTGAGAGTACGAACTCCGGTGCACGAATTTCAGCAAGTTGATGAGGAGCCCCC
TACTAAAGAACAAGAAGGAACATCCGGTCCTGTGGATGTCCCTAGCGAGGCCATGGCGGAATCATCTTCATCTTCTTCACAAGGTAAGACCCCTTCTTTGTCGAGTTTGA
ATGTTTCTGACCCAAACTTTGTTGCTAATGCAGAGACTTCAGATGAGGAGGTGAGATTGACCAAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCAGAAATTGCG
CTAGGCGCAATTTCTAGGCCCAGGACCCGAGCCGCTGTAGCACGTTTGGCTGCTAAAAAAGAAGCCGAGGCCGGTCCATCTAAAAAAGCCAAGAGGGCTAGGGTGCAAAG
AGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGAGACGGATTCTACCGAGCAGACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGAAGGCCCACCTTCA
CAACACGTGATATCCTCCTTGAGAGAGGTTTTGATGAGGCCCAAGAGCCGGTGCCAGAATATGTTAGGAGGAGGCTTGTGAAGAATGGTTGGGAGACATTGTTTGCCCCC
ATTACACGTGTATCAGAGGACTTGGTGAAAGAGTTTTACATTGCCATCAACCCACACCGAGGGGATGTAGTGAGAGTACGGGGTAAAGTGGTAAAATTCTCGCCTTCCAT
TATTAATACTCACTATGGTTTGTTGGATGTCTTTAATCCCATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATA
GGGCGATGATGATGTATATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCGGAGAAAATGAAGAGTTCGGGGGATTCC
ATTGTTCAAGAGGAAGATTCTCCCATTACCGCTGCGGATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGACGAGCTTAGGCACAAGTATGAGCTTTTGTTGGT
TACTCAACGTGCCACATCTGCTTTCCTCAAGAAGATATACGGTGATGAAGCACCTTCTTTCCCCGATGAACTGGCGGTCGATTTACCATCTTCTTCCCGTCTTCCTACCG
ATTCCAACGATGATGAGTCTTCCGATGATGAATAG
Protein sequenceShow/hide protein sequence
MEGSSSTKPHDKEKETKRVLLPPPTKPGMIPLELPRISHEKLVFDPREQRRKYEEAIRMNPRRNQSLGGTNSEKINMESKDARVNKEGHSEKKLGGVNKVYLRKNQSLEE
KGAVLDEEIARLQERAEMFSKNNEIRDKENDRVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNFSQANPISESLDLSIPSPLSTTVTVHVEGQQQGSGDSE
HDTEPLEHSDSATVEIQCQIAPDAIMDETPPTTLQGILSPSFPDAILTKKPIVFDDLEQERTTSKIAEILVALNEARGEDPLEDDGNSGAAQEQLNADREDEDFGELPQE
VPGDEFEDEEDNDDIFQYEVRVRTPVHEFQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAETSDEEVRLTKVVKKTQKKKKVAEIA
LGAISRPRTRAAVARLAAKKEAEAGPSKKAKRARVQRGAKEPLEEANEEETDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFDEAQEPVPEYVRRRLVKNGWETLFAP
ITRVSEDLVKEFYIAINPHRGDVVRVRGKVVKFSPSIINTHYGLLDVFNPIVWMYVVKNRLIPTSHDSSIKRNRAMMMYILMKGVEFNFGELIRNEIRSCSEKMKSSGDS
IVQEEDSPITAADPETRGVVTREQYDELRHKYELLLVTQRATSAFLKKIYGDEAPSFPDELAVDLPSSSRLPTDSNDDESSDDE