CuGenDBv2

Moc11g17430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g17430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein SPIRRIG
Genome location: chr11:13389094..13390701
RNA-Seq Expression: Moc11g17430
Synteny: Moc11g17430
Gene Ontology terms: GO:0005737 - cytoplasm (cellular component)
InterPro domains: NA
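
The genome location field uses a `chromosome:start..end` notation. The following is a minimal sketch of parsing it and computing the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function names `parse_location` and `span_length` are hypothetical helpers rather than part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:13389094..13390701")
print(chrom, span_length(start, end))
```

Under that assumption the locus spans 1608 bp on chr11.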


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia] (e value 2.5e-106, 65.78% identity)
Query:  KQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTEEIMREKVPPKFKLPTVPGILDYIKRVGQNMVPTIKTR
        +QHKSPAPNGG+S+  +R+SE ISLDKGKP DRPESSEKRHNQK+KGFDLEELLDQ +SPFTEEIMREK                               
Subjt:  KQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTEEIMREKVPPKFKLPTVPGILDYIKRVGQNMVPTIKTR

Query:  VNLELQELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVRD---------------------------------------GKRTDQKRERSGDKPQG
                 +S+ D V+RFN++KLQVEGLTDAVSLLAF+ GVRD                                       GKRTDQKRERSGDKPQG
Subjt:  VNLELQELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVRD---------------------------------------GKRTDQKRERSGDKPQG

Query:  SRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQ
        SRWEKRDRS QKDPP+KFEKYT TTVPLEQVLM+IK+QRLLKWPERM A STKRSKGRYCLFH DHGHATQDCFDLK+EVEGLI +GYLKEYVE+ KATQ
Subjt:  SRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQ

Query:  KGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR
         GESDKSPAREIRTIMGGPIERESGRKRK DVREARA+R
Subjt:  KGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR

XP_022145129.1 uncharacterized protein LOC111014646 [Momordica charantia] (e value 3.9e-99, 87.39% identity)
Query:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK
        MTP RS RRSDDNCSAKRRLNLDDPQVGGPEDGTSQ N ERQEGLPE HALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQ+ERQESDEVPLVRDPRKGK
Subjt:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK

Query:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE
        GPAQSETEESTN+ GSKLRIGGNTRRRTQIFDSQK++KQHK  APNGG S+ ++RNSE +SLDKGKP D+PESSEKRHNQKEKGFDLEELLDQ +SPFTE
Subjt:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE

Query:  EIMREKVPPKFKLPTVPGILDY
        EIMREKVPPKFKLPTV    D+
Subjt:  EIMREKVPPKFKLPTVPGILDY

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] (e value 1.8e-168, 63.93% identity)
Query:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK
        MTP RS RRSDD+CSAKRRLNL DPQVGGPEDGTSQ N ERQEGL E  ALTTPEP QKQFAVLEDK                  ESDEVPLVRDP+KGK
Subjt:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK

Query:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE
        GP +S+TEESTN+VGSKLRIGGNTR+RT+IFD +K +KQHKSPAPNGG+S+ ++RNSE ISLDKGKP DRPESSEKRH+ KEKGFDLEELLDQ +SPFTE
Subjt:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE

Query:  EIMREKVPPKFKLPTVP----------------------GILD-----------------YIKRVGQNMVPTIKTRVNLEL-------------------
        EIMREKVPPKFKLPTV                       G+ +                 + +++ +  + + K+     +                   
Subjt:  EIMREKVPPKFKLPTVP----------------------GILD-----------------YIKRVGQNMVPTIKTRVNLEL-------------------

Query:  -QELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR---------------------------------------DGKRTDQKRERSGDKPQGSRWE
         Q   +S+RD V+RFNE+KLQVEGLTDAVSLLAFMSGVR                                       DGKRTD KRERSGDKPQGSRWE
Subjt:  -QELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR---------------------------------------DGKRTDQKRERSGDKPQGSRWE

Query:  KRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGES
        KRDRSSQKDPPRKFEKYTPTTVP+EQVLM+IKDQRLLKWPERM+A S KRSKGRYCLFHRDHGHATQDCFDLK+EVEGLIR+GYLKEYVE+ KATQ GES
Subjt:  KRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGES

Query:  DKSPAREIRTIMGGPIERESGRKRKADVREARANR
        DKSPAREIRTIMGGPIERESGRKRKADVREAR +R
Subjt:  DKSPAREIRTIMGGPIERESGRKRKADVREARANR

XP_022159192.1 uncharacterized protein LOC111025612 [Momordica charantia] (e value 6.7e-59, 80.13% identity)
Query:  DGKRTDQKRERSGDKPQGSRWEKRDR-SSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVE
        +GKR DQKRERSG+KP  S+WEK+DR   QKD PRKFEKYTPTTVPLEQVLM+IKDQRLLKWPE M+AP  KRSKGRYCLFHRDHGHAT DCFDLK+EVE
Subjt:  DGKRTDQKRERSGDKPQGSRWEKRDR-SSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVE

Query:  GLIRKGYLKEYVEDSKATQKGESD-KSPAREIRTIMGGPIERESGRKRKADVREAR
        GLIRKGYLKEYV+D KAT   E+D KSP REIRTIMGG  E+ES RKRKA VREAR
Subjt:  GLIRKGYLKEYVEDSKATQKGESD-KSPAREIRTIMGGPIERESGRKRKADVREAR

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia] (e value 4.9e-86, 64.01% identity)
Query:  VPGILDYIKRVGQNMVPTIKTRVNLELQELG----------------------------QSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR-------
        VPGILDYIK VGQNMV TIK       + L                             +S+ D V+RFN++KLQ+E LTD VSLLAFMSGVR       
Subjt:  VPGILDYIKRVGQNMVPTIKTRVNLELQELG----------------------------QSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR-------

Query:  --------------------------------DGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAP
                                        DGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLM+IKDQRLLKWPERM+ P
Subjt:  --------------------------------DGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAP

Query:  STKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR
        STKRSKGRYCLFHRDH HATQD FDLK+EVEGLIR+GYL+EYVE+ KATQ GES+KSPAREIRTIMGGPIERES RKRKADVREAR +R
Subjt:  STKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CNT2 uncharacterized protein LOC111012805 (e value 1.2e-106, 65.78% identity)
Query:  KQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTEEIMREKVPPKFKLPTVPGILDYIKRVGQNMVPTIKTR
        +QHKSPAPNGG+S+  +R+SE ISLDKGKP DRPESSEKRHNQK+KGFDLEELLDQ +SPFTEEIMREK                               
Subjt:  KQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTEEIMREKVPPKFKLPTVPGILDYIKRVGQNMVPTIKTR

Query:  VNLELQELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVRD---------------------------------------GKRTDQKRERSGDKPQG
                 +S+ D V+RFN++KLQVEGLTDAVSLLAF+ GVRD                                       GKRTDQKRERSGDKPQG
Subjt:  VNLELQELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVRD---------------------------------------GKRTDQKRERSGDKPQG

Query:  SRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQ
        SRWEKRDRS QKDPP+KFEKYT TTVPLEQVLM+IK+QRLLKWPERM A STKRSKGRYCLFH DHGHATQDCFDLK+EVEGLI +GYLKEYVE+ KATQ
Subjt:  SRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQ

Query:  KGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR
         GESDKSPAREIRTIMGGPIERESGRKRK DVREARA+R
Subjt:  KGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR

A0A6J1CUA7 uncharacterized protein LOC111014646 (e value 1.9e-99, 87.39% identity)
Query:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK
        MTP RS RRSDDNCSAKRRLNLDDPQVGGPEDGTSQ N ERQEGLPE HALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQ+ERQESDEVPLVRDPRKGK
Subjt:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK

Query:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE
        GPAQSETEESTN+ GSKLRIGGNTRRRTQIFDSQK++KQHK  APNGG S+ ++RNSE +SLDKGKP D+PESSEKRHNQKEKGFDLEELLDQ +SPFTE
Subjt:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE

Query:  EIMREKVPPKFKLPTVPGILDY
        EIMREKVPPKFKLPTV    D+
Subjt:  EIMREKVPPKFKLPTVPGILDY

A0A6J1DWY0 uncharacterized protein LOC111025293 (e value 8.6e-169, 63.93% identity)
Query:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK
        MTP RS RRSDD+CSAKRRLNL DPQVGGPEDGTSQ N ERQEGL E  ALTTPEP QKQFAVLEDK                  ESDEVPLVRDP+KGK
Subjt:  MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGK

Query:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE
        GP +S+TEESTN+VGSKLRIGGNTR+RT+IFD +K +KQHKSPAPNGG+S+ ++RNSE ISLDKGKP DRPESSEKRH+ KEKGFDLEELLDQ +SPFTE
Subjt:  GPAQSETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTE

Query:  EIMREKVPPKFKLPTVP----------------------GILD-----------------YIKRVGQNMVPTIKTRVNLEL-------------------
        EIMREKVPPKFKLPTV                       G+ +                 + +++ +  + + K+     +                   
Subjt:  EIMREKVPPKFKLPTVP----------------------GILD-----------------YIKRVGQNMVPTIKTRVNLEL-------------------

Query:  -QELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR---------------------------------------DGKRTDQKRERSGDKPQGSRWE
         Q   +S+RD V+RFNE+KLQVEGLTDAVSLLAFMSGVR                                       DGKRTD KRERSGDKPQGSRWE
Subjt:  -QELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR---------------------------------------DGKRTDQKRERSGDKPQGSRWE

Query:  KRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGES
        KRDRSSQKDPPRKFEKYTPTTVP+EQVLM+IKDQRLLKWPERM+A S KRSKGRYCLFHRDHGHATQDCFDLK+EVEGLIR+GYLKEYVE+ KATQ GES
Subjt:  KRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGES

Query:  DKSPAREIRTIMGGPIERESGRKRKADVREARANR
        DKSPAREIRTIMGGPIERESGRKRKADVREAR +R
Subjt:  DKSPAREIRTIMGGPIERESGRKRKADVREARANR

A0A6J1DYL6 uncharacterized protein LOC111025785 (e value 2.4e-86, 64.01% identity)
Query:  VPGILDYIKRVGQNMVPTIKTRVNLELQELG----------------------------QSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR-------
        VPGILDYIK VGQNMV TIK       + L                             +S+ D V+RFN++KLQ+E LTD VSLLAFMSGVR       
Subjt:  VPGILDYIKRVGQNMVPTIKTRVNLELQELG----------------------------QSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVR-------

Query:  --------------------------------DGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAP
                                        DGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLM+IKDQRLLKWPERM+ P
Subjt:  --------------------------------DGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAP

Query:  STKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR
        STKRSKGRYCLFHRDH HATQD FDLK+EVEGLIR+GYL+EYVE+ KATQ GES+KSPAREIRTIMGGPIERES RKRKADVREAR +R
Subjt:  STKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGESDKSPAREIRTIMGGPIERESGRKRKADVREARANR

A0A6J1DZ52 uncharacterized protein LOC111025612 (e value 3.3e-59, 80.13% identity)
Query:  DGKRTDQKRERSGDKPQGSRWEKRDR-SSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVE
        +GKR DQKRERSG+KP  S+WEK+DR   QKD PRKFEKYTPTTVPLEQVLM+IKDQRLLKWPE M+AP  KRSKGRYCLFHRDHGHAT DCFDLK+EVE
Subjt:  DGKRTDQKRERSGDKPQGSRWEKRDR-SSQKDPPRKFEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVE

Query:  GLIRKGYLKEYVEDSKATQKGESD-KSPAREIRTIMGGPIERESGRKRKADVREAR
        GLIRKGYLKEYV+D KAT   E+D KSP REIRTIMGG  E+ES RKRKA VREAR
Subjt:  GLIRKGYLKEYVEDSKATQKGESD-KSPAREIRTIMGGPIERESGRKRKADVREAR

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGACACCAGGAAGGAGTCAACGACGCTCTGATGATAACTGCTCTGCCAAGAGGAGGCTGAACTTGGACGACCCCCAGGTTGGGGGACCCGAGGATGGGACCAGC
CAGTCAAACCCGGAACGTCAGGAGGGGTTGCCCGAGGTGCATGCACTAACGACCCCTGAGCCACTCCAAAAGCAGTTTGCGGTCTTGGAAGACAAGGTAGAGGGC
ATGCTGCAACGCATGACCCAAGTCCTTCGACAATTCGAGCGACAAGAGTCCGACGAGGTACCCCTTGTCAGAGACCCGAGAAAGGGGAAGGGCCCAGCGCAAAGC
GAGACTGAGGAATCAACAAACAATGTAGGGAGCAAGTTGCGGATAGGTGGAAATACCAGGCGACGAACCCAGATTTTCGACTCCCAAAAGATAAGAAAGCAACAT
AAATCGCCGGCACCAAACGGGGGTGAGAGCAACCAGAATAACAGAAACTCTGAGTCAATAAGTCTCGACAAAGGCAAACCAACAGATCGGCCAGAGTCTTCGGAG
AAGCGACATAACCAAAAGGAGAAGGGATTCGACCTCGAAGAACTACTGGATCAAGTCAACTCACCATTCACGGAGGAGATCATGAGAGAGAAGGTCCCTCCAAAA
TTCAAGCTACCCACGGTGCCGGGTATTCTCGACTACATTAAACGGGTCGGCCAGAATATGGTTCCGACAATTAAAACGAGGGTCAATCTCGAGTTGCAAGAGCTT
GGCCAGAGCATTCGTGACCCAGTTTCTCGATTCAACGAGAAGAAGCTGCAGGTAGAAGGCCTTACAGACGCTGTATCTCTACTGGCCTTCATGTCCGGCGTCAGG
GACGGAAAGCGAACCGACCAAAAGAGGGAGAGGTCGGGAGATAAACCGCAAGGGTCGAGATGGGAGAAGAGGGATCGGAGTAGCCAGAAAGATCCACCCCGAAAA
TTTGAAAAGTATACCCCGACCACCGTTCCACTCGAGCAAGTGCTGATGGATATCAAAGACCAAAGGTTGCTTAAGTGGCCGGAGAGGATGGAGGCCCCGTCAACT
AAACGAAGTAAAGGCCGATATTGCCTTTTCCACCGGGATCACGGCCATGCAACTCAGGATTGTTTTGATCTCAAGAAAGAGGTGGAAGGACTAATCCGAAAGGGC
TACCTCAAAGAGTATGTAGAGGACTCTAAAGCGACACAAAAGGGCGAAAGCGACAAGTCTCCTGCTCGAGAGATTCGAACTATAATGGGAGGCCCCATAGAAAGA
GAATCTGGGAGAAAAAGAAAAGCAGATGTGCGAGAAGCTAGGGCGAACCGCTAA
mRNA sequence
ATGACACCAGGAAGGAGTCAACGACGCTCTGATGATAACTGCTCTGCCAAGAGGAGGCTGAACTTGGACGACCCCCAGGTTGGGGGACCCGAGGATGGGACCAGC
CAGTCAAACCCGGAACGTCAGGAGGGGTTGCCCGAGGTGCATGCACTAACGACCCCTGAGCCACTCCAAAAGCAGTTTGCGGTCTTGGAAGACAAGGTAGAGGGC
ATGCTGCAACGCATGACCCAAGTCCTTCGACAATTCGAGCGACAAGAGTCCGACGAGGTACCCCTTGTCAGAGACCCGAGAAAGGGGAAGGGCCCAGCGCAAAGC
GAGACTGAGGAATCAACAAACAATGTAGGGAGCAAGTTGCGGATAGGTGGAAATACCAGGCGACGAACCCAGATTTTCGACTCCCAAAAGATAAGAAAGCAACAT
AAATCGCCGGCACCAAACGGGGGTGAGAGCAACCAGAATAACAGAAACTCTGAGTCAATAAGTCTCGACAAAGGCAAACCAACAGATCGGCCAGAGTCTTCGGAG
AAGCGACATAACCAAAAGGAGAAGGGATTCGACCTCGAAGAACTACTGGATCAAGTCAACTCACCATTCACGGAGGAGATCATGAGAGAGAAGGTCCCTCCAAAA
TTCAAGCTACCCACGGTGCCGGGTATTCTCGACTACATTAAACGGGTCGGCCAGAATATGGTTCCGACAATTAAAACGAGGGTCAATCTCGAGTTGCAAGAGCTT
GGCCAGAGCATTCGTGACCCAGTTTCTCGATTCAACGAGAAGAAGCTGCAGGTAGAAGGCCTTACAGACGCTGTATCTCTACTGGCCTTCATGTCCGGCGTCAGG
GACGGAAAGCGAACCGACCAAAAGAGGGAGAGGTCGGGAGATAAACCGCAAGGGTCGAGATGGGAGAAGAGGGATCGGAGTAGCCAGAAAGATCCACCCCGAAAA
TTTGAAAAGTATACCCCGACCACCGTTCCACTCGAGCAAGTGCTGATGGATATCAAAGACCAAAGGTTGCTTAAGTGGCCGGAGAGGATGGAGGCCCCGTCAACT
AAACGAAGTAAAGGCCGATATTGCCTTTTCCACCGGGATCACGGCCATGCAACTCAGGATTGTTTTGATCTCAAGAAAGAGGTGGAAGGACTAATCCGAAAGGGC
TACCTCAAAGAGTATGTAGAGGACTCTAAAGCGACACAAAAGGGCGAAAGCGACAAGTCTCCTGCTCGAGAGATTCGAACTATAATGGGAGGCCCCATAGAAAGA
GAATCTGGGAGAAAAAGAAAAGCAGATGTGCGAGAAGCTAGGGCGAACCGCTAA
Protein sequence
MTPGRSQRRSDDNCSAKRRLNLDDPQVGGPEDGTSQSNPERQEGLPEVHALTTPEPLQKQFAVLEDKVEGMLQRMTQVLRQFERQESDEVPLVRDPRKGKGPAQS
ETEESTNNVGSKLRIGGNTRRRTQIFDSQKIRKQHKSPAPNGGESNQNNRNSESISLDKGKPTDRPESSEKRHNQKEKGFDLEELLDQVNSPFTEEIMREKVPPK
FKLPTVPGILDYIKRVGQNMVPTIKTRVNLELQELGQSIRDPVSRFNEKKLQVEGLTDAVSLLAFMSGVRDGKRTDQKRERSGDKPQGSRWEKRDRSSQKDPPRK
FEKYTPTTVPLEQVLMDIKDQRLLKWPERMEAPSTKRSKGRYCLFHRDHGHATQDCFDLKKEVEGLIRKGYLKEYVEDSKATQKGESDKSPAREIRTIMGGPIER
ESGRKRKADVREARANR
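
The protein sequence above corresponds to the CDS read codon by codon. As a rough check, the sketch below translates short excerpts of the CDS with the standard genetic code. It assumes the standard nuclear code applies to this gene; the `translate` function and the `CDS_START` and `CDS_END` excerpts (the first 39 and last 18 nucleotides of the CDS above) are illustrative names, not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the CDS above (both start on a codon boundary).
CDS_START = "ATGACACCAGGAAGGAGTCAACGACGCTCTGATGATAAC"
CDS_END = "GCTAGGGCGAACCGCTAA"

print(translate(CDS_START))  # start of the protein
print(translate(CDS_END))    # end of the protein, before the TAA stop
```

The two excerpts translate to `MTPGRSQRRSDDN` and `ARANR`, which match the start and end of the listed protein sequence. Pasting the full CDS into `translate` should give the whole 437-residue protein.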