CuGenDBv2

Moc07g04860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g04860
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:4126131..4133139
RNA-Seq Expression: Moc07g04860
Synteny: Moc07g04860
Gene Ontology terms:
  GO:0005524 - ATP binding (molecular function)
  GO:0016887 - ATPase activity (molecular function)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
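
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `chr:start..end` string uses 1-based inclusive coordinates (an assumption; the record does not state the convention). The function name `parse_location` is hypothetical and not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end, span).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr7:4126131..4133139")
print(chrom, start, end, span)  # chr7 4126131 4133139 7009
```

Under that assumption the gene region spans 7009 bp on chr7.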


Homology
GenBank top hits    e value    % identity    Alignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]    5.6e-91    46.19
Query:  VSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVARS
        +SIKPIPEL QA++DTLK+YKD F  GRK+GTLVTD+LLLESGLL+YNPLVRP+EASRPNSEL MV  FT SVKRKS+GRAHA K VQS++P T  V ++
Subjt:  VSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVARS

Query:  AAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRFRV
        AAQ +AGPSS  PTPVIELDS GE SREKR  +ESEALDVSPL EVR                                                     
Subjt:  AAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRFRV

Query:  EPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVEIL
                                                                                                            
Subjt:  EPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVEIL

Query:  KAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFDGFAKDFRDAGFK
            EAKA+LLK+EDER KAH RAAHAITKGLEKEKFQLLKEKD  +                                 FRQHPDFDGFAKDF DAGFK
Subjt:  KAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFDGFAKDFRDAGFK

Query:  FLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        FLMKGIAAD+ HL++D  D+KKRYAEKWASGPN T  P SLV+KYVR+LDSDYSDL+E++ PSQE  EVGTT
Subjt:  FLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]    9.2e-94    70.76
Query:  MRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARS
        MRFR+E SSSGVKD VS IS +CLDRCLRRAS+FV+D  SVLQRTID+A EAFIASIHSAVM+KAELDGRE L AKE+EN S  LEAATT+KG LLKA+ 
Subjt:  MRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARS

Query:  EVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDFDGFAKDFR
        EV+IL+AEV+AK  LLKKE E+ KAH RAAHAITKGLEKEKFQLLKE          KDASI                       FRQHP+FDGFAKDF 
Subjt:  EVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDFDGFAKDFR

Query:  DAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        DAGFKFLMKGIAADM HLQID SD+KKRY+E WASGPN TP P+SLV+KYVRELDSDYSD+EE DAPSQE  +VGTT
Subjt:  DAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]    2.1e-77    60.79
Query:  GTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALE-AATTMKG
        G   +  + R+EPSSSGV+D VS IS + LDRCLRRASKFV+   SVLQRTID+A EAF+ASI SA+ +KAELDGRE+LAA+EKE  S ALE A++TMK 
Subjt:  GTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALE-AATTMKG

Query:  GLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFD
         LLKA SEVE LKAEVE++A+LLKKE++R++A  RAAHAIT+GLE+EKFQLLKEKD  +                                 FRQHPDFD
Subjt:  GLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFD

Query:  GFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQ
        GFAKDF DAGFKFLMKGIA+DM  LQID S +K+RYAEKWASGP  TP P++LV++YVR+LDSDYSD EE+   S ++
Subjt:  GFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQ

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]    2.1e-98    71.23
Query:  MGGTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMK
        MGGT DV+ RFR+EPSSSGVKD VS IS +CLDRCL+RASKFV+D  SVLQRTID+A EAF+ASIHSA+M+KAELDGRE LAAKE+ENSS ALEAATT+K
Subjt:  MGGTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMK

Query:  GGLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDF
        G LLKA+ EV IL+AEV+AKA+LLKKE E+ KAH RAAHAITKGLEKEKFQLLKE          KD SI                       FRQH DF
Subjt:  GGLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDF

Query:  DGFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        DGFAKDF DAGFKFLMKGIAADM HLQID S++KK+Y+EKWASGPN TP P+SLV KYVRELDSDYSD+EE DAPSQE NE+GTT
Subjt:  DGFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]    7.6e-173    71.94
Query:  SIVSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVA
        ++VSIK IPEL QA++DTLK+YKD F   RK+ TLVTD+LLLESGLL+YNPLVR +EASRPNSEL MV  FTGSVKRKS+GRAHA KTV  TEP T TV 
Subjt:  SIVSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVA

Query:  RSAAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRF
        R+ AQ  +GPSS VPTPVIELD +G  S EKR   ESEALDVSPL EVR +SPL+RRRKKKKT++SSE G RG LPTSHADLVDDPEARM GTS+V+MRF
Subjt:  RSAAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRF

Query:  RVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVE
         +EPSSSGVKD VS IS +CLDR LRRASKFV+D  SVLQRTID+  EAFIASIH AVM+KAELDGRE LAAKE+ENS  ALEAATT+KG LLKA+ EV+
Subjt:  RVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVE

Query:  ILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLK----------EKDASI----------------------PFRQHPDFDGFAKDFRDAG
        IL+AEV+AK  LLKKE E+ KAH RAAHAITKGLEKEKFQLLK          EKDASI                       FRQHPDFDGFAKDF DAG
Subjt:  ILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLK----------EKDASI----------------------PFRQHPDFDGFAKDFRDAG

Query:  FKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        FKFLMKGIAADM HLQID + +KK+Y+EKWASGPN TPDP+SLV+KYVRELDSDYSD+EE DAPSQE  EVGTT
Subjt:  FKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
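
The middle line of each alignment row marks identical residues with the residue letter and similar residues with `+`. As a rough sketch, the counts for one row can be tallied as below. The helper name `midline_stats` is hypothetical, and the % identity reported in the table is computed by the database over the whole alignment, possibly with a different denominator, so a per-row figure will not necessarily match it.

```python
def midline_stats(query, midline):
    """Count identical and similar ('+') columns in one alignment row.

    Returns (identical, positive, columns). Both strings must have the
    same length, i.e. the midline must keep its padding spaces.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must be the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positive = sum(1 for m in midline if m == "+")
    return identical, positive, len(query)


# First ten columns of the last row of the XP_022142326.1 alignment.
print(midline_stats("FLMKGIAADM", "FLMKGIAAD+"))  # (9, 1, 10)
```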

TrEMBL top hits    e value    % identity    Alignment
A0A6J1CLV1 uncharacterized protein LOC111012467    2.7e-91    46.19
Query:  VSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVARS
        +SIKPIPEL QA++DTLK+YKD F  GRK+GTLVTD+LLLESGLL+YNPLVRP+EASRPNSEL MV  FT SVKRKS+GRAHA K VQS++P T  V ++
Subjt:  VSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVARS

Query:  AAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRFRV
        AAQ +AGPSS  PTPVIELDS GE SREKR  +ESEALDVSPL EVR                                                     
Subjt:  AAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRFRV

Query:  EPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVEIL
                                                                                                            
Subjt:  EPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVEIL

Query:  KAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFDGFAKDFRDAGFK
            EAKA+LLK+EDER KAH RAAHAITKGLEKEKFQLLKEKD  +                                 FRQHPDFDGFAKDF DAGFK
Subjt:  KAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFDGFAKDFRDAGFK

Query:  FLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        FLMKGIAAD+ HL++D  D+KKRYAEKWASGPN T  P SLV+KYVR+LDSDYSDL+E++ PSQE  EVGTT
Subjt:  FLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

A0A6J1D1N9 uncharacterized protein LOC111016193    4.4e-94    70.76
Query:  MRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARS
        MRFR+E SSSGVKD VS IS +CLDRCLRRAS+FV+D  SVLQRTID+A EAFIASIHSAVM+KAELDGRE L AKE+EN S  LEAATT+KG LLKA+ 
Subjt:  MRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARS

Query:  EVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDFDGFAKDFR
        EV+IL+AEV+AK  LLKKE E+ KAH RAAHAITKGLEKEKFQLLKE          KDASI                       FRQHP+FDGFAKDF 
Subjt:  EVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDFDGFAKDFR

Query:  DAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        DAGFKFLMKGIAADM HLQID SD+KKRY+E WASGPN TP P+SLV+KYVRELDSDYSD+EE DAPSQE  +VGTT
Subjt:  DAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

A0A6J1D971 uncharacterized protein LOC111018538    9.9e-78    60.79
Query:  GTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALE-AATTMKG
        G   +  + R+EPSSSGV+D VS IS + LDRCLRRASKFV+   SVLQRTID+A EAF+ASI SA+ +KAELDGRE+LAA+EKE  S ALE A++TMK 
Subjt:  GTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALE-AATTMKG

Query:  GLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFD
         LLKA SEVE LKAEVE++A+LLKKE++R++A  RAAHAIT+GLE+EKFQLLKEKD  +                                 FRQHPDFD
Subjt:  GLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASI--------------------------------PFRQHPDFD

Query:  GFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQ
        GFAKDF DAGFKFLMKGIA+DM  LQID S +K+RYAEKWASGP  TP P++LV++YVR+LDSDYSD EE+   S ++
Subjt:  GFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQ

A0A6J1DF31 uncharacterized protein LOC111019909    1.0e-98    71.23
Query:  MGGTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMK
        MGGT DV+ RFR+EPSSSGVKD VS IS +CLDRCL+RASKFV+D  SVLQRTID+A EAF+ASIHSA+M+KAELDGRE LAAKE+ENSS ALEAATT+K
Subjt:  MGGTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMK

Query:  GGLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDF
        G LLKA+ EV IL+AEV+AKA+LLKKE E+ KAH RAAHAITKGLEKEKFQLLKE          KD SI                       FRQH DF
Subjt:  GGLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKE----------KDASI----------------------PFRQHPDF

Query:  DGFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        DGFAKDF DAGFKFLMKGIAADM HLQID S++KK+Y+EKWASGPN TP P+SLV KYVRELDSDYSD+EE DAPSQE NE+GTT
Subjt:  DGFAKDFRDAGFKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

A0A6J1DZB3 uncharacterized protein LOC111025665    3.7e-173    71.94
Query:  SIVSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVA
        ++VSIK IPEL QA++DTLK+YKD F   RK+ TLVTD+LLLESGLL+YNPLVR +EASRPNSEL MV  FTGSVKRKS+GRAHA KTV  TEP T TV 
Subjt:  SIVSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDRLLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVA

Query:  RSAAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRF
        R+ AQ  +GPSS VPTPVIELD +G  S EKR   ESEALDVSPL EVR +SPL+RRRKKKKT++SSE G RG LPTSHADLVDDPEARM GTS+V+MRF
Subjt:  RSAAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVREDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRF

Query:  RVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVE
         +EPSSSGVKD VS IS +CLDR LRRASKFV+D  SVLQRTID+  EAFIASIH AVM+KAELDGRE LAAKE+ENS  ALEAATT+KG LLKA+ EV+
Subjt:  RVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVMMKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVE

Query:  ILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLK----------EKDASI----------------------PFRQHPDFDGFAKDFRDAG
        IL+AEV+AK  LLKKE E+ KAH RAAHAITKGLEKEKFQLLK          EKDASI                       FRQHPDFDGFAKDF DAG
Subjt:  ILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLK----------EKDASI----------------------PFRQHPDFDGFAKDFRDAG

Query:  FKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT
        FKFLMKGIAADM HLQID + +KK+Y+EKWASGPN TPDP+SLV+KYVRELDSDYSD+EE DAPSQE  EVGTT
Subjt:  FKFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGAGTAAATGCATAAGCAATCTTCCTCAAATTCTTGAAGAAAAAGAGGGGTCGTCGTATCATGATCAACCTCCAAGTACGGTGGCAACCCGGGGTCATAATGCTCAGAA
CCTCGAGGCCCCAATTCGACCTGGCCCGAAGCGAGGTAAGGGGAAGGAAAATACACAACCTAGGAGGAGAAGATTGGATTGGCCTTGTGTCGAGCAAGACCTAGCGTCGC
TAGTCCGCGATCCTACTAAAGAAAAAGTACCCAGGGATGGAGTTGGAAGACCTGTCCTAGCACCTCCTCTGAACGTTGTTTTGTTGGTAGATGATATGGAGCAAGAAATA
CGAGCGTACGCAACCCCAGCATTCTACGACTTCAACCCAATGATTGCAGATCATAATATTGAAGCCAATAGATTTGAGCTTAAACCAATAAAGACTGTTTATGCAAAGAA
TTGCACAACGGGGTTTGCGAACCTAGCTCGAACCCGATCACCAACTCGACCCGAACCATGGAGTGAACCTGCACAAGAGGGCAAACTCTCCGACGATCAAATCAGTATAG
TTTCAATCAAACCAATTCCCGAGCTAACTCAAGCATCTTGGGATACTCTCAAGTATTACAAGGATCGCTTCTCAAGTGGCAGGAAGGTCGGAACCTTGGTAACTGACCGG
CTGTTACTGGAGTCCGGGTTGTTAAACTACAACCCCTTAGTGCGCCCAGTTGAAGCTTCAAGACCAAACTCTGAGCTCGTGATGGTGTTAGAATTCACAGGCAGCGTGAA
ACGTAAGTCCAGGGGTCGTGCTCACGCCTTTAAGACTGTTCAAAGCACGGAGCCAACAACTTCTACTGTTGCTCGATCTGCAGCTCAAGTCAAGGCTGGGCCGTCTTCTG
AAGTCCCAACTCCGGTGATCGAGTTGGACTCTGCTGGGGAGCACTCTAGAGAAAAGCGTCCAATGAATGAGTCCGAGGCACTGGACGTGTCACCTCTGTGCGAGGTGAGA
GAAGACTCTCCTCTGAAGAGGAGAAGGAAGAAGAAGAAAACCACCACCTCCTCTGAGGTTGGACCTCGTGGGCCCCTGCCCACGAGCCATGCTGACCTGGTGGACGACCC
CGAAGCTCGGATGGGGGGGACGTCCGACGTGAAGATGCGGTTCAGAGTGGAACCGTCAAGCTCCGGGGTAAAGGACCATGTGTCTAGCATTTCGGTTTCATGCTTGGACC
GCTGCCTTAGAAGGGCGTCCAAGTTCGTAAATGACCATAGGTCTGTACTGCAAAGGACCATTGACCACGCCGTTGAGGCGTTCATTGCTTCAATTCACTCGGCAGTTATG
ATGAAGGCCGAGCTGGATGGAAGGGAGATCTTGGCAGCGAAGGAGAAGGAGAATTCTTCTGTTGCCTTGGAAGCCGCCACCACAATGAAGGGCGGGCTACTGAAAGCTCG
CTCCGAAGTGGAGATTTTGAAGGCCGAGGTGGAGGCCAAGGCTCAGCTGCTGAAGAAAGAGGACGAGAGGCAGAAGGCCCACTTCCGAGCTGCCCATGCCATCACCAAGG
GGTTGGAGAAGGAGAAATTCCAGCTCCTGAAGGAGAAGGACGCTTCAATACCATTCAGGCAACACCCAGATTTTGATGGGTTCGCCAAAGATTTTCGTGATGCGGGCTTC
AAGTTCCTGATGAAGGGCATTGCTGCCGACATGACTCATCTCCAGATCGACCCCAGCGATATGAAAAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCTAATAGCACTCC
TGACCCCGAATCCTTGGTGGAGAAGTACGTCAGAGAGCTAGACTCTGACTACTCTGACCTGGAAGAGAACGATGCTCCTAGTCAGGAGCAGAACGAGGTCGGCACCACAT
AA
mRNA sequence
ATGAGTAAATGCATAAGCAATCTTCCTCAAATTCTTGAAGAAAAAGAGGGGTCGTCGTATCATGATCAACCTCCAAGTACGGTGGCAACCCGGGGTCATAATGCTCAGAA
CCTCGAGGCCCCAATTCGACCTGGCCCGAAGCGAGGTAAGGGGAAGGAAAATACACAACCTAGGAGGAGAAGATTGGATTGGCCTTGTGTCGAGCAAGACCTAGCGTCGC
TAGTCCGCGATCCTACTAAAGAAAAAGTACCCAGGGATGGAGTTGGAAGACCTGTCCTAGCACCTCCTCTGAACGTTGTTTTGTTGGTAGATGATATGGAGCAAGAAATA
CGAGCGTACGCAACCCCAGCATTCTACGACTTCAACCCAATGATTGCAGATCATAATATTGAAGCCAATAGATTTGAGCTTAAACCAATAAAGACTGTTTATGCAAAGAA
TTGCACAACGGGGTTTGCGAACCTAGCTCGAACCCGATCACCAACTCGACCCGAACCATGGAGTGAACCTGCACAAGAGGGCAAACTCTCCGACGATCAAATCAGTATAG
TTTCAATCAAACCAATTCCCGAGCTAACTCAAGCATCTTGGGATACTCTCAAGTATTACAAGGATCGCTTCTCAAGTGGCAGGAAGGTCGGAACCTTGGTAACTGACCGG
CTGTTACTGGAGTCCGGGTTGTTAAACTACAACCCCTTAGTGCGCCCAGTTGAAGCTTCAAGACCAAACTCTGAGCTCGTGATGGTGTTAGAATTCACAGGCAGCGTGAA
ACGTAAGTCCAGGGGTCGTGCTCACGCCTTTAAGACTGTTCAAAGCACGGAGCCAACAACTTCTACTGTTGCTCGATCTGCAGCTCAAGTCAAGGCTGGGCCGTCTTCTG
AAGTCCCAACTCCGGTGATCGAGTTGGACTCTGCTGGGGAGCACTCTAGAGAAAAGCGTCCAATGAATGAGTCCGAGGCACTGGACGTGTCACCTCTGTGCGAGGTGAGA
GAAGACTCTCCTCTGAAGAGGAGAAGGAAGAAGAAGAAAACCACCACCTCCTCTGAGGTTGGACCTCGTGGGCCCCTGCCCACGAGCCATGCTGACCTGGTGGACGACCC
CGAAGCTCGGATGGGGGGGACGTCCGACGTGAAGATGCGGTTCAGAGTGGAACCGTCAAGCTCCGGGGTAAAGGACCATGTGTCTAGCATTTCGGTTTCATGCTTGGACC
GCTGCCTTAGAAGGGCGTCCAAGTTCGTAAATGACCATAGGTCTGTACTGCAAAGGACCATTGACCACGCCGTTGAGGCGTTCATTGCTTCAATTCACTCGGCAGTTATG
ATGAAGGCCGAGCTGGATGGAAGGGAGATCTTGGCAGCGAAGGAGAAGGAGAATTCTTCTGTTGCCTTGGAAGCCGCCACCACAATGAAGGGCGGGCTACTGAAAGCTCG
CTCCGAAGTGGAGATTTTGAAGGCCGAGGTGGAGGCCAAGGCTCAGCTGCTGAAGAAAGAGGACGAGAGGCAGAAGGCCCACTTCCGAGCTGCCCATGCCATCACCAAGG
GGTTGGAGAAGGAGAAATTCCAGCTCCTGAAGGAGAAGGACGCTTCAATACCATTCAGGCAACACCCAGATTTTGATGGGTTCGCCAAAGATTTTCGTGATGCGGGCTTC
AAGTTCCTGATGAAGGGCATTGCTGCCGACATGACTCATCTCCAGATCGACCCCAGCGATATGAAAAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCTAATAGCACTCC
TGACCCCGAATCCTTGGTGGAGAAGTACGTCAGAGAGCTAGACTCTGACTACTCTGACCTGGAAGAGAACGATGCTCCTAGTCAGGAGCAGAACGAGGTCGGCACCACAT
AA
Protein sequence
MSKCISNLPQILEEKEGSSYHDQPPSTVATRGHNAQNLEAPIRPGPKRGKGKENTQPRRRRLDWPCVEQDLASLVRDPTKEKVPRDGVGRPVLAPPLNVVLLVDDMEQEI
RAYATPAFYDFNPMIADHNIEANRFELKPIKTVYAKNCTTGFANLARTRSPTRPEPWSEPAQEGKLSDDQISIVSIKPIPELTQASWDTLKYYKDRFSSGRKVGTLVTDR
LLLESGLLNYNPLVRPVEASRPNSELVMVLEFTGSVKRKSRGRAHAFKTVQSTEPTTSTVARSAAQVKAGPSSEVPTPVIELDSAGEHSREKRPMNESEALDVSPLCEVR
EDSPLKRRRKKKKTTTSSEVGPRGPLPTSHADLVDDPEARMGGTSDVKMRFRVEPSSSGVKDHVSSISVSCLDRCLRRASKFVNDHRSVLQRTIDHAVEAFIASIHSAVM
MKAELDGREILAAKEKENSSVALEAATTMKGGLLKARSEVEILKAEVEAKAQLLKKEDERQKAHFRAAHAITKGLEKEKFQLLKEKDASIPFRQHPDFDGFAKDFRDAGF
KFLMKGIAADMTHLQIDPSDMKKRYAEKWASGPNSTPDPESLVEKYVRELDSDYSDLEENDAPSQEQNEVGTT