; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNAse I-like superfamily protein
Genome locationchr7:7058117..7064184
RNA-Seq ExpressionMoc07g09200
SyntenyMoc07g09200
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]2.2e-10585.42Show/hide
Query:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------
        MCARKG CGIVKGPTSIKGWVRKWFYASGEWLAKDESV+IRPV ELTQ SFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAV            
Subjt:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------

Query:  PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGAC
         MVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV  PASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVP KRRRKKKK TSPLEVGA 
Subjt:  PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGAC

Query:  RVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ
         VLPASFADRVDDPEARMGGT +VT RFRVEPSSSGVRDQ
Subjt:  RVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]4.5e-11181.95Show/hide
Query:  LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAP QVAPNGWGVIFALAILFWLRA+DSEEAELLDVDQLLACFEAKRI KKPGRFYMCARKG  GIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRKSK
        EWLAKDES              VSIRPV ELTQ SFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAV             MVC FAS VKRKSK
Subjt:  EWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
        GRAHALEAAQSSKP TPAV  PASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  GRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]8.5e-13475.49Show/hide
Query:  MSSSFSSNLGPIEDLTRRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------
        MSSS SSNL    DL RRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG                                    
Subjt:  MSSSFSSNLGPIEDLTRRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------

Query:  --LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYA
          LPLHPFVQEFLFRTGLAP QVAPNGWGVIFALAILFWLRA+DSEEAEL DVDQLLACFEAKRI KKPGRFYMCARKG  GIVKGPTSIKGWVRKWFYA
Subjt:  --LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRK
        SGEWLAKDES              VSIRPV ELTQ SFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAV             MVCGFAS VKRK
Subjt:  SGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRK

Query:  SKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV  PASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]2.4e-9687.55Show/hide
Query:  MVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGACR
        MVCGFASSVKRKSKGRAHA EAAQSSKPATPAVA PASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVP KRRRKKKK  SPLEVGAC 
Subjt:  MVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGACR

Query:  VLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTS+VTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELL
        EKEEFS ALEA     KD+      E+E   AE+ET  E L
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.0e-10763.27Show/hide
Query:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKGT GIVKGPTSIKGWV KWF+ASGEWLAKDES              VSI+ + EL Q +FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AV------------PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         V             MVCGF  SVKRKSKGRAHAL+    ++P TP V        + P+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AV------------PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPQKRRRKKKKMTSPLEVGACRVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E P +RRRKKKK +S  E GA   LP S AD VDDPEARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPQKRRRKKKKMTSPLEVGACRVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDERKA
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDERKA

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.1e-10585.42Show/hide
Query:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------
        MCARKG CGIVKGPTSIKGWVRKWFYASGEWLAKDESV+IRPV ELTQ SFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAV            
Subjt:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------

Query:  PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGAC
         MVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV  PASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVP KRRRKKKK TSPLEVGA 
Subjt:  PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGAC

Query:  RVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ
         VLPASFADRVDDPEARMGGT +VT RFRVEPSSSGVRDQ
Subjt:  RVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138262.2e-11181.95Show/hide
Query:  LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAP QVAPNGWGVIFALAILFWLRA+DSEEAELLDVDQLLACFEAKRI KKPGRFYMCARKG  GIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRKSK
        EWLAKDES              VSIRPV ELTQ SFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAV             MVC FAS VKRKSK
Subjt:  EWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
        GRAHALEAAQSSKP TPAV  PASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  GRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1DXS5 uncharacterized protein LOC1110255024.1e-13475.49Show/hide
Query:  MSSSFSSNLGPIEDLTRRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------
        MSSS SSNL    DL RRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG                                    
Subjt:  MSSSFSSNLGPIEDLTRRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------

Query:  --LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYA
          LPLHPFVQEFLFRTGLAP QVAPNGWGVIFALAILFWLRA+DSEEAEL DVDQLLACFEAKRI KKPGRFYMCARKG  GIVKGPTSIKGWVRKWFYA
Subjt:  --LPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRK
        SGEWLAKDES              VSIRPV ELTQ SFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAV             MVCGFAS VKRK
Subjt:  SGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAV------------PMVCGFASSVKRK

Query:  SKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV  PASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256061.2e-9687.55Show/hide
Query:  MVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGACR
        MVCGFASSVKRKSKGRAHA EAAQSSKPATPAVA PASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVP KRRRKKKK  SPLEVGAC 
Subjt:  MVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGACR

Query:  VLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTS+VTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELL
        EKEEFS ALEA     KD+      E+E   AE+ET  E L
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELL

A0A6J1DZB3 uncharacterized protein LOC1110256651.9e-10763.27Show/hide
Query:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKGT GIVKGPTSIKGWV KWF+ASGEWLAKDES              VSI+ + EL Q +FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGTCGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AV------------PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         V             MVCGF  SVKRKSKGRAHAL+    ++P TP V        + P+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AV------------PMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AWPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPQKRRRKKKKMTSPLEVGACRVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E P +RRRKKKK +S  E GA   LP S AD VDDPEARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPQKRRRKKKKMTSPLEVGACRVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDERKA
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDERKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein3.5e-0832.99Show/hide
Query:  QERRRKTWDLIERFHSVS---NLPWILGGNFNEILSDSEKLGRAPRHQSI--MQDFRDIVDSCGLVDPGFTGDVFTWCDGHTMRQPIWERLDRFLIN
        +  RR  WD I R  + S   N PW++ G+FN+I S +E     P + S+  ++D +  +    LVD    G ++TW   H    PI  +LDR ++N
Subjt:  QERRRKTWDLIERFHSVS---NLPWILGGNFNEILSDSEKLGRAPRHQSI--MQDFRDIVDSCGLVDPGFTGDVFTWCDGHTMRQPIWERLDRFLIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCTGTTATACATGAGGACAGTAGGATTTGGCGATTCACATGTATTTATGGGAAACCTATTCAGGAAAGAAGACGAAAGACTTGGGATCTTATCGAACGGTTTCA
CAGTGTCTCTAATCTTCCTTGGATTTTGGGGGGAAATTTTAATGAAATCCTTTCGGACTCGGAGAAATTGGGAAGAGCCCCGAGACATCAGAGTATTATGCAAGATTTCA
GGGACATTGTTGATTCATGTGGACTAGTTGATCCAGGTTTCACGGGGGATGTTTTCACTTGGTGTGATGGGCATACTATGCGTCAGCCTATCTGGGAGAGGTTGGACAGG
TTTCTAATCAATAGCGATATGATCCAATCTCAAGCGTTCTGGGAGGTTCGTCATTTAGAGTTCCTTGCTTCAGATCATAGACCGATTCTGGCAATTTGGCAAGAGTGGTG
GAAGGGTGATGGAAATAGCAGGAAGGGGCGAAGGGCGGGTTGTTTTGAGGAAAGATGGACGTCGTTTGGTGAGTGCAAGGTCAAGCTGGGTGGTTCCCTCAGCGGGGCTA
TTGCTAAGAAAGAGGCTAAGATTCATAATTTGTCTCTTTGTGGCGATGAGAATTGGAGAAGTAACTTGTGGAAGGCTGAAAGTGAGCTGGAAGCGCTACTAGAGGAAGAG
GAGAAGTATTGGCGTCAGAGAAGTAGAGAAGATTGGTTGAAGTGGGGAGATAGGAACACGAAGTGGTGCAAGGCCTATGGCGAATCGGCGACTCATATTTTCTGGGAGTG
TAAGATTGCAGCTCGAACTCGGCTTCCGGACCGATCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGTTGCCATGTCGTCCT
CTTTTAGCAGCAACTTAGGACCCATTGAGGACTTAACTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCC
TCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCG
GACTGGGTTGGCTCCGACTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTCTGGCTACGAGCTCAGGATAGCGAAGAGGCCGAGCTGTTGG
ACGTAGACCAGCTTCTCGCGTGCTTCGAAGCGAAAAGGATAACTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCACATGCGGTATAGTTAAGGGGCCGACC
TCCATCAAAGGATGGGTGAGGAAGTGGTTCTATGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGTTTCAATCCGACCAGTCCTCGAGCTTACGCAAGTCTCCTTCGA
CACGTTGAAATATTACAAGGAGCGCTTTCCGAAGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTACTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTC
CCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCGAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCATGGCCT
GCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGA
GGAGGTGAGGGAGGAAGTCCCTCAGAAGCGAAGGAGGAAGAAAAAGAAGATGACCTCCCCCTTGGAGGTCGGAGCTTGTAGGGTCTTGCCTGCGAGCTTTGCAGATCGGG
TGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCAATGTGACAGCACGGTTTAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAAGTGTCCCGCATCTCAGCTGCA
AGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATC
GGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGC
TGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACGAGCGCAAGGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCTGTTATACATGAGGACAGTAGGATTTGGCGATTCACATGTATTTATGGGAAACCTATTCAGGAAAGAAGACGAAAGACTTGGGATCTTATCGAACGGTTTCA
CAGTGTCTCTAATCTTCCTTGGATTTTGGGGGGAAATTTTAATGAAATCCTTTCGGACTCGGAGAAATTGGGAAGAGCCCCGAGACATCAGAGTATTATGCAAGATTTCA
GGGACATTGTTGATTCATGTGGACTAGTTGATCCAGGTTTCACGGGGGATGTTTTCACTTGGTGTGATGGGCATACTATGCGTCAGCCTATCTGGGAGAGGTTGGACAGG
TTTCTAATCAATAGCGATATGATCCAATCTCAAGCGTTCTGGGAGGTTCGTCATTTAGAGTTCCTTGCTTCAGATCATAGACCGATTCTGGCAATTTGGCAAGAGTGGTG
GAAGGGTGATGGAAATAGCAGGAAGGGGCGAAGGGCGGGTTGTTTTGAGGAAAGATGGACGTCGTTTGGTGAGTGCAAGGTCAAGCTGGGTGGTTCCCTCAGCGGGGCTA
TTGCTAAGAAAGAGGCTAAGATTCATAATTTGTCTCTTTGTGGCGATGAGAATTGGAGAAGTAACTTGTGGAAGGCTGAAAGTGAGCTGGAAGCGCTACTAGAGGAAGAG
GAGAAGTATTGGCGTCAGAGAAGTAGAGAAGATTGGTTGAAGTGGGGAGATAGGAACACGAAGTGGTGCAAGGCCTATGGCGAATCGGCGACTCATATTTTCTGGGAGTG
TAAGATTGCAGCTCGAACTCGGCTTCCGGACCGATCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGTTGCCATGTCGTCCT
CTTTTAGCAGCAACTTAGGACCCATTGAGGACTTAACTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCC
TCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCG
GACTGGGTTGGCTCCGACTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTCTGGCTACGAGCTCAGGATAGCGAAGAGGCCGAGCTGTTGG
ACGTAGACCAGCTTCTCGCGTGCTTCGAAGCGAAAAGGATAACTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCACATGCGGTATAGTTAAGGGGCCGACC
TCCATCAAAGGATGGGTGAGGAAGTGGTTCTATGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGTTTCAATCCGACCAGTCCTCGAGCTTACGCAAGTCTCCTTCGA
CACGTTGAAATATTACAAGGAGCGCTTTCCGAAGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTACTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTC
CCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCGAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCATGGCCT
GCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGA
GGAGGTGAGGGAGGAAGTCCCTCAGAAGCGAAGGAGGAAGAAAAAGAAGATGACCTCCCCCTTGGAGGTCGGAGCTTGTAGGGTCTTGCCTGCGAGCTTTGCAGATCGGG
TGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCAATGTGACAGCACGGTTTAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAAGTGTCCCGCATCTCAGCTGCA
AGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATC
GGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGC
TGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACGAGCGCAAGGCCTAA
Protein sequenceShow/hide protein sequence
MDAVIHEDSRIWRFTCIYGKPIQERRRKTWDLIERFHSVSNLPWILGGNFNEILSDSEKLGRAPRHQSIMQDFRDIVDSCGLVDPGFTGDVFTWCDGHTMRQPIWERLDR
FLINSDMIQSQAFWEVRHLEFLASDHRPILAIWQEWWKGDGNSRKGRRAGCFEERWTSFGECKVKLGGSLSGAIAKKEAKIHNLSLCGDENWRSNLWKAESELEALLEEE
EKYWRQRSREDWLKWGDRNTKWCKAYGESATHIFWECKIAARTRLPDRSEHLGGPAQKGEHSDDQVSIVAMSSSFSSNLGPIEDLTRRLESELEEIENFRFSDDGEDSDA
STSGQGLEYPSRIPEHYLGSLRRGLPLHPFVQEFLFRTGLAPTQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDQLLACFEAKRITKKPGRFYMCARKGTCGIVKGPT
SIKGWVRKWFYASGEWLAKDESVSIRPVLELTQVSFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVPMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAWP
ASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKMTSPLEVGACRVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAA
SLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDERKA