; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g28120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g28120
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRNase H domain-containing protein
Genome locationchr9:21131844..21141150
RNA-Seq ExpressionMoc09g28120
SyntenyMoc09g28120
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]4.6e-2359.46Show/hide
Query:  LKDHRGQVLASGTKYLEFVSSVDEAEALAAAKGINLAMETGIYPIQLETDSSQIFHLVWQEVEDLSETGEIVARAKGMLPNGDQRRFSFTRRGGNGAAHN
        ++DHRGQVLAS TKYLE V+SVD+AEALAA +G+ +AMETGI PI LETDS +I++L  ++ E LS+TG I+   K  L    Q  +SFT+R GN  AH 
Subjt:  LKDHRGQVLASGTKYLEFVSSVDEAEALAAAKGINLAMETGIYPIQLETDSSQIFHLVWQEVEDLSETGEIVARAKGMLPNGDQRRFSFTRRGGNGAAHN

Query:  LARRALLQQEN
        LARRAL  QEN
Subjt:  LARRALLQQEN

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.3e-2534.21Show/hide
Query:  LPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPLLIGARVIPPPHPLN
        +P   +   + LK   D + +G+K+GTL+TDKLL  S LLDY  L   PIEA RPNSELAMVCGF + VKRK         RAH L I            
Subjt:  LPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPLLIGARVIPPPHPLN

Query:  ALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLAPKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAAAAEGELKE
                                    S+ V   V  +   D A  S+       A  T V          +ELD     +++ R  S     E E  +
Subjt:  ALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLAPKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAAAAEGELKE

Query:  VKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRN----ITLANKFVA---FKAEIEKQKVKLLSGKFLETTFQAHPNFDG
        V    EV + + E+LK + E+ +         L+  HA+ KGL+ EKF+LL+        L  K  A     AE++ +K +L +G  LE  F+ HP+FDG
Subjt:  VKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRN----ITLANKFVA---FKAEIEKQKVKLLSGKFLETTFQAHPNFDG

Query:  FAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG
        FAKDFSDA  KFL + I    P+L  DLG LK    +  ASG
Subjt:  FAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.8e-2241.58Show/hide
Query:  PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRE---TSFAAAA--EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMV
        P S +Q+ ID A E  +   H+ VM+K ELD R+ LT K+RE   T+  AA   +GEL + + EV++L+ E +  KTDL K  KE   +   L+  HA+ 
Subjt:  PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRE---TSFAAAA--EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMV

Query:  KGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLA
        KGL+ EKF+LL+    LA               E++  K +L  G  LE +F+ HPNFDGFAKDFSDA  KFL + I    P+L  DL  LK    +  A
Subjt:  KGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLA

Query:  SG
        SG
Subjt:  SG

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia]2.4e-2432.89Show/hide
Query:  LNALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLA---------------------------------------------------PK
        L   +V PL+ V  +    K+K+ +RKT  S+ V  EV + GI  LA                                                   P+
Subjt:  LNALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLA---------------------------------------------------PK

Query:  SAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRET-----SFAAAAEGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKG
        S I+++IDY V+ H V  HA ++MK +LDDRD +   +RE        A   E ELKE + E EVLK +   L+   +  + E  H   L K  + +VKG
Subjt:  SAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRET-----SFAAAAEGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKG

Query:  LKVEKFELLRRNITLA-------NKFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNLDLGSLKDHRGQVLASG
        L+ EKF+L+RRN  L        ++    K E+E  K KL +G  LE  FQAH +FD F  DFSD   KFL + I  +A +LDL  +K    +  ASG
Subjt:  LKVEKFELLRRNITLA-------NKFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNLDLGSLKDHRGQVLASG

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.0e-3533.26Show/hide
Query:  RYNILDSLGLCLPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPL--L
        R+  L S+ L +P   +   + LK   D + + +K+ TL+TDKLL  S LLDY  L  L IEA RPNSELAMVCGF   VKRK         RAH L  +
Subjt:  RYNILDSLGLCLPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPL--L

Query:  IGARVIPP-----------------PHPL-------------------NALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGV-------------------
        +G   + P                 P P+                    AL+V PL  V  E SP + + K++KT+ S                      
Subjt:  IGARVIPP-----------------PHPL-------------------NALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGV-------------------

Query:  ------------AIEVSLSGIIDLA---------------------PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAA-----AA
                     +E S SG+ D                       P S +Q+ ID   E  +   H  VM+K ELD R+ L  K+RE SFAA       
Subjt:  ------------AIEVSLSGIIDLA---------------------PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAA-----AA

Query:  EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQA
        +GEL + + EV++L+ E +  K DL K  KE   +   L+  HA+ KGL+ EKF+LL+    LA               E++  K +L +G  LE +F+ 
Subjt:  EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQA

Query:  HPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG
        HP+FDGFAKDFSDA  KFL + I    P+L  DL  LK    +  ASG
Subjt:  HPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG

TrEMBL top hitse value%identityAlignment
A0A6J1CDQ4 uncharacterized protein LOC1110105332.2e-2359.46Show/hide
Query:  LKDHRGQVLASGTKYLEFVSSVDEAEALAAAKGINLAMETGIYPIQLETDSSQIFHLVWQEVEDLSETGEIVARAKGMLPNGDQRRFSFTRRGGNGAAHN
        ++DHRGQVLAS TKYLE V+SVD+AEALAA +G+ +AMETGI PI LETDS +I++L  ++ E LS+TG I+   K  L    Q  +SFT+R GN  AH 
Subjt:  LKDHRGQVLASGTKYLEFVSSVDEAEALAAAKGINLAMETGIYPIQLETDSSQIFHLVWQEVEDLSETGEIVARAKGMLPNGDQRRFSFTRRGGNGAAHN

Query:  LARRALLQQEN
        LARRAL  QEN
Subjt:  LARRALLQQEN

A0A6J1CLV1 uncharacterized protein LOC1110124676.3e-2634.21Show/hide
Query:  LPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPLLIGARVIPPPHPLN
        +P   +   + LK   D + +G+K+GTL+TDKLL  S LLDY  L   PIEA RPNSELAMVCGF + VKRK         RAH L I            
Subjt:  LPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPLLIGARVIPPPHPLN

Query:  ALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLAPKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAAAAEGELKE
                                    S+ V   V  +   D A  S+       A  T V          +ELD     +++ R  S     E E  +
Subjt:  ALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLAPKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAAAAEGELKE

Query:  VKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRN----ITLANKFVA---FKAEIEKQKVKLLSGKFLETTFQAHPNFDG
        V    EV + + E+LK + E+ +         L+  HA+ KGL+ EKF+LL+        L  K  A     AE++ +K +L +G  LE  F+ HP+FDG
Subjt:  VKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRN----ITLANKFVA---FKAEIEKQKVKLLSGKFLETTFQAHPNFDG

Query:  FAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG
        FAKDFSDA  KFL + I    P+L  DLG LK    +  ASG
Subjt:  FAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG

A0A6J1D1N9 uncharacterized protein LOC1110161938.5e-2341.58Show/hide
Query:  PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRE---TSFAAAA--EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMV
        P S +Q+ ID A E  +   H+ VM+K ELD R+ LT K+RE   T+  AA   +GEL + + EV++L+ E +  KTDL K  KE   +   L+  HA+ 
Subjt:  PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRE---TSFAAAA--EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMV

Query:  KGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLA
        KGL+ EKF+LL+    LA               E++  K +L  G  LE +F+ HPNFDGFAKDFSDA  KFL + I    P+L  DL  LK    +  A
Subjt:  KGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLA

Query:  SG
        SG
Subjt:  SG

A0A6J1DBX9 uncharacterized protein LOC1110189131.2e-2432.89Show/hide
Query:  LNALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLA---------------------------------------------------PK
        L   +V PL+ V  +    K+K+ +RKT  S+ V  EV + GI  LA                                                   P+
Subjt:  LNALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLA---------------------------------------------------PK

Query:  SAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRET-----SFAAAAEGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKG
        S I+++IDY V+ H V  HA ++MK +LDDRD +   +RE        A   E ELKE + E EVLK +   L+   +  + E  H   L K  + +VKG
Subjt:  SAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRET-----SFAAAAEGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKG

Query:  LKVEKFELLRRNITLA-------NKFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNLDLGSLKDHRGQVLASG
        L+ EKF+L+RRN  L        ++    K E+E  K KL +G  LE  FQAH +FD F  DFSD   KFL + I  +A +LDL  +K    +  ASG
Subjt:  LKVEKFELLRRNITLA-------NKFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKDFSDASIKFLREEIRGLAPNLDLGSLKDHRGQVLASG

A0A6J1DZB3 uncharacterized protein LOC1110256652.0e-3533.26Show/hide
Query:  RYNILDSLGLCLPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPL--L
        R+  L S+ L +P   +   + LK   D + + +K+ TL+TDKLL  S LLDY  L  L IEA RPNSELAMVCGF   VKRK         RAH L  +
Subjt:  RYNILDSLGLCLPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPL--L

Query:  IGARVIPP-----------------PHPL-------------------NALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGV-------------------
        +G   + P                 P P+                    AL+V PL  V  E SP + + K++KT+ S                      
Subjt:  IGARVIPP-----------------PHPL-------------------NALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGV-------------------

Query:  ------------AIEVSLSGIIDLA---------------------PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAA-----AA
                     +E S SG+ D                       P S +Q+ ID   E  +   H  VM+K ELD R+ L  K+RE SFAA       
Subjt:  ------------AIEVSLSGIIDLA---------------------PKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAA-----AA

Query:  EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQA
        +GEL + + EV++L+ E +  K DL K  KE   +   L+  HA+ KGL+ EKF+LL+    LA               E++  K +L +G  LE +F+ 
Subjt:  EGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRNITLAN-------KFVAFKAEIEKQKVKLLSGKFLETTFQA

Query:  HPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG
        HP+FDGFAKDFSDA  KFL + I    P+L  DL  LK    +  ASG
Subjt:  HPNFDGFAKDFSDASIKFLREEIRGLAPNL--DLGSLKDHRGQVLASG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAAGTCAATGCGAGCTCAAAGTCTTAGGTCGTTCTCAGAGAGAATTGCATACCTGTCGAATGGGAGAAGTGCTCCTTTTATAGGCAGGCGAGGGGATACA
AGACCGTTAGGTTGCTTCGACATGCGGTTACGACTGTCGGGCCCTTCAAGTCGGTGGTCGGATCTGAGGCCGGAGTCCACAGCAGTGGGGGTAGTTGCTCGGGTG
GTGGGTTCCCCGATGAGCCAGATTGGAATCAGGGTATCTCATCCTTTAGTGGTTCCTCCTCTAGTGAATAGGCTGAATAGAGAAGTGGAGGAGATGCCAGAGAAC
TACCCTGGCCCCCTCCGTAGGCGTTACAACATCCTAGACTCCTTAGGTCTTTGTCTTCCTGTGGAAGGTGAGACTGCCGAGAATCCACTCAAGGGGTGTGTCGAC
CGATACGAGAAAGGTCAGAAAGTGGGGACCTTGTTGACCGACAAGCTGCTGTATGCATCCAGACTTTTGGATTACTGTCGGCTATCAAACTTGCCAATTGAGGCT
CTTCGACCTAACTCCGAATTAGCAATGGTGTGTGGTTTTGCTAACCCCGTAAAGCGAAAGTGTCCTAACAGTGGCCAATATGCGCTCAGGGCACACCCGTTGCTG
ATAGGGGCTAGGGTGATCCCCCCTCCCCACCCCCTTAATGCACTGGAGGTTTTGCCTTTGCGAGCTGTTCTTGAAGAGTTTTCCCCTAAGAAGACCAAGAGTAAG
AGGAGAAAGACTACCCATTCTGAAGGAGTCGCGATTGAGGTGAGTCTCAGTGGAATCATTGACCTTGCTCCTAAGTCGGCCATCCAACAGATGATCGACTACGCG
GTCGAGACTCATGTTGTTGTCTACCATGCTGTCGTTATGATGAAGGTCGAGTTGGACGATCGCGATTGGCTAACTAAGAAGGATAGGGAGACATCTTTTGCTGCT
GCCGCTGAGGGGGAGTTGAAGGAAGTTAAGGCTGAGGTTGAGGTGCTAAAAGTCGAGTGCGAGGTGTTGAAAACGGACCTCGAAAAAGCTCAAAAGGAAACGGTT
CATAACCTGACCCTGCTAAAAAGAGGTCATGCAATGGTAAAAGGCCTCAAAGTGGAGAAGTTCGAGCTGCTAAGGCGTAACATTACTTTGGCCAACAAATTCGTT
GCCTTTAAGGCCGAGATCGAGAAGCAAAAGGTCAAACTTTTAAGTGGTAAGTTCTTGGAGACCACCTTCCAAGCTCACCCTAACTTCGATGGCTTCGCCAAAGAC
TTTAGTGATGCGAGCATCAAGTTTCTGAGGGAAGAGATCAGGGGCTTGGCTCCTAACCTCGACCTCGGCTCCCTAAAAGATCACAGAGGTCAAGTTTTAGCGTCA
GGAACAAAATACTTGGAATTTGTGTCGTCCGTTGATGAGGCCGAAGCCTTAGCTGCAGCAAAAGGGATCAATTTGGCCATGGAGACTGGCATCTACCCAATTCAA
TTGGAGACCGATTCGTCCCAGATTTTCCACCTTGTCTGGCAAGAAGTGGAAGATCTCTCAGAAACGGGAGAAATTGTTGCGCGGGCGAAAGGAATGTTACCTAAT
GGAGATCAGCGGCGCTTCTCCTTTACTAGAAGGGGCGGAAATGGAGCTGCTCACAATCTAGCGAGAAGAGCTTTGCTGCAGCAGGAAAATCGGACCACTCAAGAT
GGAAGTGAGGTCGCTCCATCTCAGGAATTGACAACACCTTTACCTCAGGAGGAAACCATTCCTGGCTCGCAAGAGGTTGATCTGCTGACCTCAAGATCGAGGTTG
CGAAGTAGCACTTCGTCATTCAAGGGCTGGGCGAAGCCCTCCACTCTTTCTATTGGCCTTCGGGTGGGCGAGAGAAGAGCTGAGGTGGCTGATGTCAAGATTGGA
ACAAAACTCCTTGAACTTTGTGTTGTCAAATTACTTACCATCGTCTGTCACTATGACCTTTGGGTCAGACACCTTAGAAAGGCAAAGAAATCAAATTCAAAATTG
TACAATTTCTCATCTTTAAGTATGTATTGCGCTGCTTCAAATCGGTCAGGATATCACCTCAAAGATGCGAAGGACTGCACTGCTCTAGCAATTTTTCTGGTAACT
TATGAAGACGGGAAATTAGGGTTCGTTAAACTCAGGGATCACAAATGGACGCTGATGGATGAAATTGAAACACATTTCTCCTCTATTGTGTGGTTCGGGCAACCA
GAAATATTGGTGGAGCAGGGCGACGAAACTTACGTCGTCGATCGGTTCGATCCGAAGGAAAAAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAAGTCAATGCGAGCTCAAAGTCTTAGGTCGTTCTCAGAGAGAATTGCATACCTGTCGAATGGGAGAAGTGCTCCTTTTATAGGCAGGCGAGGGGATACA
AGACCGTTAGGTTGCTTCGACATGCGGTTACGACTGTCGGGCCCTTCAAGTCGGTGGTCGGATCTGAGGCCGGAGTCCACAGCAGTGGGGGTAGTTGCTCGGGTG
GTGGGTTCCCCGATGAGCCAGATTGGAATCAGGGTATCTCATCCTTTAGTGGTTCCTCCTCTAGTGAATAGGCTGAATAGAGAAGTGGAGGAGATGCCAGAGAAC
TACCCTGGCCCCCTCCGTAGGCGTTACAACATCCTAGACTCCTTAGGTCTTTGTCTTCCTGTGGAAGGTGAGACTGCCGAGAATCCACTCAAGGGGTGTGTCGAC
CGATACGAGAAAGGTCAGAAAGTGGGGACCTTGTTGACCGACAAGCTGCTGTATGCATCCAGACTTTTGGATTACTGTCGGCTATCAAACTTGCCAATTGAGGCT
CTTCGACCTAACTCCGAATTAGCAATGGTGTGTGGTTTTGCTAACCCCGTAAAGCGAAAGTGTCCTAACAGTGGCCAATATGCGCTCAGGGCACACCCGTTGCTG
ATAGGGGCTAGGGTGATCCCCCCTCCCCACCCCCTTAATGCACTGGAGGTTTTGCCTTTGCGAGCTGTTCTTGAAGAGTTTTCCCCTAAGAAGACCAAGAGTAAG
AGGAGAAAGACTACCCATTCTGAAGGAGTCGCGATTGAGGTGAGTCTCAGTGGAATCATTGACCTTGCTCCTAAGTCGGCCATCCAACAGATGATCGACTACGCG
GTCGAGACTCATGTTGTTGTCTACCATGCTGTCGTTATGATGAAGGTCGAGTTGGACGATCGCGATTGGCTAACTAAGAAGGATAGGGAGACATCTTTTGCTGCT
GCCGCTGAGGGGGAGTTGAAGGAAGTTAAGGCTGAGGTTGAGGTGCTAAAAGTCGAGTGCGAGGTGTTGAAAACGGACCTCGAAAAAGCTCAAAAGGAAACGGTT
CATAACCTGACCCTGCTAAAAAGAGGTCATGCAATGGTAAAAGGCCTCAAAGTGGAGAAGTTCGAGCTGCTAAGGCGTAACATTACTTTGGCCAACAAATTCGTT
GCCTTTAAGGCCGAGATCGAGAAGCAAAAGGTCAAACTTTTAAGTGGTAAGTTCTTGGAGACCACCTTCCAAGCTCACCCTAACTTCGATGGCTTCGCCAAAGAC
TTTAGTGATGCGAGCATCAAGTTTCTGAGGGAAGAGATCAGGGGCTTGGCTCCTAACCTCGACCTCGGCTCCCTAAAAGATCACAGAGGTCAAGTTTTAGCGTCA
GGAACAAAATACTTGGAATTTGTGTCGTCCGTTGATGAGGCCGAAGCCTTAGCTGCAGCAAAAGGGATCAATTTGGCCATGGAGACTGGCATCTACCCAATTCAA
TTGGAGACCGATTCGTCCCAGATTTTCCACCTTGTCTGGCAAGAAGTGGAAGATCTCTCAGAAACGGGAGAAATTGTTGCGCGGGCGAAAGGAATGTTACCTAAT
GGAGATCAGCGGCGCTTCTCCTTTACTAGAAGGGGCGGAAATGGAGCTGCTCACAATCTAGCGAGAAGAGCTTTGCTGCAGCAGGAAAATCGGACCACTCAAGAT
GGAAGTGAGGTCGCTCCATCTCAGGAATTGACAACACCTTTACCTCAGGAGGAAACCATTCCTGGCTCGCAAGAGGTTGATCTGCTGACCTCAAGATCGAGGTTG
CGAAGTAGCACTTCGTCATTCAAGGGCTGGGCGAAGCCCTCCACTCTTTCTATTGGCCTTCGGGTGGGCGAGAGAAGAGCTGAGGTGGCTGATGTCAAGATTGGA
ACAAAACTCCTTGAACTTTGTGTTGTCAAATTACTTACCATCGTCTGTCACTATGACCTTTGGGTCAGACACCTTAGAAAGGCAAAGAAATCAAATTCAAAATTG
TACAATTTCTCATCTTTAAGTATGTATTGCGCTGCTTCAAATCGGTCAGGATATCACCTCAAAGATGCGAAGGACTGCACTGCTCTAGCAATTTTTCTGGTAACT
TATGAAGACGGGAAATTAGGGTTCGTTAAACTCAGGGATCACAAATGGACGCTGATGGATGAAATTGAAACACATTTCTCCTCTATTGTGTGGTTCGGGCAACCA
GAAATATTGGTGGAGCAGGGCGACGAAACTTACGTCGTCGATCGGTTCGATCCGAAGGAAAAAAATTGA
Protein sequenceShow/hide protein sequence
MPKSMRAQSLRSFSERIAYLSNGRSAPFIGRRGDTRPLGCFDMRLRLSGPSSRWSDLRPESTAVGVVARVVGSPMSQIGIRVSHPLVVPPLVNRLNREVEEMPEN
YPGPLRRRYNILDSLGLCLPVEGETAENPLKGCVDRYEKGQKVGTLLTDKLLYASRLLDYCRLSNLPIEALRPNSELAMVCGFANPVKRKCPNSGQYALRAHPLL
IGARVIPPPHPLNALEVLPLRAVLEEFSPKKTKSKRRKTTHSEGVAIEVSLSGIIDLAPKSAIQQMIDYAVETHVVVYHAVVMMKVELDDRDWLTKKDRETSFAA
AAEGELKEVKAEVEVLKVECEVLKTDLEKAQKETVHNLTLLKRGHAMVKGLKVEKFELLRRNITLANKFVAFKAEIEKQKVKLLSGKFLETTFQAHPNFDGFAKD
FSDASIKFLREEIRGLAPNLDLGSLKDHRGQVLASGTKYLEFVSSVDEAEALAAAKGINLAMETGIYPIQLETDSSQIFHLVWQEVEDLSETGEIVARAKGMLPN
GDQRRFSFTRRGGNGAAHNLARRALLQQENRTTQDGSEVAPSQELTTPLPQEETIPGSQEVDLLTSRSRLRSSTSSFKGWAKPSTLSIGLRVGERRAEVADVKIG
TKLLELCVVKLLTIVCHYDLWVRHLRKAKKSNSKLYNFSSLSMYCAASNRSGYHLKDAKDCTALAIFLVTYEDGKLGFVKLRDHKWTLMDEIETHFSSIVWFGQP
EILVEQGDETYVVDRFDPKEKN