CuGenDBv2

Moc05g11330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g11330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Genome location: chr5:8823829..8838462
RNA-Seq Expression: Moc05g11330
Synteny: Moc05g11330
Gene Ontology terms: GO:0005737 - cytoplasm (cellular component)
InterPro domains: NA
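
As a rough illustration, the genome location string above can be split into chromosome, start, and end. This sketch is not part of CuGenDB: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; the format is inferred from
    the 'Genome location' field of this record.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr5:8823829..8838462")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 14634 bp on chr5.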


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022145129.1 uncharacterized protein LOC111014646 [Momordica charantia]; e value: 7.4e-34; % identity: 43.72
Query:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN
        M  E S R SD+NC  +R+LN+DDP + G E        + TS+ N     ER EG  E   +  P  ++ +F  LE+KVEGM QRM+Q+ ++ E++E +
Subjt:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN

Query:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL
          PLV DPRK K P  SE                           Q++++Q K   P  G    +D+   E + +D  +P D+PE SEK  +QKEKGFDL
Subjt:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL

Query:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQ
        EEL+DQADSPFTEEIM EKVPPKFKLPT+KQ
Subjt:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQ

XP_022150097.1 uncharacterized protein LOC111018357 [Momordica charantia]; e value: 1.3e-41; % identity: 44.44
Query:  MGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGPKLPGEQASSPSKEKETVTEPNQPVEGEVSGPIAPQ--------------------
        MGQL NEL+SR   TLPS TEEPKRE KEHCKAITT+S LAYE PK+PGEQASSPSKEKET TEP+QPVE EVS P+A Q                    
Subjt:  MGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGPKLPGEQASSPSKEKETVTEPNQPVEGEVSGPIAPQ--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----QVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITEGLEQELEAMEKER-ANTFLSPMEKFEFGDLSKDEF
             QVTFNVLDAMRL DE+EECS IQITN VSMEEFCDL+  GLEQELE  +KER A+TFLSPMEKFEFGDLS DEF
Subjt:  -----QVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITEGLEQELEAMEKER-ANTFLSPMEKFEFGDLSKDEF

XP_022157836.1 uncharacterized protein LOC111024449 [Momordica charantia]; e value: 6.7e-35; % identity: 64.84
Query:  KSMMLQMLNTICQFYDERIHDKDDTAMREFTIRNDTAIRDYTSRNDAAIRNLEAHMGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGP
        +SMM + + ++ + +  R    +DTAM+EFT RNDTAIRDYTSRNDAA+RNLEA MGQLA+EL++RP GTLPS TEEPK E +EHCK ITTRSGLAYE P
Subjt:  KSMMLQMLNTICQFYDERIHDKDDTAMREFTIRNDTAIRDYTSRNDAAIRNLEAHMGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGP

Query:  KLPGEQASSPSKEKETVTEPNQPVEGEV
        K+P E +S P+KEKET TEP++P+E E+
Subjt:  KLPGEQASSPSKEKETVTEPNQPVEGEV

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]; e value: 3.2e-61; % identity: 45.25
Query:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN
        M  E S R SD++C  +R+LN+ DP + G E        + TS+ N     ER EG  E   +  P   + +F  LE+K               E  EV 
Subjt:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN

Query:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL
          PLV DP+K K P  S+                           ++ ++Q KS  P  G +  +D  + E I +D  +P DRPE SEK  S KEKGFDL
Subjt:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL

Query:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAYREWMDIYGVTEAIRCRVFSFTLSGSAR----------------------------
        EEL+DQADSPFTEEIM EKVPPKFKLPT+KQFD +TDP+DHLDAYREWMDIYGV+EA+RCRVFS TL+GSAR                            
Subjt:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAYREWMDIYGVTEAIRCRVFSFTLSGSAR----------------------------

Query:  ---------------RATESLKDYVARFNEEKLQVERLSDVVSLLAFMSGIKDEHLSF
                       R TESL+DYVARFNEEKLQVE L+D VSLLAFMSG++DEHLSF
Subjt:  ---------------RATESLKDYVARFNEEKLQVERLSDVVSLLAFMSGIKDEHLSF

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]; e value: 9.2e-53; % identity: 56.81
Query:  QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDLEELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAY
        Q+I++Q KSL P  G++  +D  + E I ++  +P DRPE SEK  +QKEKGFDLEEL+ QADSPFTEEIM EKVPPKFKLPT+K FDG T+P+DHLDAY
Subjt:  QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDLEELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAY

Query:  REWMDIYGVTEAIRCRVFSFTLSGSAR-------------------------------------------RATESLKDYVARFNEEKLQVERLSDVVSLL
        REWMDIYGV++AIRCRVFS TL+GSAR                                           R  ESL DYVARFNEEKLQ+E L+D VSLL
Subjt:  REWMDIYGVTEAIRCRVFSFTLSGSAR-------------------------------------------RATESLKDYVARFNEEKLQVERLSDVVSLL

Query:  AFMSGIKDEHLSF
        AFMSG++DEHLSF
Subjt:  AFMSGIKDEHLSF

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1D9S6 uncharacterized protein LOC111018357; e value: 6.1e-42; % identity: 44.44
Query:  MGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGPKLPGEQASSPSKEKETVTEPNQPVEGEVSGPIAPQ--------------------
        MGQL NEL+SR   TLPS TEEPKRE KEHCKAITT+S LAYE PK+PGEQASSPSKEKET TEP+QPVE EVS P+A Q                    
Subjt:  MGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGPKLPGEQASSPSKEKETVTEPNQPVEGEVSGPIAPQ--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----QVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITEGLEQELEAMEKER-ANTFLSPMEKFEFGDLSKDEF
             QVTFNVLDAMRL DE+EECS IQITN VSMEEFCDL+  GLEQELE  +KER A+TFLSPMEKFEFGDLS DEF
Subjt:  -----QVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITEGLEQELEAMEKER-ANTFLSPMEKFEFGDLSKDEF

A0A6J1DWY0 uncharacterized protein LOC111025293; e value: 1.5e-61; % identity: 45.25
Query:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN
        M  E S R SD++C  +R+LN+ DP + G E        + TS+ N     ER EG  E   +  P   + +F  LE+K               E  EV 
Subjt:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN

Query:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL
          PLV DP+K K P  S+                           ++ ++Q KS  P  G +  +D  + E I +D  +P DRPE SEK  S KEKGFDL
Subjt:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL

Query:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAYREWMDIYGVTEAIRCRVFSFTLSGSAR----------------------------
        EEL+DQADSPFTEEIM EKVPPKFKLPT+KQFD +TDP+DHLDAYREWMDIYGV+EA+RCRVFS TL+GSAR                            
Subjt:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAYREWMDIYGVTEAIRCRVFSFTLSGSAR----------------------------

Query:  ---------------RATESLKDYVARFNEEKLQVERLSDVVSLLAFMSGIKDEHLSF
                       R TESL+DYVARFNEEKLQVE L+D VSLLAFMSG++DEHLSF
Subjt:  ---------------RATESLKDYVARFNEEKLQVERLSDVVSLLAFMSGIKDEHLSF

A0A6J1DZC3 uncharacterized protein LOC111024449; e value: 3.2e-35; % identity: 64.84
Query:  KSMMLQMLNTICQFYDERIHDKDDTAMREFTIRNDTAIRDYTSRNDAAIRNLEAHMGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGP
        +SMM + + ++ + +  R    +DTAM+EFT RNDTAIRDYTSRNDAA+RNLEA MGQLA+EL++RP GTLPS TEEPK E +EHCK ITTRSGLAYE P
Subjt:  KSMMLQMLNTICQFYDERIHDKDDTAMREFTIRNDTAIRDYTSRNDAAIRNLEAHMGQLANELRSRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGP

Query:  KLPGEQASSPSKEKETVTEPNQPVEGEV
        K+P E +S P+KEKET TEP++P+E E+
Subjt:  KLPGEQASSPSKEKETVTEPNQPVEGEV

A0A6J1DZC3 uncharacterized protein LOC111024449; e value: 7.7e-05; % identity: 36.84
Query:  EGEVSGPIAPQQVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITEGLEQELEAMEKERANTFLSPMEKFEFGDLSKDEFKAMQPSIIEPP
        +GE++  +  Q+VTFN+LDAM+  D++EEC+ I I   ++  E  DL+   +E +LE  EKE   T ++         L K++ K++ P  IEPP
Subjt:  EGEVSGPIAPQQVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITEGLEQELEAMEKERANTFLSPMEKFEFGDLSKDEFKAMQPSIIEPP

A0A6J1DZC3 uncharacterized protein LOC111024449; e value: 3.6e-34; % identity: 43.72
Query:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN
        M  E S R SD+NC  +R+LN+DDP + G E        + TS+ N     ER EG  E   +  P  ++ +F  LE+KVEGM QRM+Q+ ++ E++E +
Subjt:  MNLEESLRCSDENCPTRRKLNMDDPLLEGVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVN

Query:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL
          PLV DPRK K P  SE                           Q++++Q K   P  G    +D+   E + +D  +P D+PE SEK  +QKEKGFDL
Subjt:  LPPLVTDPRKDKAPELSE---------------------------QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDL

Query:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQ
        EEL+DQADSPFTEEIM EKVPPKFKLPT+KQ
Subjt:  EELIDQADSPFTEEIMGEKVPPKFKLPTIKQ

A0A6J1E1E7 uncharacterized protein LOC111025548; e value: 4.5e-53; % identity: 56.81
Query:  QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDLEELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAY
        Q+I++Q KSL P  G++  +D  + E I ++  +P DRPE SEK  +QKEKGFDLEEL+ QADSPFTEEIM EKVPPKFKLPT+K FDG T+P+DHLDAY
Subjt:  QQIRRQSKSLIPTRGNERQNDQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDLEELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAY

Query:  REWMDIYGVTEAIRCRVFSFTLSGSAR-------------------------------------------RATESLKDYVARFNEEKLQVERLSDVVSLL
        REWMDIYGV++AIRCRVFS TL+GSAR                                           R  ESL DYVARFNEEKLQ+E L+D VSLL
Subjt:  REWMDIYGVTEAIRCRVFSFTLSGSAR-------------------------------------------RATESLKDYVARFNEEKLQVERLSDVVSLL

Query:  AFMSGIKDEHLSF
        AFMSG++DEHLSF
Subjt:  AFMSGIKDEHLSF

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
No hits found
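
In the alignments above, the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts identities in one such row, taken from the last block of the first GenBank alignment. The function name `midline_identity` is hypothetical, and the result covers only that single row, so it is not expected to equal the % identity reported for the whole hit.

```python
def midline_identity(midline):
    """Return (identical columns, total columns) for one alignment midline.

    Letters in the midline mark identical residues; '+' and spaces do not
    count as identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline)

# Last block of the XP_022145129.1 alignment (query row and midline).
query = "EELIDQADSPFTEEIMGEKVPPKFKLPTIKQ"
midline = "EEL+DQADSPFTEEIM EKVPPKFKLPT+KQ"

identical, columns = midline_identity(midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.1f}%)")
```

For a whole-hit figure, the same count would be summed over every block of the alignment before dividing by the total number of columns.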

Sequences
CDS sequence
ATGGCAGAAGAAGTGTCAAATCCAATCCTATTGGCCGATCGAAGAGATATTGAAATGCACAACTACGCGACCACTACTTTGCAAGGCTTAAATTCGGGCATTACAAATCC
AATTTCAGATGATGCCCAATTTGAATTCAAGTCGATGATGCTCCAAATGCTTAACACTATTTGCCAGTTTTATGATGAAAGAATTCACGACAAGGATGACACAGCTATGC
GAGAATTCACGATAAGAAATGACACAGCTATACGAGATTACACGTCTAGAAATGACGCAGCTATACGAAATTTGGAAGCTCATATGGGGCAGCTTGCCAATGAACTTAGG
AGTCGACCTCTAGGGACATTGCCAAGTACCACAGAGGAACCAAAGAGAGAGGATAAGGAGCATTGCAAAGCCATTACAACCCGGAGCGGGTTAGCTTATGAAGGACCAAA
ACTGCCGGGTGAACAAGCATCCAGTCCTTCAAAGGAAAAAGAGACTGTAACAGAACCTAACCAACCAGTAGAAGGTGAGGTAAGTGGGCCAATTGCCCCGCAGCAAGTGA
CATTCAACGTACTGGATGCTATGCGGCTTTCAGATGAAGTAGAAGAGTGCTCAAATATTCAGATAACTAACCCTGTGTCTATGGAAGAATTTTGTGATTTGATAACTGAA
GGTTTAGAGCAAGAATTAGAAGCAATGGAAAAAGAAAGAGCAAACACTTTCTTGTCACCGATGGAGAAGTTTGAGTTTGGAGATTTGAGCAAAGATGAATTTAAGGCGAT
GCAACCATCAATCATTGAGCCGCCTCGACTAGAACAAAAACCCTTGCCTACTCATTTAAAATATCCATACTTAGGAGAAAATGAAACTTTACATGTAATCATCTCTGCAA
CGTTAACTAACGAACATGAAATTTTGTTGTTACAGACCTCCGCTTCTTTGCCATCGTCTTGTCCAGTAGGTCGTCGGCGTTTTCAACTGTCCGATGAAGAAAGCCCATGG
CCGCCTGTCGAAATCCCTCACCGCAAGCGTACCTCTACTGACCACCGTGGCCGTCGGCAGCTCACTTCTAGTCTAGGGGTATGGGAGTGCTTCGAAATCCATGGTAGCCT
GAGTCCAAACAAAAAAAAATTAAGGAGGGATTCAAACCGGAAGATACTGCTATTGAGTTCCGACGAAGAGGATGAGAACGATTCGAACCATGATTCTCCTATGCGACTAC
CACAGTGGACCCGAACATTTGCTCGAACTAAATTAGCAGCGGGACACGCCAATAAGGTAAAATTTATGCACTTACCTACTGACTTAGTTGAAGATATGTTTCAGGGCGGC
AATATTGTTGTTGATAATGAGATTGGAACAAAGCGAGACAATGTGTTGGCCACAACAAATGCCTATGTTGGTACAAAGAATGCGGCTGGCACATCAGATGGTCAATTTCG
AACATCAGAGACCATTGTTCCAGGTATTTGTAGGCATTCCACATCTCAATTGGTTTATTACAAGAGGCGGTTAGTGTGCATAAGTGAGACAGAAAAGGACAACAGCATTG
AGACCGACCCTAGCATGGGAGAAAGATTGGGGAGTGTTTCGGTAGATTCTCCAAGCCAAGGACTAGACACTGACAAGGTTGAAGCAGGATGCCTTAATGGTGATGATAGT
GATGTGCAATTAGTGGAGGAGTCGGGGGGAACAATGAAATCTACAAGGATAAGTTCAACATCACCAGAGATGGAGGACAATCCCATGATCGAACAATCCAGCCCATCATC
TATATCAAGACCACCCAGCCGCCATAGAGTTATTAACCTTAGCAAAGCCGCTTACGAGAGGAATATGGCTAAGTTTAAACAACGCAAGATATTGCAGGAACGTGGTTTTC
AAACCGATGACCCTGCATTACCTCCATTCATTAAGGAGTTGATCAATCAACGAGTTAACTATGGTCAGCTAATTCGAGATTGTATACTTGATCGTGCTACCCGGTCGATC
GGCAAGTTGTTCTTTCCAAGTTTAATAACAGAATTGTGCCTCAAGCAAGGTGTCCATATAGATCTTGAAGAGTTCGAGAAGCCGACTGGGGTAGAACTTTTAACTACTCA
TGTAAAGAGATTAACTGCTCATTTTGCACAATACAACCCAACTGTCTCGGAGTTTCCAATCAACCTGGGTGGTGCTTCGACTTCTCAGCCACCTCAGCATGGTGATGATT
CTGATGAGAAGGATGATGTTGATTTTCTTGTTGGCCACATGTTTAAATCACATTGGTTTGAATTTAATTCTGAAAAGGTTAGAGTCAAACGGATCTCATCAAGGTGGTTA
AGCAAGCAAATAATAAATGATGTGAAGGAACAACAAGATATTGAGATTGAACGCCATCCAAGAAAGGAAGATGTTGACAGTCAACCAATTGCTAGAATTAGAGATCAATA
CCTTTCAACGGTTTATATTTGTCTTGATGGATACCAACGAGTTTTCATTGCTGGCTATAGCATCAATGTCATTCTCTTTAAACAACCAAATTGTCTGATATCTTCTAATA
ATTCCCTCTACGGGATTTGTTACGGGATTGGATCCAGCCAAGAAAATGGGCCTCTCCGGACCGGAGAGACCCGAACCAACGACAGACTGAGCAAGTCAGGGCTCGGACCC
AACCCCAGCTCGGCCCCATTGGCCAAGCCCGCTTTCACTCCTCTCCCGTTGGGTGCGGTGCCTCGGTCCATGAATCCTTGTGATCCAGTAACGACAGATAGATGGGCTCA
ACGAGAAGGTTCCAGAAGGCAAAACTCCTATAGATGGAACCACTGGCTGAACCCTAGAGAACCAAAGAGGCCAAATTTCCAGCATCAACAATTAGCGTCGTCTGTGGGGA
CGTTACAAACCATACGACCAAGCAACATGAATCTGGAGGAAAGTCTGCGATGCTCTGATGAGAATTGCCCTACGAGGAGGAAGCTGAACATGGATGACCCTCTGCTAGAA
GGAGTGGAAGGCGAGACGAGCCTGCAGATTGTCGAACAAACTTCTAGAGAGAATGATCGACAAGTTCCAGAACGTATAGAGGGGTCGGTCGAGATCTCTACGGTTGAAGT
TCCTGGACAGATCGAGGGCAAGTTCACGGCTTTGGAAGAGAAGGTTGAGGGAATGTATCAACGCATGTCCCAACTATTCCAGCGATTGGAACAGCGAGAAGTGAACCTGC
CGCCCCTCGTCACGGATCCTAGGAAGGATAAAGCCCCAGAGTTGAGCGAGCAGCAAATCAGAAGGCAATCGAAGTCCCTGATACCCACAAGAGGCAATGAACGCCAAAAT
GACCAAGGAGACCTCGAACTTATCGACATCGACAACGAAAGACCGTGGGATCGACCCGAGCCATCGGAGAAGCTGTTAAGCCAGAAGGAGAAGGGATTCGACCTCGAAGA
ATTGATAGACCAGGCCGACTCTCCATTCACGGAGGAGATAATGGGAGAAAAGGTCCCTCCAAAATTTAAGCTCCCGACCATAAAGCAGTTCGATGGGTCGACTGACCCGA
TAGACCACCTCGACGCCTACCGAGAATGGATGGACATCTACGGGGTAACCGAGGCCATTAGATGCCGAGTGTTTTCCTTTACCCTAAGTGGCTCGGCCAGGAGAGCAACA
GAAAGCCTTAAAGACTACGTGGCTCGATTTAATGAGGAAAAGCTACAAGTTGAAAGATTAAGTGACGTGGTGTCGCTTCTAGCTTTCATGTCTGGCATAAAAGATGAGCA
TCTATCGTTCTAG
mRNA sequence
ATGGCAGAAGAAGTGTCAAATCCAATCCTATTGGCCGATCGAAGAGATATTGAAATGCACAACTACGCGACCACTACTTTGCAAGGCTTAAATTCGGGCATTACAAATCC
AATTTCAGATGATGCCCAATTTGAATTCAAGTCGATGATGCTCCAAATGCTTAACACTATTTGCCAGTTTTATGATGAAAGAATTCACGACAAGGATGACACAGCTATGC
GAGAATTCACGATAAGAAATGACACAGCTATACGAGATTACACGTCTAGAAATGACGCAGCTATACGAAATTTGGAAGCTCATATGGGGCAGCTTGCCAATGAACTTAGG
AGTCGACCTCTAGGGACATTGCCAAGTACCACAGAGGAACCAAAGAGAGAGGATAAGGAGCATTGCAAAGCCATTACAACCCGGAGCGGGTTAGCTTATGAAGGACCAAA
ACTGCCGGGTGAACAAGCATCCAGTCCTTCAAAGGAAAAAGAGACTGTAACAGAACCTAACCAACCAGTAGAAGGTGAGGTAAGTGGGCCAATTGCCCCGCAGCAAGTGA
CATTCAACGTACTGGATGCTATGCGGCTTTCAGATGAAGTAGAAGAGTGCTCAAATATTCAGATAACTAACCCTGTGTCTATGGAAGAATTTTGTGATTTGATAACTGAA
GGTTTAGAGCAAGAATTAGAAGCAATGGAAAAAGAAAGAGCAAACACTTTCTTGTCACCGATGGAGAAGTTTGAGTTTGGAGATTTGAGCAAAGATGAATTTAAGGCGAT
GCAACCATCAATCATTGAGCCGCCTCGACTAGAACAAAAACCCTTGCCTACTCATTTAAAATATCCATACTTAGGAGAAAATGAAACTTTACATGTAATCATCTCTGCAA
CGTTAACTAACGAACATGAAATTTTGTTGTTACAGACCTCCGCTTCTTTGCCATCGTCTTGTCCAGTAGGTCGTCGGCGTTTTCAACTGTCCGATGAAGAAAGCCCATGG
CCGCCTGTCGAAATCCCTCACCGCAAGCGTACCTCTACTGACCACCGTGGCCGTCGGCAGCTCACTTCTAGTCTAGGGGTATGGGAGTGCTTCGAAATCCATGGTAGCCT
GAGTCCAAACAAAAAAAAATTAAGGAGGGATTCAAACCGGAAGATACTGCTATTGAGTTCCGACGAAGAGGATGAGAACGATTCGAACCATGATTCTCCTATGCGACTAC
CACAGTGGACCCGAACATTTGCTCGAACTAAATTAGCAGCGGGACACGCCAATAAGGTAAAATTTATGCACTTACCTACTGACTTAGTTGAAGATATGTTTCAGGGCGGC
AATATTGTTGTTGATAATGAGATTGGAACAAAGCGAGACAATGTGTTGGCCACAACAAATGCCTATGTTGGTACAAAGAATGCGGCTGGCACATCAGATGGTCAATTTCG
AACATCAGAGACCATTGTTCCAGGTATTTGTAGGCATTCCACATCTCAATTGGTTTATTACAAGAGGCGGTTAGTGTGCATAAGTGAGACAGAAAAGGACAACAGCATTG
AGACCGACCCTAGCATGGGAGAAAGATTGGGGAGTGTTTCGGTAGATTCTCCAAGCCAAGGACTAGACACTGACAAGGTTGAAGCAGGATGCCTTAATGGTGATGATAGT
GATGTGCAATTAGTGGAGGAGTCGGGGGGAACAATGAAATCTACAAGGATAAGTTCAACATCACCAGAGATGGAGGACAATCCCATGATCGAACAATCCAGCCCATCATC
TATATCAAGACCACCCAGCCGCCATAGAGTTATTAACCTTAGCAAAGCCGCTTACGAGAGGAATATGGCTAAGTTTAAACAACGCAAGATATTGCAGGAACGTGGTTTTC
AAACCGATGACCCTGCATTACCTCCATTCATTAAGGAGTTGATCAATCAACGAGTTAACTATGGTCAGCTAATTCGAGATTGTATACTTGATCGTGCTACCCGGTCGATC
GGCAAGTTGTTCTTTCCAAGTTTAATAACAGAATTGTGCCTCAAGCAAGGTGTCCATATAGATCTTGAAGAGTTCGAGAAGCCGACTGGGGTAGAACTTTTAACTACTCA
TGTAAAGAGATTAACTGCTCATTTTGCACAATACAACCCAACTGTCTCGGAGTTTCCAATCAACCTGGGTGGTGCTTCGACTTCTCAGCCACCTCAGCATGGTGATGATT
CTGATGAGAAGGATGATGTTGATTTTCTTGTTGGCCACATGTTTAAATCACATTGGTTTGAATTTAATTCTGAAAAGGTTAGAGTCAAACGGATCTCATCAAGGTGGTTA
AGCAAGCAAATAATAAATGATGTGAAGGAACAACAAGATATTGAGATTGAACGCCATCCAAGAAAGGAAGATGTTGACAGTCAACCAATTGCTAGAATTAGAGATCAATA
CCTTTCAACGGTTTATATTTGTCTTGATGGATACCAACGAGTTTTCATTGCTGGCTATAGCATCAATGTCATTCTCTTTAAACAACCAAATTGTCTGATATCTTCTAATA
ATTCCCTCTACGGGATTTGTTACGGGATTGGATCCAGCCAAGAAAATGGGCCTCTCCGGACCGGAGAGACCCGAACCAACGACAGACTGAGCAAGTCAGGGCTCGGACCC
AACCCCAGCTCGGCCCCATTGGCCAAGCCCGCTTTCACTCCTCTCCCGTTGGGTGCGGTGCCTCGGTCCATGAATCCTTGTGATCCAGTAACGACAGATAGATGGGCTCA
ACGAGAAGGTTCCAGAAGGCAAAACTCCTATAGATGGAACCACTGGCTGAACCCTAGAGAACCAAAGAGGCCAAATTTCCAGCATCAACAATTAGCGTCGTCTGTGGGGA
CGTTACAAACCATACGACCAAGCAACATGAATCTGGAGGAAAGTCTGCGATGCTCTGATGAGAATTGCCCTACGAGGAGGAAGCTGAACATGGATGACCCTCTGCTAGAA
GGAGTGGAAGGCGAGACGAGCCTGCAGATTGTCGAACAAACTTCTAGAGAGAATGATCGACAAGTTCCAGAACGTATAGAGGGGTCGGTCGAGATCTCTACGGTTGAAGT
TCCTGGACAGATCGAGGGCAAGTTCACGGCTTTGGAAGAGAAGGTTGAGGGAATGTATCAACGCATGTCCCAACTATTCCAGCGATTGGAACAGCGAGAAGTGAACCTGC
CGCCCCTCGTCACGGATCCTAGGAAGGATAAAGCCCCAGAGTTGAGCGAGCAGCAAATCAGAAGGCAATCGAAGTCCCTGATACCCACAAGAGGCAATGAACGCCAAAAT
GACCAAGGAGACCTCGAACTTATCGACATCGACAACGAAAGACCGTGGGATCGACCCGAGCCATCGGAGAAGCTGTTAAGCCAGAAGGAGAAGGGATTCGACCTCGAAGA
ATTGATAGACCAGGCCGACTCTCCATTCACGGAGGAGATAATGGGAGAAAAGGTCCCTCCAAAATTTAAGCTCCCGACCATAAAGCAGTTCGATGGGTCGACTGACCCGA
TAGACCACCTCGACGCCTACCGAGAATGGATGGACATCTACGGGGTAACCGAGGCCATTAGATGCCGAGTGTTTTCCTTTACCCTAAGTGGCTCGGCCAGGAGAGCAACA
GAAAGCCTTAAAGACTACGTGGCTCGATTTAATGAGGAAAAGCTACAAGTTGAAAGATTAAGTGACGTGGTGTCGCTTCTAGCTTTCATGTCTGGCATAAAAGATGAGCA
TCTATCGTTCTAG
Protein sequence
MAEEVSNPILLADRRDIEMHNYATTTLQGLNSGITNPISDDAQFEFKSMMLQMLNTICQFYDERIHDKDDTAMREFTIRNDTAIRDYTSRNDAAIRNLEAHMGQLANELR
SRPLGTLPSTTEEPKREDKEHCKAITTRSGLAYEGPKLPGEQASSPSKEKETVTEPNQPVEGEVSGPIAPQQVTFNVLDAMRLSDEVEECSNIQITNPVSMEEFCDLITE
GLEQELEAMEKERANTFLSPMEKFEFGDLSKDEFKAMQPSIIEPPRLEQKPLPTHLKYPYLGENETLHVIISATLTNEHEILLLQTSASLPSSCPVGRRRFQLSDEESPW
PPVEIPHRKRTSTDHRGRRQLTSSLGVWECFEIHGSLSPNKKKLRRDSNRKILLLSSDEEDENDSNHDSPMRLPQWTRTFARTKLAAGHANKVKFMHLPTDLVEDMFQGG
NIVVDNEIGTKRDNVLATTNAYVGTKNAAGTSDGQFRTSETIVPGICRHSTSQLVYYKRRLVCISETEKDNSIETDPSMGERLGSVSVDSPSQGLDTDKVEAGCLNGDDS
DVQLVEESGGTMKSTRISSTSPEMEDNPMIEQSSPSSISRPPSRHRVINLSKAAYERNMAKFKQRKILQERGFQTDDPALPPFIKELINQRVNYGQLIRDCILDRATRSI
GKLFFPSLITELCLKQGVHIDLEEFEKPTGVELLTTHVKRLTAHFAQYNPTVSEFPINLGGASTSQPPQHGDDSDEKDDVDFLVGHMFKSHWFEFNSEKVRVKRISSRWL
SKQIINDVKEQQDIEIERHPRKEDVDSQPIARIRDQYLSTVYICLDGYQRVFIAGYSINVILFKQPNCLISSNNSLYGICYGIGSSQENGPLRTGETRTNDRLSKSGLGP
NPSSAPLAKPAFTPLPLGAVPRSMNPCDPVTTDRWAQREGSRRQNSYRWNHWLNPREPKRPNFQHQQLASSVGTLQTIRPSNMNLEESLRCSDENCPTRRKLNMDDPLLE
GVEGETSLQIVEQTSRENDRQVPERIEGSVEISTVEVPGQIEGKFTALEEKVEGMYQRMSQLFQRLEQREVNLPPLVTDPRKDKAPELSEQQIRRQSKSLIPTRGNERQN
DQGDLELIDIDNERPWDRPEPSEKLLSQKEKGFDLEELIDQADSPFTEEIMGEKVPPKFKLPTIKQFDGSTDPIDHLDAYREWMDIYGVTEAIRCRVFSFTLSGSARRAT
ESLKDYVARFNEEKLQVERLSDVVSLLAFMSGIKDEHLSF
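
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard genetic code, the `translate` helper is hypothetical, and for brevity it uses just the first 36 and last 21 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 36 nt and last 21 nt of the CDS listed in this record.
cds_start = "ATGGCAGAAGAAGTGTCAAATCCAATCCTATTGGCC"
cds_end = "GATGAGCATCTATCGTTCTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end, plus the stop codon
```

The two fragments translate to MAEEVSNPILLA and DEHLSF*, which agree with the first and last residues of the protein sequence listed above.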