; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr4:11796560..11805298
RNA-Seq ExpressionMoc04g15760
SyntenyMoc04g15760
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156271.1 uncharacterized protein LOC111023203 [Momordica charantia]4.8e-3854.44Show/hide
Query:  MEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLLRCLDPEEAR----------------------------------
        M+VQ  Q TWMD IKNFL+SG VP D SQARKL+ Q+AHYLMQE KLFKRGYSLPLLRCLDPE A                                   
Subjt:  MEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLLRCLDPEEAR----------------------------------

Query:  --------------------YTPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN
                             TPYDL  GSEAV+PVE+GM  PRVE+F+EQTNS+AL VNLDLL EKRSHSQLRLAEY N
Subjt:  --------------------YTPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN

XP_022156986.1 uncharacterized protein LOC111023816 [Momordica charantia]5.1e-5641.44Show/hide
Query:  RLRAEARGLLTQFEDCVIRQVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAA
        R  A+AR LL QFED VIRQVPRSENSNAD LARLASAY+ DL R VPVEIL ESS+DQPE+ME+ S Q TWMDPIK+FL++G VP D SQARKL+RQAA
Subjt:  RLRAEARGLLTQFEDCVIRQVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAA

Query:  HYLMQEGKLFKRGYSLPLLRCLDPEEARY-----------------------------------------------------------------------
        HYLMQEGKLFKRGYSLPLLRCLDPEEARY                                                                       
Subjt:  HYLMQEGKLFKRGYSLPLLRCLDPEEARY-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------TPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN
                       TPY L FGSEAVVP EVGMP PRVE+FDEQ NS+AL VNLDLLEEKRSHSQLRLAEYQN
Subjt:  ---------------TPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN

XP_022158215.1 uncharacterized protein LOC111024751 [Momordica charantia]3.5e-4180.91Show/hide
Query:  QVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLL
        +VPRSENSNADALA LASAYE DL R VPVEIL ESSIDQPE+M++ S + TWMDPIK+FL++G +P D SQARKL+RQAAHYLMQEGKLFKRGYSLPLL
Subjt:  QVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLL

Query:  RCLDPEEARY
        RCLDPEEARY
Subjt:  RCLDPEEARY

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.9e-5943.23Show/hide
Query:  ANSVEEFTSRLDSELEEEEIENFRFSDDNGDDSDTSTSGQGLEYPSQMPENYLGPLRRRYSIPDDIVLRLPRERERADNLPEECVTLYLKMFEYDFRLPV
        +N   +   RL+S+L  EEIEN R SDD G+DSD STSGQGLEYPS++PE+YLG LRR ++IP++I+LRLP E ERADN PE  VTLY KMFEY  RLP+
Subjt:  ANSVEEFTSRLDSELEEEEIENFRFSDDNGDDSDTSTSGQGLEYPSQMPENYLGPLRRRYSIPDDIVLRLPRERERADNLPEECVTLYLKMFEYDFRLPV

Query:  HPLVQEFLV------------------------------LDDLDLLGVEQLLACFEVKRISKK-------------------------------------
        HP VQEFL                                ++ +L  V+QLLACFE KRI+KK                                     
Subjt:  HPLVQEFLV------------------------------LDDLDLLGVEQLLACFEVKRISKK-------------------------------------

Query:  HENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRK---RL
         ++ES   F++VP RFGNLV++RP+P+L++  F   KY+K++F  G+++ TLVT++LLL   LLDYNP + P E+ RPNSELAMVCGF+  VKRK   R 
Subjt:  HENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRK---RL

Query:  CTKQAAKSKEAPSPMVVGLPAE-----VEGDPEEGFSQEVQKEEEKD
           +AA+S +  +P VVG  +E     +E +   G S+E +  ++ +
Subjt:  CTKQAAKSKEAPSPMVVGLPAE-----VEGDPEEGFSQEVQKEEEKD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.3e-4839.78Show/hide
Query:  ENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRKRLCTKQ
        ++ES   F++VP RFGNLV+++ IP+L++  F   K++KD F   ++I TLVT+KLLL   LLDYNPL+   EA RPNSELAMVCGF+ +VKRK      
Subjt:  ENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRKRLCTKQ

Query:  AAKSKEAPSPMVVGLPAE----------------VEGDPEEGFSQEVQKEEEKDL-------SLRGR---------------GEVG--------------
        A K+     P+   +P                  +E D   G S E +  EE +         +RG                 E G              
Subjt:  AAKSKEAPSPMVVGLPAE----------------VEGDPEEGFSQEVQKEEEKDL-------SLRGR---------------GEVG--------------

Query:  -------GTSNVDLRFKVELSNVGVRERAMEISGSYFDRCWRRASKFVSAPGSVIQRLLDSTAEAHAAACQTAFMVKVELEGRDLLTVKERDASSTVLEA
               GTSNV +RF +E S+ GV+++   IS +  DR  RRASKFVS PGSV+QR +D+ AEA  A+   A MVK EL+GR+ L  KER+ S   LEA
Subjt:  -------GTSNVDLRFKVELSNVGVRERAMEISGSYFDRCWRRASKFVSAPGSVIQRLLDSTAEAHAAACQTAFMVKVELEGRDLLTVKERDASSTVLEA

Query:  AAALEGELKEARAEAQAWKSSSDADKAELKSAQAEAARHLENLRGTHAVAKYLEKEKFALMK
        A  L+GEL +A+ E    ++  DA    LK    E  +H  +LR  HA+ K LEKEKF L+K
Subjt:  AAALEGELKEARAEAQAWKSSSDADKAELKSAQAEAARHLENLRGTHAVAKYLEKEKFALMK

TrEMBL top hitse value%identityAlignment
A0A6J1DUF4 uncharacterized protein LOC1110232032.3e-3854.44Show/hide
Query:  MEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLLRCLDPEEAR----------------------------------
        M+VQ  Q TWMD IKNFL+SG VP D SQARKL+ Q+AHYLMQE KLFKRGYSLPLLRCLDPE A                                   
Subjt:  MEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLLRCLDPEEAR----------------------------------

Query:  --------------------YTPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN
                             TPYDL  GSEAV+PVE+GM  PRVE+F+EQTNS+AL VNLDLL EKRSHSQLRLAEY N
Subjt:  --------------------YTPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN

A0A6J1DV78 Ribonuclease H1.7e-4180.91Show/hide
Query:  QVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLL
        +VPRSENSNADALA LASAYE DL R VPVEIL ESSIDQPE+M++ S + TWMDPIK+FL++G +P D SQARKL+RQAAHYLMQEGKLFKRGYSLPLL
Subjt:  QVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLL

Query:  RCLDPEEARY
        RCLDPEEARY
Subjt:  RCLDPEEARY

A0A6J1DWM2 Ribonuclease H2.5e-5641.44Show/hide
Query:  RLRAEARGLLTQFEDCVIRQVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAA
        R  A+AR LL QFED VIRQVPRSENSNAD LARLASAY+ DL R VPVEIL ESS+DQPE+ME+ S Q TWMDPIK+FL++G VP D SQARKL+RQAA
Subjt:  RLRAEARGLLTQFEDCVIRQVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQVTWMDPIKNFLISGLVPDDLSQARKLQRQAA

Query:  HYLMQEGKLFKRGYSLPLLRCLDPEEARY-----------------------------------------------------------------------
        HYLMQEGKLFKRGYSLPLLRCLDPEEARY                                                                       
Subjt:  HYLMQEGKLFKRGYSLPLLRCLDPEEARY-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------TPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN
                       TPY L FGSEAVVP EVGMP PRVE+FDEQ NS+AL VNLDLLEEKRSHSQLRLAEYQN
Subjt:  ---------------TPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHSQLRLAEYQN

A0A6J1DXS5 uncharacterized protein LOC1110255021.4e-5943.23Show/hide
Query:  ANSVEEFTSRLDSELEEEEIENFRFSDDNGDDSDTSTSGQGLEYPSQMPENYLGPLRRRYSIPDDIVLRLPRERERADNLPEECVTLYLKMFEYDFRLPV
        +N   +   RL+S+L  EEIEN R SDD G+DSD STSGQGLEYPS++PE+YLG LRR ++IP++I+LRLP E ERADN PE  VTLY KMFEY  RLP+
Subjt:  ANSVEEFTSRLDSELEEEEIENFRFSDDNGDDSDTSTSGQGLEYPSQMPENYLGPLRRRYSIPDDIVLRLPRERERADNLPEECVTLYLKMFEYDFRLPV

Query:  HPLVQEFLV------------------------------LDDLDLLGVEQLLACFEVKRISKK-------------------------------------
        HP VQEFL                                ++ +L  V+QLLACFE KRI+KK                                     
Subjt:  HPLVQEFLV------------------------------LDDLDLLGVEQLLACFEVKRISKK-------------------------------------

Query:  HENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRK---RL
         ++ES   F++VP RFGNLV++RP+P+L++  F   KY+K++F  G+++ TLVT++LLL   LLDYNP + P E+ RPNSELAMVCGF+  VKRK   R 
Subjt:  HENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRK---RL

Query:  CTKQAAKSKEAPSPMVVGLPAE-----VEGDPEEGFSQEVQKEEEKD
           +AA+S +  +P VVG  +E     +E +   G S+E +  ++ +
Subjt:  CTKQAAKSKEAPSPMVVGLPAE-----VEGDPEEGFSQEVQKEEEKD

A0A6J1DZB3 uncharacterized protein LOC1110256656.5e-4939.78Show/hide
Query:  ENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRKRLCTKQ
        ++ES   F++VP RFGNLV+++ IP+L++  F   K++KD F   ++I TLVT+KLLL   LLDYNPL+   EA RPNSELAMVCGF+ +VKRK      
Subjt:  ENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAFKYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRKRLCTKQ

Query:  AAKSKEAPSPMVVGLPAE----------------VEGDPEEGFSQEVQKEEEKDL-------SLRGR---------------GEVG--------------
        A K+     P+   +P                  +E D   G S E +  EE +         +RG                 E G              
Subjt:  AAKSKEAPSPMVVGLPAE----------------VEGDPEEGFSQEVQKEEEKDL-------SLRGR---------------GEVG--------------

Query:  -------GTSNVDLRFKVELSNVGVRERAMEISGSYFDRCWRRASKFVSAPGSVIQRLLDSTAEAHAAACQTAFMVKVELEGRDLLTVKERDASSTVLEA
               GTSNV +RF +E S+ GV+++   IS +  DR  RRASKFVS PGSV+QR +D+ AEA  A+   A MVK EL+GR+ L  KER+ S   LEA
Subjt:  -------GTSNVDLRFKVELSNVGVRERAMEISGSYFDRCWRRASKFVSAPGSVIQRLLDSTAEAHAAACQTAFMVKVELEGRDLLTVKERDASSTVLEA

Query:  AAALEGELKEARAEAQAWKSSSDADKAELKSAQAEAARHLENLRGTHAVAKYLEKEKFALMK
        A  L+GEL +A+ E    ++  DA    LK    E  +H  +LR  HA+ K LEKEKF L+K
Subjt:  AAALEGELKEARAEAQAWKSSSDADKAELKSAQAEAARHLENLRGTHAVAKYLEKEKFALMK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTAGAAGCAGTTGTGTCACCACGTAATGATGAGGCAAAGCTCAGCCGCGGCACCCCAGCGGAAGAACTAGAACTTGTCCCCTTGCTGGGGCCGAATAAGCAAGT
CAATGTCTGTAGCAGACTGCGGGCCGAGGCTCGAGGGCTCCTCACCCAGTTTGAGGACTGTGTGATTCGACAGGTGCCGAGGTCCGAAAACTCCAATGCCGACGCACTGG
CTCGCTTAGCCTCGGCCTACGAGAACGACCTGTCGAGAAAGGTTCCAGTTGAAATACTTGCCGAGTCGTCCATCGACCAGCCTGAAGTAATGGAGGTCCAGTCAGTTCAG
GTTACATGGATGGACCCAATTAAGAACTTCCTGATCAGTGGCTTAGTTCCTGACGATCTGAGCCAGGCCAGAAAGCTCCAACGTCAAGCGGCTCACTACTTGATGCAAGA
AGGCAAGCTCTTCAAGAGGGGATATTCCCTACCATTGTTGCGGTGCCTCGACCCAGAGGAGGCGAGGTATACGCCATATGACCTAGTCTTCGGGTCAGAGGCCGTTGTCC
CAGTGGAAGTCGGCATGCCAATTCCTCGAGTTGAACATTTCGACGAGCAAACCAACAGCAAAGCTCTCTTTGTGAATCTTGACCTACTCGAGGAGAAAAGATCTCACTCC
CAGCTCAGGCTAGCAGAGTATCAGAACTCCGTTTTTCCCCCGACTTCTTCTTCGTTCCCAATCCCGCCGCCGCTCCTTTCTGGCAGCTTCTCCGACGCCTCGACGCAGCA
CCGGTGCTGCTGCCCACCGAACGGGACGACGACTCGTAGTAGTGGCGGACGAACCACGAGCGGCGGCCGGCGAAGCAGCGGCGGGCGAAGACACCCCGAGCGGCGGCGCG
AGACCCAGCCTCCCTCCTCCGGCGTTGTGCGACGTCGCGCGGTTCTGCGCACGGGTTCCACTTCGAGCAGCGGCCGCCGGATGCGATTCCTTTGGCGGCGTTCGAACAGC
GACGGCCGGCGACTCCCGGCGCTCCAGCGGCTGCGCTCCGTGACAGGTACAGCAGCGGCGTGGCCCCTCCTCGTAGCGGCGCACGACGGAAGGTGGTACAGCAGCGAAAC
GACGACAGGATTGGGTGAGGTCGACGCAAAACTTAAGAGCACGGTTGTGGGTTTAGGTCCCGACAGGGGTCACGACAAGCGGTTACCTTGGTCGGTTTTGCGGTCGGACG
TGTCGGGTTCGAGTAGATCAAACTCGGATACTTTCGGGGTTGTAGTACAATTAGAACCGGTGTTCGCCTCGCACTCAAGTACTTACGACGTTTGGAATTCTTCCGATTTG
AGACGTCCGATTTATGACACGTGTGCGGTTGAGATTAAACTTCGGACTGCTATGAATACCTCCAAGGTTTCGTTTCCGCTTTTTACGCTTTCCTCCTCCCTTAGTGGTTC
ATCTAGTGACATAGCTCGGAGTCGGGACTCTTTGCTTAAGAAAGCTAATTCCGTAGAGGAGTTCACCAGTAGGTTAGATTCCGAACTTGAAGAGGAAGAGATAGAAAACT
TTAGATTTTCTGATGACAATGGGGATGATAGTGATACGTCCACCTCGGGCCAGGGTTTAGAATACCCTTCCCAAATGCCTGAGAACTACCTCGGTCCTCTTCGTAGGAGA
TATAGCATACCAGATGATATAGTCCTTAGGCTTCCTAGGGAAAGGGAACGAGCTGATAATCTACCGGAAGAATGCGTTACTCTTTACCTAAAGATGTTCGAGTACGACTT
TCGCCTACCCGTCCATCCATTGGTACAAGAGTTTCTCGTCCTGGACGATCTTGACCTCCTTGGAGTTGAACAGCTTCTAGCTTGTTTTGAGGTAAAACGTATCTCCAAGA
AGCATGAGAACGAGTCCGACCTGCCTTTCTACAACGTCCCTCGTAGGTTTGGGAACTTAGTTGCTGTTCGGCCGATTCCTCAACTCTCCGAGCCAATTTTCTACGCCTTC
AAATATTTCAAAGATAAATTCAAGAGCGGCAAGCAGATCAGCACGCTTGTCACAAACAAGCTTCTCCTTGCTTTCAGGCTGCTGGATTACAACCCTCTTCTGCTCCCGCC
TGAAGCTGAGAGGCCGAACTCCGAACTAGCTATGGTGTGTGGTTTCTCCCAAAACGTGAAGCGCAAACGCCTGTGCACTAAACAGGCTGCCAAAAGCAAAGAGGCACCCA
GCCCTATGGTAGTTGGCCTTCCTGCTGAGGTCGAGGGAGATCCGGAGGAAGGCTTCTCCCAAGAAGTCCAAAAAGAAGAAGAGAAAGACTTATCACTCCGAGGACGAGGC
GAGGTGGGAGGAACCTCCAATGTCGATCTGAGGTTCAAGGTTGAGCTGTCTAACGTCGGGGTGAGGGAAAGGGCAATGGAAATCTCCGGCTCTTACTTTGACCGCTGCTG
GAGGAGAGCTTCTAAGTTTGTAAGCGCTCCGGGATCAGTCATCCAACGATTGTTGGATTCCACTGCCGAGGCTCACGCTGCAGCTTGCCAGACAGCCTTCATGGTGAAAG
TTGAACTAGAAGGACGCGACTTGCTCACTGTGAAGGAGCGAGATGCCTCCTCTACCGTTTTAGAAGCTGCTGCTGCTCTGGAGGGGGAACTCAAAGAGGCTCGTGCTGAG
GCCCAGGCATGGAAATCCTCTTCTGATGCCGATAAGGCCGAGCTCAAAAGTGCCCAGGCAGAGGCTGCTCGACACCTAGAGAACTTGAGAGGCACGCACGCCGTGGCCAA
GTACCTGGAAAAGGAGAAGTTCGCATTGATGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTAGAAGCAGTTGTGTCACCACGTAATGATGAGGCAAAGCTCAGCCGCGGCACCCCAGCGGAAGAACTAGAACTTGTCCCCTTGCTGGGGCCGAATAAGCAAGT
CAATGTCTGTAGCAGACTGCGGGCCGAGGCTCGAGGGCTCCTCACCCAGTTTGAGGACTGTGTGATTCGACAGGTGCCGAGGTCCGAAAACTCCAATGCCGACGCACTGG
CTCGCTTAGCCTCGGCCTACGAGAACGACCTGTCGAGAAAGGTTCCAGTTGAAATACTTGCCGAGTCGTCCATCGACCAGCCTGAAGTAATGGAGGTCCAGTCAGTTCAG
GTTACATGGATGGACCCAATTAAGAACTTCCTGATCAGTGGCTTAGTTCCTGACGATCTGAGCCAGGCCAGAAAGCTCCAACGTCAAGCGGCTCACTACTTGATGCAAGA
AGGCAAGCTCTTCAAGAGGGGATATTCCCTACCATTGTTGCGGTGCCTCGACCCAGAGGAGGCGAGGTATACGCCATATGACCTAGTCTTCGGGTCAGAGGCCGTTGTCC
CAGTGGAAGTCGGCATGCCAATTCCTCGAGTTGAACATTTCGACGAGCAAACCAACAGCAAAGCTCTCTTTGTGAATCTTGACCTACTCGAGGAGAAAAGATCTCACTCC
CAGCTCAGGCTAGCAGAGTATCAGAACTCCGTTTTTCCCCCGACTTCTTCTTCGTTCCCAATCCCGCCGCCGCTCCTTTCTGGCAGCTTCTCCGACGCCTCGACGCAGCA
CCGGTGCTGCTGCCCACCGAACGGGACGACGACTCGTAGTAGTGGCGGACGAACCACGAGCGGCGGCCGGCGAAGCAGCGGCGGGCGAAGACACCCCGAGCGGCGGCGCG
AGACCCAGCCTCCCTCCTCCGGCGTTGTGCGACGTCGCGCGGTTCTGCGCACGGGTTCCACTTCGAGCAGCGGCCGCCGGATGCGATTCCTTTGGCGGCGTTCGAACAGC
GACGGCCGGCGACTCCCGGCGCTCCAGCGGCTGCGCTCCGTGACAGGTACAGCAGCGGCGTGGCCCCTCCTCGTAGCGGCGCACGACGGAAGGTGGTACAGCAGCGAAAC
GACGACAGGATTGGGTGAGGTCGACGCAAAACTTAAGAGCACGGTTGTGGGTTTAGGTCCCGACAGGGGTCACGACAAGCGGTTACCTTGGTCGGTTTTGCGGTCGGACG
TGTCGGGTTCGAGTAGATCAAACTCGGATACTTTCGGGGTTGTAGTACAATTAGAACCGGTGTTCGCCTCGCACTCAAGTACTTACGACGTTTGGAATTCTTCCGATTTG
AGACGTCCGATTTATGACACGTGTGCGGTTGAGATTAAACTTCGGACTGCTATGAATACCTCCAAGGTTTCGTTTCCGCTTTTTACGCTTTCCTCCTCCCTTAGTGGTTC
ATCTAGTGACATAGCTCGGAGTCGGGACTCTTTGCTTAAGAAAGCTAATTCCGTAGAGGAGTTCACCAGTAGGTTAGATTCCGAACTTGAAGAGGAAGAGATAGAAAACT
TTAGATTTTCTGATGACAATGGGGATGATAGTGATACGTCCACCTCGGGCCAGGGTTTAGAATACCCTTCCCAAATGCCTGAGAACTACCTCGGTCCTCTTCGTAGGAGA
TATAGCATACCAGATGATATAGTCCTTAGGCTTCCTAGGGAAAGGGAACGAGCTGATAATCTACCGGAAGAATGCGTTACTCTTTACCTAAAGATGTTCGAGTACGACTT
TCGCCTACCCGTCCATCCATTGGTACAAGAGTTTCTCGTCCTGGACGATCTTGACCTCCTTGGAGTTGAACAGCTTCTAGCTTGTTTTGAGGTAAAACGTATCTCCAAGA
AGCATGAGAACGAGTCCGACCTGCCTTTCTACAACGTCCCTCGTAGGTTTGGGAACTTAGTTGCTGTTCGGCCGATTCCTCAACTCTCCGAGCCAATTTTCTACGCCTTC
AAATATTTCAAAGATAAATTCAAGAGCGGCAAGCAGATCAGCACGCTTGTCACAAACAAGCTTCTCCTTGCTTTCAGGCTGCTGGATTACAACCCTCTTCTGCTCCCGCC
TGAAGCTGAGAGGCCGAACTCCGAACTAGCTATGGTGTGTGGTTTCTCCCAAAACGTGAAGCGCAAACGCCTGTGCACTAAACAGGCTGCCAAAAGCAAAGAGGCACCCA
GCCCTATGGTAGTTGGCCTTCCTGCTGAGGTCGAGGGAGATCCGGAGGAAGGCTTCTCCCAAGAAGTCCAAAAAGAAGAAGAGAAAGACTTATCACTCCGAGGACGAGGC
GAGGTGGGAGGAACCTCCAATGTCGATCTGAGGTTCAAGGTTGAGCTGTCTAACGTCGGGGTGAGGGAAAGGGCAATGGAAATCTCCGGCTCTTACTTTGACCGCTGCTG
GAGGAGAGCTTCTAAGTTTGTAAGCGCTCCGGGATCAGTCATCCAACGATTGTTGGATTCCACTGCCGAGGCTCACGCTGCAGCTTGCCAGACAGCCTTCATGGTGAAAG
TTGAACTAGAAGGACGCGACTTGCTCACTGTGAAGGAGCGAGATGCCTCCTCTACCGTTTTAGAAGCTGCTGCTGCTCTGGAGGGGGAACTCAAAGAGGCTCGTGCTGAG
GCCCAGGCATGGAAATCCTCTTCTGATGCCGATAAGGCCGAGCTCAAAAGTGCCCAGGCAGAGGCTGCTCGACACCTAGAGAACTTGAGAGGCACGCACGCCGTGGCCAA
GTACCTGGAAAAGGAGAAGTTCGCATTGATGAAGTAG
Protein sequenceShow/hide protein sequence
MELEAVVSPRNDEAKLSRGTPAEELELVPLLGPNKQVNVCSRLRAEARGLLTQFEDCVIRQVPRSENSNADALARLASAYENDLSRKVPVEILAESSIDQPEVMEVQSVQ
VTWMDPIKNFLISGLVPDDLSQARKLQRQAAHYLMQEGKLFKRGYSLPLLRCLDPEEARYTPYDLVFGSEAVVPVEVGMPIPRVEHFDEQTNSKALFVNLDLLEEKRSHS
QLRLAEYQNSVFPPTSSSFPIPPPLLSGSFSDASTQHRCCCPPNGTTTRSSGGRTTSGGRRSSGGRRHPERRRETQPPSSGVVRRRAVLRTGSTSSSGRRMRFLWRRSNS
DGRRLPALQRLRSVTGTAAAWPLLVAAHDGRWYSSETTTGLGEVDAKLKSTVVGLGPDRGHDKRLPWSVLRSDVSGSSRSNSDTFGVVVQLEPVFASHSSTYDVWNSSDL
RRPIYDTCAVEIKLRTAMNTSKVSFPLFTLSSSLSGSSSDIARSRDSLLKKANSVEEFTSRLDSELEEEEIENFRFSDDNGDDSDTSTSGQGLEYPSQMPENYLGPLRRR
YSIPDDIVLRLPRERERADNLPEECVTLYLKMFEYDFRLPVHPLVQEFLVLDDLDLLGVEQLLACFEVKRISKKHENESDLPFYNVPRRFGNLVAVRPIPQLSEPIFYAF
KYFKDKFKSGKQISTLVTNKLLLAFRLLDYNPLLLPPEAERPNSELAMVCGFSQNVKRKRLCTKQAAKSKEAPSPMVVGLPAEVEGDPEEGFSQEVQKEEEKDLSLRGRG
EVGGTSNVDLRFKVELSNVGVRERAMEISGSYFDRCWRRASKFVSAPGSVIQRLLDSTAEAHAAACQTAFMVKVELEGRDLLTVKERDASSTVLEAAAALEGELKEARAE
AQAWKSSSDADKAELKSAQAEAARHLENLRGTHAVAKYLEKEKFALMK