CuGenDBv2

Moc04g28320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g28320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Genome location: chr4:20944061..20948801
RNA-Seq Expression: Moc04g28320
Synteny: Moc04g28320
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
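
The genome location field uses a `chromosome:start..end` string. A minimal Python sketch for splitting it into parts and computing the span is given here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Pattern for "chr4:20944061..20948801"-style strings (assumed format).
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(text):
    """Return (chrom, start, end) parsed from a 'chrom:start..end' string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])

chrom, start, end = parse_location("chr4:20944061..20948801")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`.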


Homology
GenBank top hits | e value | % identity
XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia] | 3.3e-67 | 53.03
Query:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT
        M+FLAASDAIKCRAFQI +EGS RLWY+QLKP+SI SYQQLR++FINQFSARQL+KLP SHL   KQRD ES+T+YI R MD+HVKVV+CTD++A++YFT
Subjt:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT

Query:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL-------------------------KKRDDDRTSSRRPEEKTSEQRGDARTESKN-RPRFDKYTST
        TGLNDRNL ++ GS+   SLN++L RAR+YIDGL                         K+  DD++SSR+  +  S  + D R  S    P+FDK+T  
Subjt:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL-------------------------KKRDDDRTSSRRPEEKTSEQRGDARTESKN-RPRFDKYTST

Query:  NKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ
        N  +  I A VE    + L + P K  +PS K+DK  YC FHKDH H++S C+ L +Q++DLI+
Subjt:  NKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ

XP_022152851.1 uncharacterized protein LOC111020475 [Momordica charantia] | 6.0e-69 | 56.27
Query:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF
        MS YDGSGDPISYVE+FEGKM+FLA SDA+KC AFQIT+EGSTRLWYRQLK +SI SYQQLR++FINQFS RQ +KLP SHLG  KQRD ES T YI RF
Subjt:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF

Query:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRGDARTESKNRPR----FDKYTSTNK
        MD+HVKVV+CTD++A++YFTTGLNDRNL ++ GS     LNE+  RAR+YIDGL+  + D       +E  + + G   T S + PR      +      
Subjt:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRGDARTESKNRPR----FDKYTSTNK

Query:  LITGILAAVEEDGFEILLSLP-GKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ
        + T I   + +  F+I+      K R+PS K+DK  YC FHKD  HDTS C+ L +Q+EDLI+
Subjt:  LITGILAAVEEDGFEILLSLP-GKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 9.6e-67 | 38.35
Query:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF
        M PYDGS DP  YVE+FE  M+F AA+DAIKC AFQI + GS RLWYR+L  + IS+Y QLRK FI+QFS+R   +   +HL   +Q++ E++ +Y+TRF
Subjt:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF

Query:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRG---DARTESKNRPR----------
         ++ +KV +C+D+ A+ YF TGL D  L +KL  ++  +  E+L + ++ IDG   ++  RT + RPE+   + R      + +SK+R +          
Subjt:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRG---DARTESKNRPR----------

Query:  -------------FDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ----------------
                     ++ YT T   I  IL  +EE G E LL  P K R    K++  KYC FH+DH H+TS  + L  QIEDLIQ                
Subjt:  -------------FDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ----------------

Query:  -----------------------------------------------------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALE
                                                               EGVHLP+NDALVIAPLID V+V+R+L+DGGAS N++  S Y AL 
Subjt:  -----------------------------------------------------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALE

Query:  WERAKLKPSLTLLVGFVGESVTAEG
        W R++LK S T LVGF GES++ EG
Subjt:  WERAKLKPSLTLLVGFVGESVTAEG

XP_022153957.1 uncharacterized protein LOC111021344 [Momordica charantia] | 4.8e-66 | 41.95
Query:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT
        M+F AA+DAIKCRAFQI +  S RLWYR+L  +SIS+Y QLRK  I+QFS+R   +  A+HL   +Q++ E++ +Y+TRF ++ +KV +C+D+ A+ YF 
Subjt:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT

Query:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPE---------------EKTSEQRGD----ARTE--------SKNRPRFDKYTS
        TGL D  L +KLG ++  +  E+L +A++ IDG   ++  RT + RPE               +  S+ +G     +RTE        S++RP +++YTS
Subjt:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPE---------------EKTSEQRGD----ARTE--------SKNRPRFDKYTS

Query:  TNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ-----------------------------------
        T   I+ IL  +EE G E LL  P K R    K++K KYC FH+DH H+T++C+ L  QIEDLIQ                                   
Subjt:  TNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ-----------------------------------

Query:  -------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALEWERAKLKPSLTLLVGFVGESVTAEG
                 EGVHLP+NDALVIAPLIDHV V+RVL+DG AS N++    Y AL W R +LK S T  VGF GESV+ EG
Subjt:  -------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALEWERAKLKPSLTLLVGFVGESVTAEG

XP_022156001.1 uncharacterized protein LOC111022976 [Momordica charantia] | 4.5e-72 | 72.77
Query:  MFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL---------
        MFINQ SA QL+KLP SHLG  KQRD ES+TDYITRFMD+HVKV+NCTDEMAIIYFTTGL  RNLVMKL SK AT  NELL  A RYIDGL         
Subjt:  MFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL---------

Query:  ----------KKRDDDRTSSRRPEEKTSEQRGDARTESKNRPRFDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTS
                  KKR DDRTSSR PEEKTSEQR DARTESKN P+FDKYT TN+ I  I A VEE GFE LLSL GK+RK S KKDKTKYC FHKDHDHDTS
Subjt:  ----------KKRDDDRTSSRRPEEKTSEQRGDARTESKNRPRFDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTS

Query:  TCYALTDQIEDLI
        TCYAL DQIEDLI
Subjt:  TCYALTDQIEDLI

TrEMBL top hits | e value | % identity
A0A6J1D5T3 uncharacterized protein LOC111017548 | 1.6e-67 | 53.03
Query:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT
        M+FLAASDAIKCRAFQI +EGS RLWY+QLKP+SI SYQQLR++FINQFSARQL+KLP SHL   KQRD ES+T+YI R MD+HVKVV+CTD++A++YFT
Subjt:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT

Query:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL-------------------------KKRDDDRTSSRRPEEKTSEQRGDARTESKN-RPRFDKYTST
        TGLNDRNL ++ GS+   SLN++L RAR+YIDGL                         K+  DD++SSR+  +  S  + D R  S    P+FDK+T  
Subjt:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL-------------------------KKRDDDRTSSRRPEEKTSEQRGDARTESKN-RPRFDKYTST

Query:  NKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ
        N  +  I A VE    + L + P K  +PS K+DK  YC FHKDH H++S C+ L +Q++DLI+
Subjt:  NKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ

A0A6J1DHB3 uncharacterized protein LOC111020479 | 4.7e-67 | 38.35
Query:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF
        M PYDGS DP  YVE+FE  M+F AA+DAIKC AFQI + GS RLWYR+L  + IS+Y QLRK FI+QFS+R   +   +HL   +Q++ E++ +Y+TRF
Subjt:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF

Query:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRG---DARTESKNRPR----------
         ++ +KV +C+D+ A+ YF TGL D  L +KL  ++  +  E+L + ++ IDG   ++  RT + RPE+   + R      + +SK+R +          
Subjt:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRG---DARTESKNRPR----------

Query:  -------------FDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ----------------
                     ++ YT T   I  IL  +EE G E LL  P K R    K++  KYC FH+DH H+TS  + L  QIEDLIQ                
Subjt:  -------------FDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ----------------

Query:  -----------------------------------------------------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALE
                                                               EGVHLP+NDALVIAPLID V+V+R+L+DGGAS N++  S Y AL 
Subjt:  -----------------------------------------------------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALE

Query:  WERAKLKPSLTLLVGFVGESVTAEG
        W R++LK S T LVGF GES++ EG
Subjt:  WERAKLKPSLTLLVGFVGESVTAEG

A0A6J1DIZ8 uncharacterized protein LOC111020475 | 2.9e-69 | 56.27
Query:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF
        MS YDGSGDPISYVE+FEGKM+FLA SDA+KC AFQIT+EGSTRLWYRQLK +SI SYQQLR++FINQFS RQ +KLP SHLG  KQRD ES T YI RF
Subjt:  MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRF

Query:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRGDARTESKNRPR----FDKYTSTNK
        MD+HVKVV+CTD++A++YFTTGLNDRNL ++ GS     LNE+  RAR+YIDGL+  + D       +E  + + G   T S + PR      +      
Subjt:  MDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRGDARTESKNRPR----FDKYTSTNK

Query:  LITGILAAVEEDGFEILLSLP-GKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ
        + T I   + +  F+I+      K R+PS K+DK  YC FHKD  HDTS C+ L +Q+EDLI+
Subjt:  LITGILAAVEEDGFEILLSLP-GKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ

A0A6J1DKD3 uncharacterized protein LOC111021344 | 2.3e-66 | 41.95
Query:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT
        M+F AA+DAIKCRAFQI +  S RLWYR+L  +SIS+Y QLRK  I+QFS+R   +  A+HL   +Q++ E++ +Y+TRF ++ +KV +C+D+ A+ YF 
Subjt:  MEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFT

Query:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPE---------------EKTSEQRGD----ARTE--------SKNRPRFDKYTS
        TGL D  L +KLG ++  +  E+L +A++ IDG   ++  RT + RPE               +  S+ +G     +RTE        S++RP +++YTS
Subjt:  TGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPE---------------EKTSEQRGD----ARTE--------SKNRPRFDKYTS

Query:  TNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ-----------------------------------
        T   I+ IL  +EE G E LL  P K R    K++K KYC FH+DH H+T++C+ L  QIEDLIQ                                   
Subjt:  TNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQ-----------------------------------

Query:  -------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALEWERAKLKPSLTLLVGFVGESVTAEG
                 EGVHLP+NDALVIAPLIDHV V+RVL+DG AS N++    Y AL W R +LK S T  VGF GESV+ EG
Subjt:  -------HTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALEWERAKLKPSLTLLVGFVGESVTAEG

A0A6J1DTI2 uncharacterized protein LOC111022976 | 2.2e-72 | 72.77
Query:  MFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL---------
        MFINQ SA QL+KLP SHLG  KQRD ES+TDYITRFMD+HVKV+NCTDEMAIIYFTTGL  RNLVMKL SK AT  NELL  A RYIDGL         
Subjt:  MFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNCTDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGL---------

Query:  ----------KKRDDDRTSSRRPEEKTSEQRGDARTESKNRPRFDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTS
                  KKR DDRTSSR PEEKTSEQR DARTESKN P+FDKYT TN+ I  I A VEE GFE LLSL GK+RK S KKDKTKYC FHKDHDHDTS
Subjt:  ----------KKRDDDRTSSRRPEEKTSEQRGDARTESKNRPRFDKYTSTNKLITGILAAVEEDGFEILLSLPGKTRKPSSKKDKTKYCWFHKDHDHDTS

Query:  TCYALTDQIEDLI
        TCYAL DQIEDLI
Subjt:  TCYALTDQIEDLI

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGAGCCCGTATGATGGGTCGGGAGACCCTATATCGTACGTGGAAATATTCGAAGGGAAGATGGAGTTCCTTGCAGCAAGTGACGCCATCAAGTGTCGAGCTTTCCAAAT
CACTATAGAAGGCTCGACTCGATTATGGTACCGACAGTTGAAGCCGAAATCCATCAGCAGCTATCAACAGTTACGTAAGATGTTCATCAACCAGTTCTCAGCAAGGCAGT
TGATGAAGCTTCCAGCATCCCACCTCGGGATGTTCAAACAACGAGATCGCGAGTCCATGACCGATTACATCACCAGGTTTATGGATGACCACGTTAAGGTGGTAAACTGC
ACCGACGAGATGGCTATAATATACTTCACCACCGGGTTGAATGATAGAAACCTAGTGATGAAGCTTGGCAGCAAGTCAGCTACCTCGTTGAATGAGTTACTCATTCGCGC
ACGAAGATACATCGATGGCCTCAAGAAACGAGACGACGACCGTACTTCAAGTCGTCGACCAGAGGAAAAGACTTCCGAGCAACGTGGTGACGCTCGAACCGAATCAAAGA
ATAGACCTAGGTTTGACAAGTACACGTCGACCAACAAACTCATAACGGGAATTCTTGCAGCTGTGGAGGAAGATGGGTTTGAGATTTTACTGTCATTGCCAGGGAAGACA
CGCAAGCCGAGCAGTAAAAAGGATAAGACGAAGTACTGTTGGTTCCACAAGGACCACGACCACGATACCTCGACCTGCTATGCGCTGACGGACCAGATTGAGGACCTCAT
TCAACACACCGAGGGTGTGCACTTGCCTTATAACGACGCCCTCGTCATCGCCCCATTGATTGATCATGTTATGGTCAAACGGGTTCTCATTGATGGAGGAGCCTCAACCA
ATGTCATCTTTTGGTCAGTCTACTCAGCCCTTGAATGGGAGCGGGCTAAGCTGAAGCCGAGCCTTACACTACTGGTGGGTTTTGTTGGAGAGTCAGTAACAGCCGAGGGA
TCCACAATCCGAGGCGAGCAGAAGTCATCAAGGAAATGCTACGCCGAAGCATTAGCGGGTTCTGCCACTTGTGCAGCCATTACGACCGAGGCTCCATCGCTGGATGAGCC
GACCTGCGAGATCCCAGCCGAGGAGCTTGAGCTTGTGCCACTCTTGAGTCCAGATAAGCAGGTTAGAGTCGGCACCAAGCTGGGAGGAGAGACCAGGGCAGAGTTCATCA
ACTTCTTGCAAACGAATGCCAACGTCTTTGCATGGTCGCATGAAGACATGCTAGGGTCGACCCGAGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGAA
AAAGTATCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAAC
CGATGCATACGTGTAG
mRNA sequence
ATGAGCCCGTATGATGGGTCGGGAGACCCTATATCGTACGTGGAAATATTCGAAGGGAAGATGGAGTTCCTTGCAGCAAGTGACGCCATCAAGTGTCGAGCTTTCCAAAT
CACTATAGAAGGCTCGACTCGATTATGGTACCGACAGTTGAAGCCGAAATCCATCAGCAGCTATCAACAGTTACGTAAGATGTTCATCAACCAGTTCTCAGCAAGGCAGT
TGATGAAGCTTCCAGCATCCCACCTCGGGATGTTCAAACAACGAGATCGCGAGTCCATGACCGATTACATCACCAGGTTTATGGATGACCACGTTAAGGTGGTAAACTGC
ACCGACGAGATGGCTATAATATACTTCACCACCGGGTTGAATGATAGAAACCTAGTGATGAAGCTTGGCAGCAAGTCAGCTACCTCGTTGAATGAGTTACTCATTCGCGC
ACGAAGATACATCGATGGCCTCAAGAAACGAGACGACGACCGTACTTCAAGTCGTCGACCAGAGGAAAAGACTTCCGAGCAACGTGGTGACGCTCGAACCGAATCAAAGA
ATAGACCTAGGTTTGACAAGTACACGTCGACCAACAAACTCATAACGGGAATTCTTGCAGCTGTGGAGGAAGATGGGTTTGAGATTTTACTGTCATTGCCAGGGAAGACA
CGCAAGCCGAGCAGTAAAAAGGATAAGACGAAGTACTGTTGGTTCCACAAGGACCACGACCACGATACCTCGACCTGCTATGCGCTGACGGACCAGATTGAGGACCTCAT
TCAACACACCGAGGGTGTGCACTTGCCTTATAACGACGCCCTCGTCATCGCCCCATTGATTGATCATGTTATGGTCAAACGGGTTCTCATTGATGGAGGAGCCTCAACCA
ATGTCATCTTTTGGTCAGTCTACTCAGCCCTTGAATGGGAGCGGGCTAAGCTGAAGCCGAGCCTTACACTACTGGTGGGTTTTGTTGGAGAGTCAGTAACAGCCGAGGGA
TCCACAATCCGAGGCGAGCAGAAGTCATCAAGGAAATGCTACGCCGAAGCATTAGCGGGTTCTGCCACTTGTGCAGCCATTACGACCGAGGCTCCATCGCTGGATGAGCC
GACCTGCGAGATCCCAGCCGAGGAGCTTGAGCTTGTGCCACTCTTGAGTCCAGATAAGCAGGTTAGAGTCGGCACCAAGCTGGGAGGAGAGACCAGGGCAGAGTTCATCA
ACTTCTTGCAAACGAATGCCAACGTCTTTGCATGGTCGCATGAAGACATGCTAGGGTCGACCCGAGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGAA
AAAGTATCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAAC
CGATGCATACGTGTAG
Protein sequence
MSPYDGSGDPISYVEIFEGKMEFLAASDAIKCRAFQITIEGSTRLWYRQLKPKSISSYQQLRKMFINQFSARQLMKLPASHLGMFKQRDRESMTDYITRFMDDHVKVVNC
TDEMAIIYFTTGLNDRNLVMKLGSKSATSLNELLIRARRYIDGLKKRDDDRTSSRRPEEKTSEQRGDARTESKNRPRFDKYTSTNKLITGILAAVEEDGFEILLSLPGKT
RKPSSKKDKTKYCWFHKDHDHDTSTCYALTDQIEDLIQHTEGVHLPYNDALVIAPLIDHVMVKRVLIDGGASTNVIFWSVYSALEWERAKLKPSLTLLVGFVGESVTAEG
STIRGEQKSSRKCYAEALAGSATCAAITTEAPSLDEPTCEIPAEELELVPLLSPDKQVRVGTKLGGETRAEFINFLQTNANVFAWSHEDMLGSTRDHLGVQINQKKQKDE
KVSWEAPGAWEACRKTGFLPTLPLMKRVFQCVLVVPTDAYV
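
The CDS and protein sequences above can be checked against each other by translation. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the helper name `translate` is hypothetical. It is shown on the first six codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 18 nucleotides of the CDS sequence listed above.
print(translate("ATGAGCCCGTATGATGGG"))
```

Passing the full CDS with line breaks removed should, under the same assumption, reproduce the listed protein sequence if the two records are consistent.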