CuGenDBv2

Moc08g26690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g26690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr8:19290564..19297430
RNA-Seq Expression: Moc08g26690
Synteny: Moc08g26690
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
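
The genome location above follows a `chromosome:start..end` layout. A minimal Python sketch for splitting such a string into its parts is given here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chr8:19290564..19297430' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr8:19290564..19297430")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole gene region on the chromosome, so it can be longer than the CDS listed under Sequences.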


Homology
GenBank top hits    e value    % identity    Alignment
KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]    1.1e-80    56.33
Query:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------
        RFVL E+C Q PA NAT  VR  Y+ W KAN+KA+ YILAS+S+VLAKKHE  +TA++IMDSLQ +                        A  R      
Subjt:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------

Query:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK
              ++ +GAVIDE SQVSF+LES P+SFL FRSNAVMNK+ YTLTTLLNELQT++                              S PSSSG+K +K
Subjt:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK

Query:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA
        KKK  G+G+K + AAA    K K   KG CFHCN + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSG TNHVCSSFQGISSWRQL+ 
Subjt:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA

Query:  GEMTLKVGTGEVVSAV
        GEMT++VGTG VVSA+
Subjt:  GEMTLKVGTGEVVSAV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]    1.1e-80    56.33
Query:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------
        RFVL E+C Q PA NAT  VR  Y+ W KAN+KA+ YILAS+S+VLAKKHE  +TA++IMDSLQ +                        A  R      
Subjt:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------

Query:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK
              ++ +GAVIDE SQVSF+LES P+SFL FRSNAVMNK+ YTLTTLLNELQT++                              S PSSSG+K +K
Subjt:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK

Query:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA
        KKK  G+G+K + AAA    K K   KG CFHCN + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSG TNHVCSSFQGISSWRQL+ 
Subjt:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA

Query:  GEMTLKVGTGEVVSAV
        GEMT++VGTG VVSA+
Subjt:  GEMTLKVGTGEVVSAV

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]    2.7e-90    68.94
Query:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL
        Q   ESV+ESW+RFK+LLQ C HHGIPR IQIE YYKGLDD T LVIDAS NGALLVK YA+AFNILERISSNNHSWS+P  +QG+  K L E ESY AL
Subjt:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL

Query:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN
        NSK+ENLT LVM SMTQQ+  GAS G ANV+ IQGISCSFCEG++HYNN P N ESVYYLGN QNN  N YSNTYNPGWRNHPNFSWSGNQGG+NAG SN
Subjt:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN

Query:  APAFQQK----------------------------LMKQYMANNDATVQSQAASLRNLELQVGQ
        APA+QQK                            LMK+ M  ND TVQSQAASLRNLE+QVGQ
Subjt:  APAFQQK----------------------------LMKQYMANNDATVQSQAASLRNLELQVGQ

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]    7.2e-83    64.96
Query:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL
        QFA ESVSESW+ FKRLLQSC HHGIPR IQIETYYK L+D T L                                 +P  VQGKSSK LVE ESYT L
Subjt:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL

Query:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN
        NS IENLT LVM SM QQS  GA  G ANVNQIQGISCSFCEGD+HYNNCPGN ESVYYLGNPQNNRNN YSNTYNPGWRNHPNFSWSG+QGGHNAG S+
Subjt:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN

Query:  APAFQ----------------------------QKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPI
        APAFQ                            +KLMKQYMANNDATVQSQA SLRNL+LQVGQLA DLKS+PI
Subjt:  APAFQ----------------------------QKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPI

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]    1.6e-87    64.89
Query:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL
        QF  ESVSESW+RFKRL+Q   + GIPR IQI+TYY GLDD T LVIDASANGALL K YA+AFNILERISSNN SWS+P  + GK SK   E ES+TAL
Subjt:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL

Query:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN
        N KIENLT LVM SMT QS  GAS G ANV+ IQGISCSFC G+  YNNCPGN ESV+YLGN QNN NNPYS             +W+G           
Subjt:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN

Query:  APAFQQKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPIGALPNDTEVPKRYGKEQCKALTLRRGKALPPAHPNA
            +++ M +YM NND TVQSQA SLRNLE+QVGQLA DLKS+P G LP+D +VPKR GKEQC ALTLR GK LP AHPNA
Subjt:  APAFQQKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPIGALPNDTEVPKRYGKEQCKALTLRRGKALPPAHPNA

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SMH8 Gag/pol protein    5.5e-81    56.33
Query:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------
        RFVL E+C Q PA NAT  VR  Y+ W KAN+KA+ YILAS+S+VLAKKHE  +TA++IMDSLQ +                        A  R      
Subjt:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------

Query:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK
              ++ +GAVIDE SQVSF+LES P+SFL FRSNAVMNK+ YTLTTLLNELQT++                              S PSSSG+K +K
Subjt:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK

Query:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA
        KKK  G+G+K + AAA    K K   KG CFHCN + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSG TNHVCSSFQGISSWRQL+ 
Subjt:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA

Query:  GEMTLKVGTGEVVSAV
        GEMT++VGTG VVSA+
Subjt:  GEMTLKVGTGEVVSAV

A0A5D3CPJ6 Gag/pol protein    5.5e-81    56.33
Query:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------
        RFVL E+C Q PA NAT  VR  Y+ W KAN+KA+ YILAS+S+VLAKKHE  +TA++IMDSLQ +                        A  R      
Subjt:  RFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSI------------------------ARTR------

Query:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK
              ++ +GAVIDE SQVSF+LES P+SFL FRSNAVMNK+ YTLTTLLNELQT++                              S PSSSG+K +K
Subjt:  ------SQPDGAVIDEQSQVSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQ------------------------------SAPSSSGSKTFK

Query:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA
        KKK  G+G+K + AAA    K K   KG CFHCN + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSG TNHVCSSFQGISSWRQL+ 
Subjt:  KKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDA

Query:  GEMTLKVGTGEVVSAV
        GEMT++VGTG VVSA+
Subjt:  GEMTLKVGTGEVVSAV

A0A6J1DRG1 uncharacterized protein LOC111023669    1.3e-90    68.94
Query:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL
        Q   ESV+ESW+RFK+LLQ C HHGIPR IQIE YYKGLDD T LVIDAS NGALLVK YA+AFNILERISSNNHSWS+P  +QG+  K L E ESY AL
Subjt:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL

Query:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN
        NSK+ENLT LVM SMTQQ+  GAS G ANV+ IQGISCSFCEG++HYNN P N ESVYYLGN QNN  N YSNTYNPGWRNHPNFSWSGNQGG+NAG SN
Subjt:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN

Query:  APAFQQK----------------------------LMKQYMANNDATVQSQAASLRNLELQVGQ
        APA+QQK                            LMK+ M  ND TVQSQAASLRNLE+QVGQ
Subjt:  APAFQQK----------------------------LMKQYMANNDATVQSQAASLRNLELQVGQ

A0A6J1DXK5 uncharacterized protein LOC111025500    8.0e-88    64.89
Query:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL
        QF  ESVSESW+RFKRL+Q   + GIPR IQI+TYY GLDD T LVIDASANGALL K YA+AFNILERISSNN SWS+P  + GK SK   E ES+TAL
Subjt:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL

Query:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN
        N KIENLT LVM SMT QS  GAS G ANV+ IQGISCSFC G+  YNNCPGN ESV+YLGN QNN NNPYS             +W+G           
Subjt:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN

Query:  APAFQQKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPIGALPNDTEVPKRYGKEQCKALTLRRGKALPPAHPNA
            +++ M +YM NND TVQSQA SLRNLE+QVGQLA DLKS+P G LP+D +VPKR GKEQC ALTLR GK LP AHPNA
Subjt:  APAFQQKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPIGALPNDTEVPKRYGKEQCKALTLRRGKALPPAHPNA

A0A6J1E1F3 uncharacterized protein LOC111025065    3.5e-83    64.96
Query:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL
        QFA ESVSESW+ FKRLLQSC HHGIPR IQIETYYK L+D T L                                 +P  VQGKSSK LVE ESYT L
Subjt:  QFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLVEYESYTAL

Query:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN
        NS IENLT LVM SM QQS  GA  G ANVNQIQGISCSFCEGD+HYNNCPGN ESVYYLGNPQNNRNN YSNTYNPGWRNHPNFSWSG+QGGHNAG S+
Subjt:  NSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAGRSN

Query:  APAFQ----------------------------QKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPI
        APAFQ                            +KLMKQYMANNDATVQSQA SLRNL+LQVGQLA DLKS+PI
Subjt:  APAFQ----------------------------QKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPI

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGACCTACGTGTCGTCCTGGAGCGACCATCCCTACGGAGGGTTCATTGTATGGAAATCAAAATCAAGGTTCGTCTTGCAAGAGGATTGTTCTCAAGCTCCTGCG
CCTAACGCTACTGTGGCGGTGCGCAACGCCTATGACATGTGGATCAAGGCCAATGACAAAGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAG
AAGCACGAGGACACGGTCACCGCTAAGAAGATCATGGACTCGCTGCAGAGCATTGCGAGAACACGTTCTCAACCTGACGGGGCCGTCATAGACGAGCAGAGTCAG
GTCAGCTTTATGCTGGAATCTCATCCGAAGAGTTTCCTTCCATTTCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTG
CAGACCTACCAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAAAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAA
GGCAAGGTCAAGGTTACAGAGAAAGGAAAGTGTTTCCACTGCAACATGGACGAGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAAC
GAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGTCACTAATCACGTTTGTTCTTCATTT
CAGGGAATTAGTTCTTGGAGGCAGCTTGACGCCGGAGAGATGACTCTCAAGGTCGGAACGGGAGAGGTCGTCTCAGCTGTGGGGGACCATTTGGGAGTGCGAATT
AATCAAAAGAAGCGAAAAGACGAAAAAACACCTCAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGC
GTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACCAATTTGCTTTGGAATCAGTCAGTGAATCCTGGGATCGGTTCAAACGATTATTGCAGAGCTGCTCT
CACCATGGGATCCCAAGATACATACAGATAGAGACATATTACAAAGGTCTTGATGATGTCACACTCCTAGTGATTGATGCGTCTGCAAATGGGGCTTTGCTAGTA
AAACTCTATGCTAAAGCATTTAATATTTTGGAAAGAATATCATCAAACAATCACTCATGGTCTAATCCTACAGTTGTTCAAGGAAAATCGAGTAAGAGGCTGGTT
GAGTATGAATCATATACTGCATTGAATTCAAAGATTGAGAATCTGACGGCCTTGGTAATGACGAGTATGACGCAGCAAAGTCCAGCTGGAGCATCAGTTGGTATG
GCTAATGTTAATCAAATTCAAGGAATTTCTTGCTCTTTCTGCGAAGGAGACAACCATTACAACAACTGCCCTGGAAATTCGGAGTCAGTTTATTATCTTGGGAAC
CCGCAGAATAATAGAAACAATCCGTATTCGAACACGTACAATCCTGGCTGGAGGAATCATCCTAATTTTAGTTGGAGTGGCAATCAAGGAGGACACAATGCGGGA
AGATCTAATGCTCCAGCATTTCAGCAGAAGCTGATGAAGCAATACATGGCCAATAATGATGCCACTGTGCAAAGCCAAGCTGCATCATTAAGAAACCTAGAATTG
CAAGTAGGCCAGTTAGCTAAGGATCTAAAGAGCAGACCGATTGGAGCATTACCCAACGATACAGAAGTGCCAAAAAGATATGGTAAGGAACAATGCAAGGCCCTC
ACTTTGCGAAGGGGGAAAGCATTACCTCCGGCGCATCCAAATGCCCCAGCTTGA
mRNA sequence
ATGACCTACGTGTCGTCCTGGAGCGACCATCCCTACGGAGGGTTCATTGTATGGAAATCAAAATCAAGGTTCGTCTTGCAAGAGGATTGTTCTCAAGCTCCTGCG
CCTAACGCTACTGTGGCGGTGCGCAACGCCTATGACATGTGGATCAAGGCCAATGACAAAGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAG
AAGCACGAGGACACGGTCACCGCTAAGAAGATCATGGACTCGCTGCAGAGCATTGCGAGAACACGTTCTCAACCTGACGGGGCCGTCATAGACGAGCAGAGTCAG
GTCAGCTTTATGCTGGAATCTCATCCGAAGAGTTTCCTTCCATTTCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTG
CAGACCTACCAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAAAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAA
GGCAAGGTCAAGGTTACAGAGAAAGGAAAGTGTTTCCACTGCAACATGGACGAGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAAC
GAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGTCACTAATCACGTTTGTTCTTCATTT
CAGGGAATTAGTTCTTGGAGGCAGCTTGACGCCGGAGAGATGACTCTCAAGGTCGGAACGGGAGAGGTCGTCTCAGCTGTGGGGGACCATTTGGGAGTGCGAATT
AATCAAAAGAAGCGAAAAGACGAAAAAACACCTCAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGC
GTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACCAATTTGCTTTGGAATCAGTCAGTGAATCCTGGGATCGGTTCAAACGATTATTGCAGAGCTGCTCT
CACCATGGGATCCCAAGATACATACAGATAGAGACATATTACAAAGGTCTTGATGATGTCACACTCCTAGTGATTGATGCGTCTGCAAATGGGGCTTTGCTAGTA
AAACTCTATGCTAAAGCATTTAATATTTTGGAAAGAATATCATCAAACAATCACTCATGGTCTAATCCTACAGTTGTTCAAGGAAAATCGAGTAAGAGGCTGGTT
GAGTATGAATCATATACTGCATTGAATTCAAAGATTGAGAATCTGACGGCCTTGGTAATGACGAGTATGACGCAGCAAAGTCCAGCTGGAGCATCAGTTGGTATG
GCTAATGTTAATCAAATTCAAGGAATTTCTTGCTCTTTCTGCGAAGGAGACAACCATTACAACAACTGCCCTGGAAATTCGGAGTCAGTTTATTATCTTGGGAAC
CCGCAGAATAATAGAAACAATCCGTATTCGAACACGTACAATCCTGGCTGGAGGAATCATCCTAATTTTAGTTGGAGTGGCAATCAAGGAGGACACAATGCGGGA
AGATCTAATGCTCCAGCATTTCAGCAGAAGCTGATGAAGCAATACATGGCCAATAATGATGCCACTGTGCAAAGCCAAGCTGCATCATTAAGAAACCTAGAATTG
CAAGTAGGCCAGTTAGCTAAGGATCTAAAGAGCAGACCGATTGGAGCATTACCCAACGATACAGAAGTGCCAAAAAGATATGGTAAGGAACAATGCAAGGCCCTC
ACTTTGCGAAGGGGGAAAGCATTACCTCCGGCGCATCCAAATGCCCCAGCTTGA
Protein sequence
MTYVSSWSDHPYGGFIVWKSKSRFVLQEDCSQAPAPNATVAVRNAYDMWIKANDKAKVYILASISDVLAKKHEDTVTAKKIMDSLQSIARTRSQPDGAVIDEQSQ
VSFMLESHPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVTEKGKCFHCNMDEHWKRNCPKYLAEKKKAN
EGKYDLLVLETCLVENDDSAWILDSGVTNHVCSSFQGISSWRQLDAGEMTLKVGTGEVVSAVGDHLGVRINQKKRKDEKTPQEAPGAWEACRKTGFLPTLPLMKR
VFQCVLVVPTDAYQFALESVSESWDRFKRLLQSCSHHGIPRYIQIETYYKGLDDVTLLVIDASANGALLVKLYAKAFNILERISSNNHSWSNPTVVQGKSSKRLV
EYESYTALNSKIENLTALVMTSMTQQSPAGASVGMANVNQIQGISCSFCEGDNHYNNCPGNSESVYYLGNPQNNRNNPYSNTYNPGWRNHPNFSWSGNQGGHNAG
RSNAPAFQQKLMKQYMANNDATVQSQAASLRNLELQVGQLAKDLKSRPIGALPNDTEVPKRYGKEQCKALTLRRGKALPPAHPNAPA
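
The protein sequence above appears to be the conceptual translation of the CDS. A minimal Python sketch of that relationship follows, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and only the first and last codons of the CDS are used to keep the example short.

```python
# Standard genetic code, laid out in the usual TCAG order (an assumption about
# which table applies; this page does not name one).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGACCTACGTGTCGTCCTGG"))
# Last three codons of the CDS listed above, ending in the TGA stop codon.
print(translate("CCAGCTTGA"))
```

Passing the full CDS, with its line breaks, to `translate` should allow a direct comparison with the protein sequence listed on this page.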