; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g31400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g31400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr8:22605687..22615264
RNA-Seq ExpressionMoc08g31400
SyntenyMoc08g31400
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10423.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.2e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

TYK21209.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.2e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

TYK23724.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.2e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]5.3e-3747.06Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEF--VAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        EKYS+GH+CKNQ+L V+VVH+D   E  E+ + E +    G E   V E   +A NT+V F+T  T KL+G I+ +EVV++IDC ATHNFISQ + +   
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEF--VAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDREALV
        L   ETS+YGVIMG G  V+G G+C+G++L LP++T +++FLPL+LG+     D VLG+QWL     M+V+W  L MSF+     ++
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDREALV

XP_022848903.1 uncharacterized protein LOC111371244 [Olea europaea var. sylvestris]2.8e-3045.51Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEFVAFNTLVAFTT-ETQKLKGAIQGREVVVMIDCRATHNFISQHVANDLKLH
        EK+  GHKC+N++L V VV  +E  E EE+  V+        E G+VVE ++ N++V     ++ KLKG I G  V+V+IDC ATHNFIS  +A  L++ 
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEFVAFNTLVAFTT-ETQKLKGAIQGREVVVMIDCRATHNFISQHVANDLKLH

Query:  CIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFK
         + T  YG+IMG G+ V+G G+C+GVV++LP I   DDFLPLKLG      D +LG++WL+ + KM+V+W  L M  K
Subjt:  CIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFK

TrEMBL top hitse value%identityAlignment
A0A5D3CEX8 Ty3/gypsy retrotransposon protein4.0e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

A0A5D3DD68 Ty3/gypsy retrotransposon protein4.0e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

A0A5D3DJA9 Ty3/gypsy retrotransposon protein4.0e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

A0A5D3DRT3 Ty3/gypsy retrotransposon protein4.0e-3043.02Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        E +S GH+CKN++L + VV +D       + +VE ++ A+  E  +V  V  ++ N++V  T   T KLKG ++ +E+V+M+DC ATHNFIS  +  +LK
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKV--VEFVAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF
        L   ET++YGVIMG G  V+G G+C+G+ + LP I+  +DFLPL+LG+     D VLG+QWL++   M V+W  L M+F
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSF

A0A6J1DN22 Reverse transcriptase2.6e-3747.06Show/hide
Query:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEF--VAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK
        EKYS+GH+CKNQ+L V+VVH+D   E  E+ + E +    G E   V E   +A NT+V F+T  T KL+G I+ +EVV++IDC ATHNFISQ + +   
Subjt:  EKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEF--VAFNTLVAFTTE-TQKLKGAIQGREVVVMIDCRATHNFISQHVANDLK

Query:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDREALV
        L   ETS+YGVIMG G  V+G G+C+G++L LP++T +++FLPL+LG+     D VLG+QWL     M+V+W  L MSF+     ++
Subjt:  LHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDREALV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.7e-0932.58Show/hide
Query:  GAIQGREVVVMIDCRATHNFISQHVANDLKLHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKV
        G I   +VVV ID  AT NFI   +A  LKL    T+   V++G    ++  G C G+ L + ++   ++FL L L   D   D +LG +WL ++ +  V
Subjt:  GAIQGREVVVMIDCRATHNFISQHVANDLKLHCIETSSYGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKV

Query:  NWSVLMMSFKRDREALVSWKNLPEEKATWEAV
        NW     SF  +++    W  L  E    E V
Subjt:  NWSVLMMSFKRDREALVSWKNLPEEKATWEAV

AT3G30770.1 Eukaryotic aspartyl protease family protein5.2e-0629.24Show/hide
Query:  HNDEIAEEEEMT--EVEALNQAHGNE-KGKVVEFVAFNTLV-AFTTETQKLK-----GAIQGREVVVMIDCRATHNFISQHVANDLKLHCIETSSYGVIM
        + D+  + E M+   VE     +GNE +G + +F     +    TTE  K K     G I   +VVV+ID  AT+NFIS  +A  LKL    T+   V++
Subjt:  HNDEIAEEEEMT--EVEALNQAHGNE-KGKVVEFVAFNTLV-AFTTETQKLK-----GAIQGREVVVMIDCRATHNFISQHVANDLKLHCIETSSYGVIM

Query:  GLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDRE
        G    ++  G C G+ L + ++   ++FL L L   D   D +LG    + + +  + W     SF  +++
Subjt:  GLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCCGATGCTGGAGAAATACTCTGTTGGCCATAAATGTAAGAATCAGAAGCTGGGGGTGTACGTGGTCCATAACGATGAAATAGCAGAGGAAGAGGAGATGACAGA
GGTGGAGGCTTTGAACCAAGCACACGGCAACGAGAAGGGTAAGGTTGTTGAATTTGTAGCTTTTAACACCTTGGTCGCATTTACGACAGAGACGCAAAAATTAAAAGGGG
CTATCCAAGGTAGGGAAGTGGTGGTTATGATTGACTGTAGAGCCACACATAATTTCATCTCTCAGCATGTGGCTAATGACTTGAAATTGCATTGCATAGAGACCTCGAGT
TATGGGGTGATAATGGGTTTGGGCACGCCAGTTAAAGGAACAGGAATGTGCAGGGGAGTGGTGCTAAATCTGCCAAAGATTACGACTAAGGATGATTTCTTGCCACTCAA
ACTAGGAAGTTTTGATGGGAAAGATGATGGGGTGTTAGGAATACAATGGTTAAAGCGCATGAGAAAGATGAAAGTGAACTGGTCAGTGTTAATGATGTCTTTCAAACGCG
ACAGGGAAGCTTTGGTCAGTTGGAAGAACTTGCCAGAAGAAAAAGCAACGTGGGAAGCAGTTCATGACTTAAGCCGGCAGTTTCCCGATTTTCACCGTCTGCATTTTCCT
GAGTCACACCTTGAGAATATGACTTTTGAACCAACGGCTGATGTTAGAGCTCCAGTTGTTAACACTTATAGTAGGAGGGGTAGAGTGGCAATTCTACATTCATCTAGGGC
CGGAAGGAGGGAGAGGGCTTCAACTTCTCAAGTACCCTTTGAAGATTCCCCGATAAAAAGTATAAAGTCTCGTCGTCGTAAACGTAGTGAATCATCGGAGGGACGTGGTG
ATTGCTCCGATTACTTGTACTCCAACTGCCCTGAGGAGTTGTTAGGAATTCTTAGATATAACTACTCCATCCCAAATGACATTGAATTGAGGATTCCTGCGGCAGGCGAA
ACGATTAACAAACCTCCGCCTGGGTGCGTCAGTTTTTATCCTCAAATGTTTGAGTACGGGGTTAGGTTACCCTTGCATCTCTTCGCCCAAATCTTTCTTAATGTTGTCAA
CCTAGCTCCTGCTCAGCTAACACCAAATGGATGGGGCACCCTAATGGGTTTCTGTGGCTATGACCCACTTTGTAAATGTTATAGACAACTAGATCCAGGTCGTTCGTGTG
GAGACATGCGAGTGGGGTTCTGCAAAAACAAGCTAGACAATAACAAAGAAGGTGAACTCCCAGCATCTCTCTCAAGCAATCTCTTTCAATTTCCCCTCTTGTTCCAAAGA
AACACTTCCACAAGAACGATCTCGGTACCCAAAGGATGCTTGGAAGATCGTTTGGTGGTGTTTGGGAATTCTTTTGAAGAATTGTTCATCAAAGGATTCACATCCCATCA
GTATCTATGGTGGCGATTCTGTGAGGATAGAACTTGTGTCCTATCCGTCGAACAAATGCTTTCCATCCATACCATCAAGAAATCGTCCAAAGCCCCTCGTCGATTTTACT
TAAGTTGTTTTCTTGGTATCGCCAAATTAGTCAATGGACCGGTAATGGAGAAGGGCTTCTCTTCAGTCGTGAACGGACCCACTTCCATAAAGAACTGGAAGCAAATGTGG
TTTTATGTTTCCAGAAATTGGTTAATGACGACCAGCGAAGAGGCCCCTTATTGTGAGGTCCCTAAGGAATTCGGTAGATTGGTGTTGATCCATCCTCCGCCAGAATTGAC
CGAGGAAACCAGATCGGTTTTGGCTGGTTGTGCTACTATTTCAGCCACGGATCATTATAATCCCGACCTTCTGTCCAATCGGAACTTGAGAAAGGAGATGAACCCCCGAC
CATCTAGGTTTAATAGCAGTCGATGCCACGGTGTACCCCTTGGGAATGGTTTAGCTGACTCCGCTAGGACCAGGTCTGCAGGACTAGGATGTGATCAGCCTCCACCGCTT
AGACCGATGCCTCCGAAGAAAACCCTCAAACGGAAAATTGCTTTACCTGTCATTGAGATAAATGAACATGGGATCTCCTCCACTCATACTTCGCAGAACGACAATGCTGG
CGACCAACTTCAAAGCTCCGGTGGCCTCCAAACTGAGCTTGTCGTCCTGACCGACCCAACCTTCAGAATTGACGAGGGATCTGGCTTTGCCGGTCTGGGGGATGGACCTA
GTATTTCCCTTCAAGAGCTCGACGAGGTCCAATCTCACTCCCCTTTAGGTGACGATACGACTTTCACTACTGGACCATCAAACCTTGAAACCCTCAACGTTCTCGCTGCG
TCAGAGAAATGCATCGTCGAAAATAACAAGTCTAAGGTCGAGCAGACCAAATTGGAGGAAGAAAATCGTCATTTGTTGACCGAAGTCTCCGAGTTGACCACTGAGGCTAA
TCAGATTAGGCCTCTCCTTGCACAATTAAAGGGCGAGCTCTGCAAACTCAGGACCTCTACAAAGAACGTTTTAAGTGTAGAGAATTGCAAATACAAAAGCTCGATTTGGC
GTTTTGACTTTCTTTACGAGATCATGTCTCAATTTCCAGACTTCAAAGAGTTGGAGAAGGATCTGGAATATGGCGGTCTTAACTATTTGGTCGAGTGGCTAAAAAAATTT
GCTCAGGAGGTCAACTCATATCAACTTGTTTTCATGTTTGACCGCGACTGGGAACTTGCTATGAAGAAATTAGTTGATCAGAACGACCGACCAAACCCAGTTGAAGGAAC
AAAACCTTTCCTCGCCTCTAGTGGGAGCGACCGAGCTAGAGCCCTAGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTCCGATGCTGGAGAAATACTCTGTTGGCCATAAATGTAAGAATCAGAAGCTGGGGGTGTACGTGGTCCATAACGATGAAATAGCAGAGGAAGAGGAGATGACAGA
GGTGGAGGCTTTGAACCAAGCACACGGCAACGAGAAGGGTAAGGTTGTTGAATTTGTAGCTTTTAACACCTTGGTCGCATTTACGACAGAGACGCAAAAATTAAAAGGGG
CTATCCAAGGTAGGGAAGTGGTGGTTATGATTGACTGTAGAGCCACACATAATTTCATCTCTCAGCATGTGGCTAATGACTTGAAATTGCATTGCATAGAGACCTCGAGT
TATGGGGTGATAATGGGTTTGGGCACGCCAGTTAAAGGAACAGGAATGTGCAGGGGAGTGGTGCTAAATCTGCCAAAGATTACGACTAAGGATGATTTCTTGCCACTCAA
ACTAGGAAGTTTTGATGGGAAAGATGATGGGGTGTTAGGAATACAATGGTTAAAGCGCATGAGAAAGATGAAAGTGAACTGGTCAGTGTTAATGATGTCTTTCAAACGCG
ACAGGGAAGCTTTGGTCAGTTGGAAGAACTTGCCAGAAGAAAAAGCAACGTGGGAAGCAGTTCATGACTTAAGCCGGCAGTTTCCCGATTTTCACCGTCTGCATTTTCCT
GAGTCACACCTTGAGAATATGACTTTTGAACCAACGGCTGATGTTAGAGCTCCAGTTGTTAACACTTATAGTAGGAGGGGTAGAGTGGCAATTCTACATTCATCTAGGGC
CGGAAGGAGGGAGAGGGCTTCAACTTCTCAAGTACCCTTTGAAGATTCCCCGATAAAAAGTATAAAGTCTCGTCGTCGTAAACGTAGTGAATCATCGGAGGGACGTGGTG
ATTGCTCCGATTACTTGTACTCCAACTGCCCTGAGGAGTTGTTAGGAATTCTTAGATATAACTACTCCATCCCAAATGACATTGAATTGAGGATTCCTGCGGCAGGCGAA
ACGATTAACAAACCTCCGCCTGGGTGCGTCAGTTTTTATCCTCAAATGTTTGAGTACGGGGTTAGGTTACCCTTGCATCTCTTCGCCCAAATCTTTCTTAATGTTGTCAA
CCTAGCTCCTGCTCAGCTAACACCAAATGGATGGGGCACCCTAATGGGTTTCTGTGGCTATGACCCACTTTGTAAATGTTATAGACAACTAGATCCAGGTCGTTCGTGTG
GAGACATGCGAGTGGGGTTCTGCAAAAACAAGCTAGACAATAACAAAGAAGGTGAACTCCCAGCATCTCTCTCAAGCAATCTCTTTCAATTTCCCCTCTTGTTCCAAAGA
AACACTTCCACAAGAACGATCTCGGTACCCAAAGGATGCTTGGAAGATCGTTTGGTGGTGTTTGGGAATTCTTTTGAAGAATTGTTCATCAAAGGATTCACATCCCATCA
GTATCTATGGTGGCGATTCTGTGAGGATAGAACTTGTGTCCTATCCGTCGAACAAATGCTTTCCATCCATACCATCAAGAAATCGTCCAAAGCCCCTCGTCGATTTTACT
TAAGTTGTTTTCTTGGTATCGCCAAATTAGTCAATGGACCGGTAATGGAGAAGGGCTTCTCTTCAGTCGTGAACGGACCCACTTCCATAAAGAACTGGAAGCAAATGTGG
TTTTATGTTTCCAGAAATTGGTTAATGACGACCAGCGAAGAGGCCCCTTATTGTGAGGTCCCTAAGGAATTCGGTAGATTGGTGTTGATCCATCCTCCGCCAGAATTGAC
CGAGGAAACCAGATCGGTTTTGGCTGGTTGTGCTACTATTTCAGCCACGGATCATTATAATCCCGACCTTCTGTCCAATCGGAACTTGAGAAAGGAGATGAACCCCCGAC
CATCTAGGTTTAATAGCAGTCGATGCCACGGTGTACCCCTTGGGAATGGTTTAGCTGACTCCGCTAGGACCAGGTCTGCAGGACTAGGATGTGATCAGCCTCCACCGCTT
AGACCGATGCCTCCGAAGAAAACCCTCAAACGGAAAATTGCTTTACCTGTCATTGAGATAAATGAACATGGGATCTCCTCCACTCATACTTCGCAGAACGACAATGCTGG
CGACCAACTTCAAAGCTCCGGTGGCCTCCAAACTGAGCTTGTCGTCCTGACCGACCCAACCTTCAGAATTGACGAGGGATCTGGCTTTGCCGGTCTGGGGGATGGACCTA
GTATTTCCCTTCAAGAGCTCGACGAGGTCCAATCTCACTCCCCTTTAGGTGACGATACGACTTTCACTACTGGACCATCAAACCTTGAAACCCTCAACGTTCTCGCTGCG
TCAGAGAAATGCATCGTCGAAAATAACAAGTCTAAGGTCGAGCAGACCAAATTGGAGGAAGAAAATCGTCATTTGTTGACCGAAGTCTCCGAGTTGACCACTGAGGCTAA
TCAGATTAGGCCTCTCCTTGCACAATTAAAGGGCGAGCTCTGCAAACTCAGGACCTCTACAAAGAACGTTTTAAGTGTAGAGAATTGCAAATACAAAAGCTCGATTTGGC
GTTTTGACTTTCTTTACGAGATCATGTCTCAATTTCCAGACTTCAAAGAGTTGGAGAAGGATCTGGAATATGGCGGTCTTAACTATTTGGTCGAGTGGCTAAAAAAATTT
GCTCAGGAGGTCAACTCATATCAACTTGTTTTCATGTTTGACCGCGACTGGGAACTTGCTATGAAGAAATTAGTTGATCAGAACGACCGACCAAACCCAGTTGAAGGAAC
AAAACCTTTCCTCGCCTCTAGTGGGAGCGACCGAGCTAGAGCCCTAGGCTAG
Protein sequenceShow/hide protein sequence
MFPMLEKYSVGHKCKNQKLGVYVVHNDEIAEEEEMTEVEALNQAHGNEKGKVVEFVAFNTLVAFTTETQKLKGAIQGREVVVMIDCRATHNFISQHVANDLKLHCIETSS
YGVIMGLGTPVKGTGMCRGVVLNLPKITTKDDFLPLKLGSFDGKDDGVLGIQWLKRMRKMKVNWSVLMMSFKRDREALVSWKNLPEEKATWEAVHDLSRQFPDFHRLHFP
ESHLENMTFEPTADVRAPVVNTYSRRGRVAILHSSRAGRRERASTSQVPFEDSPIKSIKSRRRKRSESSEGRGDCSDYLYSNCPEELLGILRYNYSIPNDIELRIPAAGE
TINKPPPGCVSFYPQMFEYGVRLPLHLFAQIFLNVVNLAPAQLTPNGWGTLMGFCGYDPLCKCYRQLDPGRSCGDMRVGFCKNKLDNNKEGELPASLSSNLFQFPLLFQR
NTSTRTISVPKGCLEDRLVVFGNSFEELFIKGFTSHQYLWWRFCEDRTCVLSVEQMLSIHTIKKSSKAPRRFYLSCFLGIAKLVNGPVMEKGFSSVVNGPTSIKNWKQMW
FYVSRNWLMTTSEEAPYCEVPKEFGRLVLIHPPPELTEETRSVLAGCATISATDHYNPDLLSNRNLRKEMNPRPSRFNSSRCHGVPLGNGLADSARTRSAGLGCDQPPPL
RPMPPKKTLKRKIALPVIEINEHGISSTHTSQNDNAGDQLQSSGGLQTELVVLTDPTFRIDEGSGFAGLGDGPSISLQELDEVQSHSPLGDDTTFTTGPSNLETLNVLAA
SEKCIVENNKSKVEQTKLEEENRHLLTEVSELTTEANQIRPLLAQLKGELCKLRTSTKNVLSVENCKYKSSIWRFDFLYEIMSQFPDFKELEKDLEYGGLNYLVEWLKKF
AQEVNSYQLVFMFDRDWELAMKKLVDQNDRPNPVEGTKPFLASSGSDRARALG