; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007186 (gene) of Chayote v1 genome

Gene IDSed0007186
OrganismSechium edule (Chayote v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationLG02:41426750..41431228
RNA-Seq ExpressionSed0007186
SyntenySed0007186
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAF79879.1 T7N9.5 [Arabidopsis thaliana]6.7e-2145.6Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPF-KDNCTSIKPTFDPF
        P+++N T +  L K K DYS+L+ FGC CY S     R KFDPR K C+F+GYP+G KGYK  D E +   ISR V+F E  FPF   N T     F  F
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPF-KDNCTSIKPTFDPF

Query:  PDIVLPCPMNYDFTIPRLHTHDSSP
        P I LP P N D  +P + +   +P
Subjt:  PDIVLPCPMNYDFTIPRLHTHDSSP

KAB1219409.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morella rubra]1.5e-2043.18Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFP
        PLL N + +  L      Y +LRVFGC CY    +  R KFDPR + C+FVGYP GIKGYK +D + H   +S DV F E  FPFK++  +  P F   P
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFP

Query:  DIVLPCPM---NYDFTIPRLHTHDSSPTQTIP
        DIVLP  +    +    P L T     T T P
Subjt:  DIVLPCPM---NYDFTIPRLHTHDSSPTQTIP

KAG7556653.1 Retrotransposon Copia-like N-terminal [Arabidopsis suecica]2.5e-2044Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPF-KDNCTSIKPTFDPF
        P+L N T +  L  E+ DYS+L+ FGC CY S     R KFDPR + C+F+GYP G KGYK  D E +   +SR V+F E  FPF   N T    TF  F
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPF-KDNCTSIKPTFDPF

Query:  PDIVLPCPMNYDFTIPRLHTHDSSP
        P   LP P N+D  +P + +   +P
Subjt:  PDIVLPCPMNYDFTIPRLHTHDSSP

XP_012837652.1 PREDICTED: uncharacterized protein LOC105958190 [Erythranthe guttata]1.3e-2148.57Show/hide
Query:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCT-SIKPTFDPFP
        +L+++T F  L  + V Y+ LRVFGC  ++S     R KFDPR +IC+F+GYP G+KGYK  D + HE  +SR+V+F E  FPF DN      P+ +PFP
Subjt:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCT-SIKPTFDPFP

Query:  DIVLP
        D+VLP
Subjt:  DIVLP

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]1.9e-2032.54Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFP
        P+L   T ++ L     DYS+L+VFGC C++S     R KF PR    VFVGYP G+KGYK +D E   F +SRDV+F E  FPF     +  P  DPFP
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFP

Query:  DIVLPCPMNYDFT-----IPRLH------------THDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYD
         +V+  P +YD        P  H            + D SPT  IP    +    E   +   +L  N + +   +         V +SL++ +D  +  
Subjt:  DIVLPCPMNYDFT-----IPRLH------------THDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYD

Query:  THSNVPATNSLPFGINRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH
          +NV +   +P           P++SV+L     R+S+R    PS+L+DYH
Subjt:  THSNVPATNSLPFGINRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH

TrEMBL top hitse value%identityAlignment
A0A2N9E374 Integrase catalytic domain-containing protein9.1e-2434.32Show/hide
Query:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPD
        +L + T F  L+K K  +S+L++FGC CY S     R KF PR   CVF+GYP  +KGYK  D   H+  ISRDV F E  FPF  N  SI    DPF  
Subjt:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPD

Query:  IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVD-DVCEEVQDHGAL-LSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGIN
        +VLP        I   HT   S   T P +  +  D    + DH A  + D+++  + +   S P++                D+ +++P ++S+P    
Subjt:  IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVD-DVCEEVQDHGAL-LSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGIN

Query:  RSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH
         S+ + S N     P    RKSTR+   P +L DYH
Subjt:  RSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH

A0A2N9G5F4 Uncharacterized protein1.6e-2336.02Show/hide
Query:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPD
        +L + T F  L+K K  +S+L++FGC CY+S     R KF PR   CVF+GYP G+KGYK  D   H+  ISRDV+F E  FPF  +  SI    DPF  
Subjt:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPD

Query:  IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVD-DVCEEVQDHGAL-LSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGIN
        +VLP  ++ D      HT   S   T P +  +  D    + DH A  + D++             S   IDS  S  D   +D+ ++ P + S+P    
Subjt:  IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVD-DVCEEVQDHGAL-LSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGIN

Query:  RSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH
         S  + S N     P    RKSTR    P +L DYH
Subjt:  RSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH

A0A2N9GUM1 Integrase catalytic domain-containing protein4.5e-2333.06Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFK-------------D
        PLLS+ + +  L  +   YS+L+VFGC C+ S     R KFDPR K CVF+GYP  +KGYK +D   H+F +SRDVVF E  FPF+              
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFK-------------D

Query:  NCTSIKPTFDPFPDIVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSN
        + T I+P+F+      LP P++  F I  L +HD+S                               V+   +++ P    ++DS +    L   D H +
Subjt:  NCTSIKPTFDPFPDIVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSN

Query:  VPATNSLPFGINRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH
        +P+       I+ SS I   N++ S P+   R+STR    PS+L+DYH
Subjt:  VPATNSLPFGINRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH

A0A2N9GUQ2 Integrase catalytic domain-containing protein9.1e-2435.32Show/hide
Query:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPD
        +L + T F  L+K K  +S+L++FGC CY+S     R KF PR   CVF+GYP  +KGYK  D   H+  +SRDV+F E  FPF  N  SI    DPF  
Subjt:  LLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPD

Query:  IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGAL-LSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGINR
        +VLP  ++ + T  +      S    IP D ++      + DH A  + D++             S   IDS  S  D  L D+ +++P ++S+P     
Subjt:  IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGAL-LSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGINR

Query:  SSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH
        S  + S N     P    RKSTR    P +L DYH
Subjt:  SSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH

A0A2N9J064 Integrase catalytic domain-containing protein3.5e-2337.55Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFP
        PLLSN + F  L  +   Y++L+VFGC C+ S     R KFDPR K C F+GYP G+KGYK  +   H+ +ISRDVVF E  FPF+ N T I P F  F 
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFP

Query:  DIVLPC-PMNYDFTIPRLHTHDSSPT-QTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGI
           L C P     T P  HT    PT   IP  H              L++D L+  +     + P S     S L T  LL +++ S+ P+ + +    
Subjt:  DIVLPC-PMNYDFTIPRLHTHDSSPT-QTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGI

Query:  NRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH
            E  SP +SV  P+   R+STR    P++L+DYH
Subjt:  NRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFLKDYH

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-0734.04Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKP
        PLL   + F  L     +Y  LRVFGC CY       + K D + + CVF+GY +    Y     +     ISR V F E  FPF +   ++ P
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-0625.19Show/hide
Query:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDN----CTSIKPTF
        PLL   + F  L  +  +Y  L+VFGC CY       R K + + K C F+GY +    Y            SR V F E  FPF        TS +   
Subjt:  PLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDN----CTSIKPTF

Query:  DPFPD-----------IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLAS------QPDSRDVIDSLLSTVDL
        D  P+           +VLP P       P L T    P+   P+       C        L S +++  ++    +      QP ++       ++   
Subjt:  DPFPD-----------IVLPCPMNYDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLAS------QPDSRDVIDSLLSTVDL

Query:  LLYDTHSNVPATNS------LPFGINRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFL
        +L + + N P+ NS      LP     S  I +P+ S+S P + +  ST T   P  L
Subjt:  LLYDTHSNVPATNS------LPFGINRSSEIGSPNESVSLPVTCTRKSTRTRVAPSFL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTGCTGTCCAATGCCACTCATTTCTCCAATTTGAATAAGGAAAAGGTGGATTATTCTAACCTTCGAGTCTTTGGATGCAAATGTTACATGTCTAATTTTCAATC
TAAGAGAATTAAATTTGATCCCCGAGTAAAGATCTGTGTTTTTGTGGGCTATCCAGTTGGCATTAAAGGTTACAAACAGTTTGACTTCGAGAAGCATGAGTTTGTTATAT
CTAGAGATGTAGTTTTTATTGAAATCCAATTTCCGTTTAAGGATAATTGCACATCTATAAAGCCAACATTTGATCCATTTCCTGATATTGTTTTGCCTTGTCCTATGAAC
TATGATTTCACCATACCACGCTTACATACACATGATTCAAGTCCTACTCAAACCATCCCAATTGATCACACTGTTGATGATGTATGTGAAGAAGTACAAGATCATGGTGC
GTTGCTTTCTGATAATTTGAACGGTGTCACTACAGAGGTTTTAGCATCACAACCTGATTCAAGGGATGTAATTGACTCCCTTTTGTCTACTGTTGATCTTTTACTATATG
ATACACACTCAAATGTTCCAGCTACTAATTCTTTACCCTTTGGTATCAATCGTTCCTCTGAGATTGGTTCTCCTAATGAATCTGTGTCTTTACCTGTCACTTGTACTCGT
AAATCTACTAGGACAAGGGTTGCACCTTCATTTTTAAAGGACTATCATTGA
mRNA sequenceShow/hide mRNA sequence
CACAACTTTACTCTCTAAATATTGTTATGGTATCAGAGCACATTTAGTGCTATTCCACAGTTTTTTTTTCCTTCTGCAATTCTCCAAATCTTGCCCAGAAAATGACTGAA
TCAGGAAAGATAAGTGCGGAAGATTCCACGACCATGGAAGCAACCATTGAAGCTCAAACTAATCCTTTTCTCATCCACTCTTCGTTCAATTCCATCACCACCCTAGTTAC
TCAACCTCTTGTTGGTGCTGCGAATTACGTTTCTTGGATAAAGGCTATGAACTCAGTTTCAAAGGAGATTGCTGCCAGCATTGTCTATAGTGGAAACGTTCAAGAAATCT
GGGACAAACTTGCCGAGAGGTTCGAAGAAAGCAACATGCCTACGGTATTTCAACTTAGAAAAGAGCTCGCAACCACATCTCAAGGTTCGTTCTCCATTGAAGTATATTTC
ACGAAACTCAAAACTATTTGGCAAGAACTCGTCGATTTCAAACCCTATAACGATTGTACCTGTGGAGGCTTCAAGTCTACTCTCGATCACATGAACTCAGAATACACTAT
GATTTTTCTCTTGGGACTTAACGACTCTCACAAATTTTGCTGATGGATCCCATACCTTAAATCAGTAAGGTATTTTCTTTGGTAATCCAAGGAGAAAGACAAAGAACTGT
AGGGGCACTTTCTCAATCAATAGATCCCATTGTCTAAATGGCAGCCGAGACTAAGAAAGGAGATGCGATTATCTCGGGAAATCAGGCACGACGAAGCAGCTTGAGATGTA
CACATTGCGACTATAGAGGACACACTAAAGAAAAGTGTTACAAGCTTCATGGATACCCACCTGATTATTGACCTAGAAATTATGCTTCTGTTATTGGTAACACCAATATG
GTTAGTTAAAACCCTGATTTTTTCTCAAGTTTGAGCACCTCTCAACATTCTCTCTTGCTCGGCCTTCTTGGTAATCCTTCGCAAGGGATAAAAACTGAGGCCATCAATGG
AGTAACTCATATTGCAGGAAAAAGCTTCATTGAGGACGATTGGCAAGGCTAGTGCTCTGAATGGACTTTACATCTTCTCACCTGTGAGAAAGCTTGCTCAGGTTCATAGT
GTTTTACATGATTCCCATCACAATAATAAAACTGATTCTATGATTTGTGCTGTTAGTGCCGACCTATGGCATAGTAGGTTAGGACACCTATCAGTTAGAAGATTACAATG
TTTGAAGGATGATTTGCGTTTGTTCTCTAGCTCTAGTAGTACATGTGATATATGTTCTTTAGCTAAACAACGTCGTCTTGTATTTCCTTTTCATAATAAAGTGGCTGCTG
AAATTTTTTATCTTGTTCATTGTGATGTATAGGGTCCATTCAAAGCATCTTCTTTGACTGGTTATCATTTTTTCTTAAGTATTGTTGATGATTGCTCAAGATATACATGG
GTCTACATGATGCGGTCAAAAAGTGATGTTCACACCATTATTCCTAGGTTTTTTAAACTAGTAGAAACCCAGTATAAGAAAACGATTAAAACTTTTTGCTCAAATAATGC
CCTTGAATTGAGATTTGAACTTTTTTTTTGCTTCCGTTGGGACTATGCATCAATTTTCATGTGTAGAGACACCTCAAAAGAACTCGGTAGTAGAACTCAAGCATCAACAC
CTATTAAATGTTGCACGAGCTTGTACTTTCAGTCTCATGTTCACATATATCTCTAGAGTGAGTGTGTGCTAGCTGCAACATACATAATAAGCCGCATTTCAATGCCTTTG
CTGTCCAATGCCACTCATTTCTCCAATTTGAATAAGGAAAAGGTGGATTATTCTAACCTTCGAGTCTTTGGATGCAAATGTTACATGTCTAATTTTCAATCTAAGAGAAT
TAAATTTGATCCCCGAGTAAAGATCTGTGTTTTTGTGGGCTATCCAGTTGGCATTAAAGGTTACAAACAGTTTGACTTCGAGAAGCATGAGTTTGTTATATCTAGAGATG
TAGTTTTTATTGAAATCCAATTTCCGTTTAAGGATAATTGCACATCTATAAAGCCAACATTTGATCCATTTCCTGATATTGTTTTGCCTTGTCCTATGAACTATGATTTC
ACCATACCACGCTTACATACACATGATTCAAGTCCTACTCAAACCATCCCAATTGATCACACTGTTGATGATGTATGTGAAGAAGTACAAGATCATGGTGCGTTGCTTTC
TGATAATTTGAACGGTGTCACTACAGAGGTTTTAGCATCACAACCTGATTCAAGGGATGTAATTGACTCCCTTTTGTCTACTGTTGATCTTTTACTATATGATACACACT
CAAATGTTCCAGCTACTAATTCTTTACCCTTTGGTATCAATCGTTCCTCTGAGATTGGTTCTCCTAATGAATCTGTGTCTTTACCTGTCACTTGTACTCGTAAATCTACT
AGGACAAGGGTTGCACCTTCATTTTTAAAGGACTATCATTGAGGTCTCCTCATGAACTCTTCCACATCCAAGAATACACAATGCTCTTTTCCTCTCCAAAAGTTCTTGTC
CTATGAGAACTTTTCAACAAATCAACAAAAATTTCTTCTGAATGTTATTGTTGCTTATCAACCTTCCTTCTATCATCAAGCAATAAAAGATCAGAAGTGAAAGGATGCTA
TGGATCAGGAACTTGTTGCTATGGAGAAGACACATACTTGGAGCCTTATCCCCTTGTCAAAATGACGTCATGTCGTGGGTTGCAAATGGGTCTATAAAGTTAAATATAAA
CCTGAGGGAACTGTGGACCGATATAAGGCTAGGCTTGTTGCGAAAGGCTACAGTCAATTTGAAGACGTTGACTTTCTTGATACTTTCTCCCTAGTGGCGAAAAATGTCTC
TGTAAAGGTTGTTCTTTCCTTAGCAGCCTCATATAAGTGGCCATTGTCTCAAATGGATGTCAACGATGCTTTTTTAAATGGTGAGTTATTTTAGGAAGTTTATATGTCAT
GACCTTAGGGATAGTATGAAAGTCAACAAGATAAGGCATATGCGCCCCTCGTTTGTAGGCTTCACAAGTCCATATATGGACTCAAACAGGTTTCAAGGAAATGATTCTCT
AAGTTCTCTAGTGTCTAGTTAAATCATGGTTTCTCTCGATCCAAGGCAGATTACTCACTCTTTGTAAAAGGACATGGGGAAGGCTTTGTAGCATTACTCATTTATGTAGA
TGACATTCTGATTACAGGACCTTCTCCATCTCATGTTGAACAAGTCAAAGAGTTGCTTAAGTCATGTTTCATGATGAAAGACTTAGGAAACGCTCGCTTTTTTCTTGGGT
TAGAAATATCTCGATCCACGACTGGTTTATATATGACACAACGGAAGACACAATGGAAGTATTGTCTTCAATTGTTAGAAGACTACGATTTGTTGGGAGCCAAAGTAGCT
TCTCAACCTATGATGCCTAATCAGAAGTTCTCTAATGCTGATGGGGAGTAACTAGATAATGAGAAATCAACTGAATATAGAAGGCTAATCGGGAGATTGTTATATTTGAA
AATTACAAGACCTGATATTTGCTTTGTCGTTCATAAATTGAGTCAGTTCGTTTCTCAACCACATACGGCCCATTTTGATGCTGCAATGTACCTTCTTCGTTACCTCAAAG
GAAATGCCGGACATGGTGCTGTCATGCGAGCTAATAGCTTTTTTCAACTAAAAGCCTTTGTTGATTCGGATTGGGGCTCATGTCTGGACACTCGGTGTTCTGTTACGGGT
TTCTGTATTTTTTGGGTAGTTCCCTCATATCTTGGAAGACTAAAAAACAAGCTGTTGTGTCAAGGTCGTCCGCTGAAGCCGAATATCGAGCTTTGGCTACTGTCACCAGC
CAATTAACTTGGTTGCACTCCCTCATTAACGATTTGCAGATTACTACTCCTCCTCCCTCGATTATCTATTGTGATAACCTTGCAGCCATAGCCATTGCAACCAATCTGAT
TTTTCATGAACGTACGAAGCATATAGAAGTTGATTGTCACTTGGTGAAAGATAAGATTGATGACGGTTTGATAAAATTGCTTCCTGTTCGTTCCAACGGACAACTAGCTG
ATATGTTTATCAAAGCACTAAGTATAACTCAGATCAAAAGCTTCATGGTCAAGATGGGCATTCTTAATCTTCATGATTGTCCATCTTAAGGGGTGAG
Protein sequenceShow/hide protein sequence
MPLLSNATHFSNLNKEKVDYSNLRVFGCKCYMSNFQSKRIKFDPRVKICVFVGYPVGIKGYKQFDFEKHEFVISRDVVFIEIQFPFKDNCTSIKPTFDPFPDIVLPCPMN
YDFTIPRLHTHDSSPTQTIPIDHTVDDVCEEVQDHGALLSDNLNGVTTEVLASQPDSRDVIDSLLSTVDLLLYDTHSNVPATNSLPFGINRSSEIGSPNESVSLPVTCTR
KSTRTRVAPSFLKDYH