CuGenDBv2

Tan0002728 (gene) of Snake gourd v1 genome

Gene ID: Tan0002728
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CACTA en-spm transposon protein
Genome location: LG01:29101259..29105067
RNA-Seq Expression: Tan0002728
Synteny: Tan0002728
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant
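
The genome location in the summary above is written as a sequence name followed by a start..end range. As a minimal sketch, the snippet below splits that string into its parts and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `GenomeLocation` and `parse_location` are hypothetical names introduced only for this illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'LG01:29101259..29105067'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


if __name__ == "__main__":
    loc = parse_location("LG01:29101259..29105067")
    print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out as 3809 bp; with a half-open convention it would be one less.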


Homology
GenBank top hits (e value, % identity, alignment)
TYK17077.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa] (e value: 2.3e-45; % identity: 40.53)
Query:  FENSAPNASANPTTTVVSSTGETSDSS--ACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDH
        FE+   N  A  +++V  +TG +S  +    R R  SR  EL+R+V ++GRIP+ I    EKP+   A RFS AIG  VR++FPVR  KW DV  E  + 
Subjt:  FENSAPNASANPTTTVVSSTGETSDSS--ACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDH

Query:  MKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSK
        +K  L   F LD +   ++RF+ H++ +TFKE+R D  +H++K   PEE RA  P+  V    DWHFLCD + S   Q+   TNK  R+K P+N  + SK
Subjt:  MKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSK

Query:  SFMHVLQKQ----KSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE
        SF   LQ+Q    + +   V  +ELF++T  + G  +V+ AA +AH+QM ELQ + T EGS PL+ED I   VL       KG+GWG + K ++ +S   
Subjt:  SFMHVLQKQ----KSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE

Query:  T
        +
Subjt:  T

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] (e value: 1.2e-49; % identity: 38.89)
Query:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK
        F  S+    A   +   S + + S     R RG SRN ELDR+VN+HGRI IEI +E+ KPVC  AT+FSNAIGTI R + P+R   W DV  E++D + 
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK

Query:  NKLL---------------------------NHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPE
        ++LL                           ++F+ D+ +  V +++   +Q+TFKEYR DL +HYR+   P+E RA P K ++D  DW+ LC+RWE+PE
Subjt:  NKLL---------------------------NHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPE

Query:  CQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSS
         +K   TNKK+R K+P+   T SKSF+ V  + K K   +V  ++LF+Q+ + E   WVN  A +A+ +M+ L E S QE   P++   + + VL   S 
Subjt:  CQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSS

Query:  QVKGMG
         +KG+G
Subjt:  QVKGMG

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] (e value: 4.7e-54; % identity: 42.65)
Query:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK
        F  S+    A   +   S + + S     R RG SRN ELDR+VN+HGRI IEI +E+ KPVC  AT+FSNAIGTI R + P+R   W DV  E++D + 
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK

Query:  NKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSKSFM
        ++LL++F+ D+ +  V +++   +Q+TFKEYR DL +HYR+   P+E RA P K ++D  DW+ LC+RWE+PE +K   TNKK+R K+P+   T SKSF+
Subjt:  NKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSKSFM

Query:  HVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMG
         V  + K K   +V  ++LF+Q+ + E   WVN  A +A+ +M+ L E S QE   P++   + + VL   S  +KG+G
Subjt:  HVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMG

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] (e value: 1.2e-46; % identity: 39.51)
Query:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK
        F  S+    A   +   S + + S     R RG SRN ELDR+VN+HGRI IEI +E+ KPVC  AT+FSNAIGTI R + P+R   W DV  E++D + 
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK

Query:  NKLL---------------------------NHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPE
        ++LL                           ++F+ D+ +  V +++   +Q+TFKEYR DL +HYR+   P+E RA P K ++D  DW+ LC+RWE+PE
Subjt:  NKLL---------------------------NHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPE

Query:  CQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLA
         +K   TNKK+R K+P+   T SKSF+ V  + K K   +V  ++LF+Q+ + E   WVN  A +A+ +M+ L E S QE   P++
Subjt:  CQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLA

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] (e value: 1.2e-49; % identity: 38.89)
Query:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK
        F  S+    A   +   S + + S     R RG SRN ELDR+VN+HGRI IEI +E+ KPVC  AT+FSNAIGTI R + P+R   W DV  E++D + 
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMK

Query:  NKLL---------------------------NHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPE
        ++LL                           ++F+ D+ +  V +++   +Q+TFKEYR DL +HYR+   P+E RA P K ++D  DW+ LC+RWE+PE
Subjt:  NKLL---------------------------NHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAKPHKHVSDIADWHFLCDRWESPE

Query:  CQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSS
         +K   TNKK+R K+P+   T SKSF+ V  + K K   +V  ++LF+Q+ + E   WVN  A +A+ +M+ L E S QE   P++   + + VL   S 
Subjt:  CQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSK-SVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSS

Query:  QVKGMG
         +KG+G
Subjt:  QVKGMG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T4P0 CACTA en-spm transposon protein (e value: 1.9e-45; % identity: 39.87)
Query:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD
        FE+   N  A  +++V  +T E+S        R R  SR  EL+R+V ++GRIP+ I    EKP+   A RFS AIG  VR++FPVR  KW DV  E  +
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD

Query:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS
         +K  L   F LD +   ++RF+ H++ +TFKE+R D  +H++K   PEE RA  P+  V    DWHFLCD + S   Q+   TNK  R+K P+N  + S
Subjt:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS

Query:  KSFM---HVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE
        KSF+   H L +++ + V+   +ELF++T  + G  +V+ AA +AH+QM ELQ +   EGS PL+ED I   VL       KG+GWG + K ++ +S   
Subjt:  KSFM---HVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE

Query:  T
        +
Subjt:  T

A0A5D3C0E2 CACTA en-spm transposon protein (e value: 1.9e-45; % identity: 39.87)
Query:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD
        FE+   N  A  +++V  +T E+S        R R  SR  EL+R+V ++GRIP+ I    EKP+   A RFS AIG  VR++FPVR  KW DV  E  +
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD

Query:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS
         +K  L   F LD +   ++RF+ H++ +TFKE+R D  +H++K   PEE RA  P+  V    DWHFLCD + S   Q+   TNK  R+K P+N  + S
Subjt:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS

Query:  KSFM---HVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE
        KSF+   H L +++ + V+   +ELF++T  + G  +V+ AA +AH+QM ELQ +   EGS PL+ED I   VL       KG+GWG + K ++ +S   
Subjt:  KSFM---HVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE

Query:  T
        +
Subjt:  T

A0A5D3CZD0 CACTA en-spm transposon protein (e value: 1.1e-45; % identity: 40.53)
Query:  FENSAPNASANPTTTVVSSTGETSDSS--ACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDH
        FE+   N  A  +++V  +TG +S  +    R R  SR  EL+R+V ++GRIP+ I    EKP+   A RFS AIG  VR++FPVR  KW DV  E  + 
Subjt:  FENSAPNASANPTTTVVSSTGETSDSS--ACRVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDH

Query:  MKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSK
        +K  L   F LD +   ++RF+ H++ +TFKE+R D  +H++K   PEE RA  P+  V    DWHFLCD + S   Q+   TNK  R+K P+N  + SK
Subjt:  MKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSK

Query:  SFMHVLQKQ----KSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE
        SF   LQ+Q    + +   V  +ELF++T  + G  +V+ AA +AH+QM ELQ + T EGS PL+ED I   VL       KG+GWG + K ++ +S   
Subjt:  SFMHVLQKQ----KSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE

Query:  T
        +
Subjt:  T

A0A5D3DB16 CACTA en-spm transposon protein (e value: 1.9e-45; % identity: 39.87)
Query:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD
        FE+   N  A  +++V  +T E+S        R R  SR  EL+R+V ++GRIP+ I    EKP+   A RFS AIG  VR++FPVR  KW DV  E  +
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD

Query:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS
         +K  L   F LD +   ++RF+ H++ +TFKE+R D  +H++K   PEE RA  P+  V    DWHFLCD + S   Q+   TNK  R+K P+N  + S
Subjt:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS

Query:  KSFM---HVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE
        KSF+   H L +++ + V+   +ELF++T  + G  +V+ AA +AH+QM ELQ +   EGS PL+ED I   VL       KG+GWG + K ++ +S   
Subjt:  KSFM---HVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEE

Query:  T
        +
Subjt:  T

A0A5D3E5D1 CACTA en-spm transposon protein (e value: 1.5e-45; % identity: 40.4)
Query:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD
        FE+   N  A  +++V  +T E+S        R R  SR  EL+R+V ++GRIP+ I    EKP+   A RFS AIG  VR++FPVR  KW DV  E  +
Subjt:  FENSAPNASANPTTTVVSSTGETSDSSAC---RVRGFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKD

Query:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS
         +K  L   F LD +   ++RF+ H++ +TFKE+R D  +H++K   PEE RA  P+  V    DWHFLCD + S   Q+   TNK  R+K P+N  + S
Subjt:  HMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL-PPEERRAK-PHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVS

Query:  KSFMHVLQKQ----KSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTE
        KSF   LQ+Q    + +   V  +ELF++T  + G  +V+ AA +AH+QM ELQ + T EGS PL+ED I   VL       KG+GWG + K ++ +S  
Subjt:  KSFMHVLQKQ----KSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEGSVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTE

Query:  ET
         +
Subjt:  ET

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAGATGAGGACGTTAAACCTTCTCTTGAGATTAATTCGAGTCTTGATGAGAGTTCTGACTCAGATCATGATAGTGATTAATACTTTGAGTAATTATGTGTTTCGTTT
ATTACTTTATAAACTTTATAATTTTTTTTTATCATTATTCAATTATTTATATCACTCATATGTTTCATGTGTGCTAGAGATAATGGTACGCTCTGTTGAATTAGGAGATG
AGACGACTAATCTTTTTGAAAATAGCGCCCCCAATGCTAGTGCTAACCCTACCACCACAGTGGTGTCTTCTACGGGCGAGACTTCAGATAGTTCTGCCTGTAGAGTAAGA
GGATTCTCAAGGAACAAAGAGCTTGATAGATATGTTAACGTCCATGGGAGAATTCCAATAGAAATAACTGATGAATTGGAGAAGCCTGTGTGCGATTTTGCCACAAGATT
TAGCAATGCAATTGGTACGATAGTTCGAGAATCGTTTCCAGTACGTATGACGAAGTGGAAAGATGTGCCTACAGAACTGAAGGACCATATGAAAAACAAACTTTTGAACC
ATTTCGAGCTAGATTTATCTCAACCTGTCGTCCATAGGTTTATTAATCATGAGATACAAAGTACTTTTAAGGAGTATAGGAGAGACTTGGTTAGACACTATAGAAAATTG
CCTCCTGAAGAAAGACGTGCAAAACCACATAAACATGTGTCTGACATTGCTGATTGGCATTTCTTATGTGACAGGTGGGAGTCTCCTGAATGTCAGAAAACTATGGGGAC
AAATAAAAAGAACCGCAAGAAGCTGCCATGGAACGATTGCACTGTGTCTAAGTCCTTCATGCACGTACTGCAAAAACAGAAATCAAAAAGTGTTGAAGTAGGAACCATTG
AGTTGTTCAAACAAACTAGGTACAAAGAAGGCAAATGGTGGGTGAATCCAGCAGCTGGTAATGCTCATAGTCAAATGAGAGAGTTGCAAGAGAAATCCACTCAAGAAGGG
TCTGTACCACTTGCTGAGGATGCAATTGTCCAAACTGTTTTGGACAGTGGATCGAGCCAAGTCAAAGGCATGGGTTGGGGGCAAAGGTTGAAACGACAGAAACATTCATC
TACTGAAGAGACCCAACGAGTAGGGGTGTGCAGAAACCGGTCCGAACCGACGAACCGGGCCGAACCAGACCGAACCGACGGTAATGGTTCGGTTTTTTTGGACAACATCG
GTTTTTCGGTTCTACTGTAG
mRNA sequence
CAAAGGGTTACCAAGCATGCTCCATTTGTAAAGAAGATATGTCATCTTTTAGGATAAGAGGTAAGATTTCTTATATGGGGCACCGACATTATCTTCCAGCGGGTCATAGT
TGGCGCAAGAGTAAGCAATTTGACGGAAAGTCAGAACTTACATCGGGTCTCATCTCCATTGTAATGGATGGAGATGAGATTTTACAAGAAATTTACATGCTCAATTTTCC
TGTGCTCAGTAAACATCCAACAAAGAAAAATAAGAAAAGGAAACAAACATATCTTAATTGGAAGGAAAAAAATATTTTCTTTGAACTCCCTTATTGGTCAAAACTTATGC
TAAGACATAAATTTGATGTAATGCACATTGAAAAAAACATATGCGATAACTTGGTTGACACATTGTTAAATATTGAAGGAAAAACCAAGGACACGACAAACGCACGATTG
GATCTAGAGAATCTGAATATACGGAAAGATCTACACTTGCAAAAGTTAGGCAACATGGTCGTAAAGCCACATGCTGAATACACATTGCCTACTAAGGAAAGGATCGATTT
CTGTAAATTTTTGAAATCGGTGAAATTCCCTGATGGATTTGTGTCGAACATATCACGATGTGTAAGTGTCAATGATGGAAAACTATGGAGATTGAAAACTCATGACTCTC
ATGTTCTGCTTCAGCGACTTCTCCCTATTGGTGTTCGAGGGTATTTACCTAAAGATGTGTGTACTACTATTGTTGAGCTATGCACATTCTTTCGTGATTTATGTGCGAAA
ATGACACATATTAGTGATTTGGATCAATTACAATCAGATATTATAGTGATACTTTGTAAGTTGGAAAAACTATTTCCGCCAGCATTTTTCGATGTAATGGTGCATCTTGC
AGTTCACTTGCCATATGAAACCCGAGTTGTGGACCCTGTTAGCTACAGTTGGATGTACCCTATTGAGCGAAGTCTACAAACATTGAAACAGTACGTGCGAAACAAAAACG
CGTCCAGTGGGTTCCATAGCAGAAACATTTGTAATGAACGAATAAGTTAATTTTTGTGCATTGTATCTAAGTGGAATTGAAACAAGATTTAATAGAGAGGAGCGAAATGA
TGACCAAATTCCCATGAACGAGATCTGGGGTGAGTTTGAAGTATTTAGACAAAGTGCAAGACCATTGGGAGGTGCAACATCAAGAACCTTATTAACTAAGGAGAAACAGA
TAACACATTGGTATATTTTGAACAATTGTGACGAAATAAAGTCATATCGTATGTACGTCTCGAATACCCTAAGTGTTAATCTCAGTTTGTATATATACAAAGTGAGATAA
ATTTGATGTGTTATTTTGTTTATCTTTTTAATAGACAACATTTGCAATTAATTCGTCCTGAAGTTCAAACAACCAAAGATTTATATAAAAGACACCAACTCCAATTTCCT
ACTTGGTTCAAATCTCATGTAAGTTCCATGTTATAAGAGTTTATAAATTTAATTTCAATCATATTAATTTATAACGTATATAACTGAATTGACAGGTTCTCTCATTGCGT
GAGAGTGAAAATATGTCTGATGATTTGTACTCAATAGCGATGAGACCTAGTTCTCAAGTGTGTTCTTATAGTGGATGTATTGTTAATAGAAAGCGTTTTCACACAATAAT
GCGGGATAATCGTCGGGCTATACAGAATAGTGGAGTTTTAGTGTTTGGAGAAAGTGAGAGTCAGAGTGATGAATCGAATTTCTACGGTGTATTGATTGAGGTGTTGGACT
TAGAGTATGTGAAAAGAAGACGTGTTATGATCTTTAAATGTAAATGGTTTGACACCGACAGTAAAAAGAAAATATCACATTTCGATTTAGGGTTGAGATCAATCAATACA
TCACACTACTGGTATGTTGATGACCCTTTTATCCTTGCTACACAAGCACAACAAGCATTTTACATTGACGATCCAAAATATGGTAATAACTGGAAGGTGGTGCAAGTGGT
TCAGAATAAGCGATTGTGGGACATACCAGAGATAGAAGATATTGCAAATGATCAACTTGAATTGACAAACGTTGAGGGTGGAATAAGAGTTGATGAATCAATTCAGGAGA
CCGCATTGTGTAGGGTCGATATTGATCCCACCATTGTGGAAGGAAACAAGAGTAGAGGGATATTATCAAATCATGATGATGATTTCATAAACGATGAAGATGAGGACGTT
AAACCTTCTCTTGAGATTAATTCGAGTCTTGATGAGAGTTCTGACTCAGATCATGATAGTGATTAATACTTTGAGTAATTATGTGTTTCGTTTATTACTTTATAAACTTT
ATAATTTTTTTTTATCATTATTCAATTATTTATATCACTCATATGTTTCATGTGTGCTAGAGATAATGGTACGCTCTGTTGAATTAGGAGATGAGACGACTAATCTTTTT
GAAAATAGCGCCCCCAATGCTAGTGCTAACCCTACCACCACAGTGGTGTCTTCTACGGGCGAGACTTCAGATAGTTCTGCCTGTAGAGTAAGAGGATTCTCAAGGAACAA
AGAGCTTGATAGATATGTTAACGTCCATGGGAGAATTCCAATAGAAATAACTGATGAATTGGAGAAGCCTGTGTGCGATTTTGCCACAAGATTTAGCAATGCAATTGGTA
CGATAGTTCGAGAATCGTTTCCAGTACGTATGACGAAGTGGAAAGATGTGCCTACAGAACTGAAGGACCATATGAAAAACAAACTTTTGAACCATTTCGAGCTAGATTTA
TCTCAACCTGTCGTCCATAGGTTTATTAATCATGAGATACAAAGTACTTTTAAGGAGTATAGGAGAGACTTGGTTAGACACTATAGAAAATTGCCTCCTGAAGAAAGACG
TGCAAAACCACATAAACATGTGTCTGACATTGCTGATTGGCATTTCTTATGTGACAGGTGGGAGTCTCCTGAATGTCAGAAAACTATGGGGACAAATAAAAAGAACCGCA
AGAAGCTGCCATGGAACGATTGCACTGTGTCTAAGTCCTTCATGCACGTACTGCAAAAACAGAAATCAAAAAGTGTTGAAGTAGGAACCATTGAGTTGTTCAAACAAACT
AGGTACAAAGAAGGCAAATGGTGGGTGAATCCAGCAGCTGGTAATGCTCATAGTCAAATGAGAGAGTTGCAAGAGAAATCCACTCAAGAAGGGTCTGTACCACTTGCTGA
GGATGCAATTGTCCAAACTGTTTTGGACAGTGGATCGAGCCAAGTCAAAGGCATGGGTTGGGGGCAAAGGTTGAAACGACAGAAACATTCATCTACTGAAGAGACCCAAC
GAGTAGGGGTGTGCAGAAACCGGTCCGAACCGACGAACCGGGCCGAACCAGACCGAACCGACGGTAATGGTTCGGTTTTTTTGGACAACATCGGTTTTTCGGTTCTACTG
TAG
Protein sequence
MKMRTLNLLLRLIRVLMRVLTQIMIVINTLSNYVFRLLLYKLYNFFLSLFNYLYHSYVSCVLEIMVRSVELGDETTNLFENSAPNASANPTTTVVSSTGETSDSSACRVR
GFSRNKELDRYVNVHGRIPIEITDELEKPVCDFATRFSNAIGTIVRESFPVRMTKWKDVPTELKDHMKNKLLNHFELDLSQPVVHRFINHEIQSTFKEYRRDLVRHYRKL
PPEERRAKPHKHVSDIADWHFLCDRWESPECQKTMGTNKKNRKKLPWNDCTVSKSFMHVLQKQKSKSVEVGTIELFKQTRYKEGKWWVNPAAGNAHSQMRELQEKSTQEG
SVPLAEDAIVQTVLDSGSSQVKGMGWGQRLKRQKHSSTEETQRVGVCRNRSEPTNRAEPDRTDGNGSVFLDNIGFSVLL
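
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The snippet below is a minimal sketch of that check, assuming the standard nuclear genetic code applies to this gene and that the CDS is given in frame from its first base; `translate` is a hypothetical helper written for this illustration, not a function of any CuGenDB tool.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons with unexpected
    characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 24 bases of the CDS shown on this page.
    print(translate("ATGAAGATGAGGACGTTAAACCTT"))
```

Run on the first 24 bases of the CDS, this yields MKMRTLNL, which agrees with the start of the protein sequence shown above. Pasting the full CDS (line breaks included) into `translate` allows the same comparison over the whole sequence.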