; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008071 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008071
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr9:11405402..11407045
RNA-Seq ExpressionLag0008071
SyntenyLag0008071
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021757945.1 uncharacterized protein LOC110722981 [Chenopodium quinoa]1.8e-2930.61Show/hide
Query:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRS-LGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLF
        D+L++ WEKF LT +E  + GDV  N     S      +L+GKL   +  + E M+K     W +   + +  +  NLF+F  + E+++ +V+   PW F
Subjt:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRS-LGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLF

Query:  DKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSL
        D  +++L       +P    F     W+  Y++P +  N  +   + N +  F++ D+      GW E +RIRV LD++KPLRWG+ V +     + W  
Subjt:  DKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSL

Query:  IRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQ-RHQYGMWLQ
        ++YE+LS+ C +C I+ H  + C+   M G   +   +QYG WL+
Subjt:  IRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQ-RHQYGMWLQ

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]4.5e-4436.22Show/hide
Query:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP
        MA  +L+E W+ F LT+EE+ +  D+D + +  T + L  SLI KL++ R IS  V++   K AW +     +V+ +G N+FLF      ++ ++LR  P
Subjt:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP

Query:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC
        W FD+ ++++  P+ + KP   +F  V+ W+HF++L +   N +MA RL NA+G F D ++       W   LR+RV+ D+ KPL  G+K+ LD PMG C
Subjt:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC

Query:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCTNN
        W  I+YE+L +    C  + HI ++C+   +   S S+  QYG WL++ G  N+
Subjt:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCTNN

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]2.8e-5442.11Show/hide
Query:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD
        ++L+ +W+KF LT+EE+++  DVD + V +  + L +SL+GKL+A R+IS +V+ +    AW +   LTVE +G NLFLF    E +  +V++  PW FD
Subjt:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD

Query:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLI
        K +++L KP         EF  V FW+H ++LPM   N +MA RL NA+G FVD D    +   W  SLRIRV +DI+KPLR G+K+ +D PMG CW  I
Subjt:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLI

Query:  RYEKLSELCSFCSIIGHIAQNCNS-FIMGGSSSSQRHQYGMWLQYIG
        +YE+L + C FC +IGH + +C++ ++     S    +YG WL+++G
Subjt:  RYEKLSELCSFCSIIGHIAQNCNS-FIMGGSSSSQRHQYGMWLQYIG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.5e-4336.97Show/hide
Query:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP
        MA  DL+E W+ F LT+EEE+   DVD +  A T   L   L+GKL   R I+  VM+   + AW +      V+ LG NLFLF+     ++ K+ +  P
Subjt:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP

Query:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC
        W FD+ +++++KP+ ++ P   +F  +  W+ F++LP+      MA RL NA+G F + D        W  +LR+RV LDISKPLR G+K+ LD P+G  
Subjt:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC

Query:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCT-------------------NNFFPSPSTSPLG
        W  I+YE+L + C  C               G SSS ++HQYG WL+Y G                     NN F S STSP+G
Subjt:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCT-------------------NNFFPSPSTSPLG

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]1.4e-2934.27Show/hide
Query:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD
        D L++     +LT+EE+ +      +   +  +S    L+GKL+  R  + E M+    + W    G+ V  +G NLF+F      ++ +VL   PW FD
Subjt:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD

Query:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFV--DYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLD--EPMGSC
        K +LML +  P V+P   +   V FW+H   LP+ L N  + E + NAVG+F+  DY++GG     W  ++RIRV LD+ KPLR G+K+ L   EP+   
Subjt:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFV--DYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLD--EPMGSC

Query:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSS-QRHQYGMWLQ
        W   +YE+L   C FC  +GH  + C+  +     S     QYG WL+
Subjt:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSS-QRHQYGMWLQ

TrEMBL top hitse value%identityAlignment
A0A6J1BSZ1 uncharacterized protein LOC1110054812.2e-4436.22Show/hide
Query:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP
        MA  +L+E W+ F LT+EE+ +  D+D + +  T + L  SLI KL++ R IS  V++   K AW +     +V+ +G N+FLF      ++ ++LR  P
Subjt:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP

Query:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC
        W FD+ ++++  P+ + KP   +F  V+ W+HF++L +   N +MA RL NA+G F D ++       W   LR+RV+ D+ KPL  G+K+ LD PMG C
Subjt:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC

Query:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCTNN
        W  I+YE+L +    C  + HI ++C+   +   S S+  QYG WL++ G  N+
Subjt:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCTNN

A0A6J1D765 uncharacterized protein LOC1110179024.1e-2729.8Show/hide
Query:  LDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLF
        +D++ + WE F  T +E +    +DR    +T+ ++   ++ KL   + IS E +R   K+ W +      E LG+N+++    +  E+++VL   PW F
Subjt:  LDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLF

Query:  DKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWK-ESLRIRVQLDISKPLRWGVKVKLDEPMGSCWS
        +K +L+L+ P    +P    F    FW+  + +P +  ++ MA  L   +G   + +  G    GW    +R+RV++D+SKPLR G+K+K  +     W 
Subjt:  DKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWK-ESLRIRVQLDISKPLRWGVKVKLDEPMGSCWS

Query:  LIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQ
         +RYEKL + C  C  IGH  + C       +++S   QYG WL+
Subjt:  LIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQ

A0A6J1DU55 uncharacterized protein LOC1110231351.4e-5442.11Show/hide
Query:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD
        ++L+ +W+KF LT+EE+++  DVD + V +  + L +SL+GKL+A R+IS +V+ +    AW +   LTVE +G NLFLF    E +  +V++  PW FD
Subjt:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD

Query:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLI
        K +++L KP         EF  V FW+H ++LPM   N +MA RL NA+G FVD D    +   W  SLRIRV +DI+KPLR G+K+ +D PMG CW  I
Subjt:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLI

Query:  RYEKLSELCSFCSIIGHIAQNCNS-FIMGGSSSSQRHQYGMWLQYIG
        +YE+L + C FC +IGH + +C++ ++     S    +YG WL+++G
Subjt:  RYEKLSELCSFCSIIGHIAQNCNS-FIMGGSSSSQRHQYGMWLQYIG

A0A6J1DX30 uncharacterized protein LOC1110248743.1e-4336.97Show/hide
Query:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP
        MA  DL+E W+ F LT+EEE+   DVD +  A T   L   L+GKL   R I+  VM+   + AW +      V+ LG NLFLF+     ++ K+ +  P
Subjt:  MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIP-TGLTVEKLGLNLFLFTLLTEEEQTKVLRQEP

Query:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC
        W FD+ +++++KP+ ++ P   +F  +  W+ F++LP+      MA RL NA+G F + D        W  +LR+RV LDISKPLR G+K+ LD P+G  
Subjt:  WLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSC

Query:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCT-------------------NNFFPSPSTSPLG
        W  I+YE+L + C  C               G SSS ++HQYG WL+Y G                     NN F S STSP+G
Subjt:  WSLIRYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQYIGCT-------------------NNFFPSPSTSPLG

A0A803M2M9 Uncharacterized protein4.9e-2829.22Show/hide
Query:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD
        ++L   WEK  +T +EED+   +++N V  +  ++  SL+GK+++ R  + E M+   K  W++  G+   K+  NLF+      +++ K+L  EPW+FD
Subjt:  DDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFD

Query:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLI
          +++L +     +P++ +  L  FW+  Y LP+D  ++   + +A  VG  ++ ++      GW  S R R+ LD +KP+R   K++  E + + +   
Subjt:  KFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLI

Query:  RYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQ
        +YE+L  LC  C I+GH  ++C +    G    +  Q+GMWL+
Subjt:  RYEKLSELCSFCSIIGHIAQNCNSFIMGGSSSSQRHQYGMWLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein8.5e-0927.14Show/hide
Query:  FLFTLLTEEEQTKVLRQEPWLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDI
        F+FTL  EE    VLR+ PW F+ ++++L +     +PQ   F  + FW+    +P    N  + E +  A+G+ +D D          +  R+ +  DI
Subjt:  FLFTLLTEEEQTKVLRQEPWLFDKFILMLSKPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDI

Query:  SKPLRWGVKVKLDEPMGSCWSLIRYEKLSELCSFCSIIGH
        + PLR+    +    + +     RYE+L   C  C ++ H
Subjt:  SKPLRWGVKVKLDEPMGSCWSLIRYEKLSELCSFCSIIGH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTTGACGATTTGATGGAAAATTGGGAGAAATTTAATCTTACAGCTGAAGAGGAGGATATGGAAGGGGATGTTGATCGAAATGTTGTTGCTGTTACGAGCCGATC
TTTGGGATTTAGTCTCATTGGTAAGCTTATTGCGCCTCGTGTTATTTCTGGTGAGGTAATGAGGAAAAATTTCAAGGCTGCCTGGAATATCCCAACGGGTCTGACCGTCG
AGAAGCTGGGGCTTAATTTATTTCTATTTACGTTGCTGACAGAGGAGGAACAAACCAAAGTTCTTCGACAAGAACCATGGTTGTTTGACAAGTTCATTCTGATGTTATCC
AAACCGATTCCAATGGTTAAACCTCAAGCAACGGAATTCATGCTGGTGAATTTCTGGTTACACTTCTATGAACTTCCTATGGACCTTTACAATTCATCCATGGCGGAAAG
GCTCGCGAACGCTGTGGGTAAGTTTGTTGATTACGACAATGGGGGTAGGAGGAGACACGGATGGAAGGAGAGCCTCCGAATTCGAGTGCAGCTGGATATATCAAAACCCC
TTCGATGGGGCGTTAAGGTGAAGCTTGACGAACCGATGGGAAGTTGCTGGTCTCTGATTCGGTATGAAAAATTATCGGAATTATGCTCTTTCTGCAGTATAATTGGCCAC
ATTGCCCAGAATTGTAATTCTTTTATTATGGGCGGGAGTTCATCATCACAACGACATCAATATGGCATGTGGTTACAGTATATTGGGTGCACTAACAATTTCTTCCCTTC
TCCGAGCACAAGTCCGCTGGGACAGAATAAAATTATGTTTGAGCCTCAACCAGGAGACAATTCTCCTCAGGCAGCGATCGTGGGTGCGATTGGCCAGAGTTTAGCAGGGG
ATTCTTTGAAGCAAGATTCCGGCACCAAACCGATGGATATTTCACCGATGATGGATGAAGCGATTACTTTTCCTCTTCCGGCGGCAATACAGGGCAGCGATAATGGCATT
AATGGTGGATTAAATGCAGAGGTTTCAACGGTGAAGAAGAAGCTGTGTTTCGGAGATGTTACGAAAGTTTTGCCTAAATCACTGGATTCAACGGCCCAAACTCAAGATTT
TATTCCTTTAACAATTAATGTGGAAAATGGGCCTAAGGGGGATATGCTCGGTGCGGACCTGTCTACCAAGAATGGGCCAACAAGAGTTGGGCTACCTGAGGAATCAAAGA
ATCTGAATAGACCTGGAGTGTACTCCCTTAGTGGTTTCTCAGTCAAAGAAGAAAGCAAGAGGAGCAATGTCTACAAGTGGAATTTCAATTCCATGACCCCTCAAATGGGT
CCATTTACATCGGCCAGTAAGGCTGGATCTGGTATAAATGGACCAGCCCAATCAGACCTTGGCGTGAATGATCAAAGTTCGGTGCTGAATCTGACCCTGAAGCAAGTGGT
GATGGGTCTTCCTAATTCCAAGTCCAGGAAACGAAGGGCTCATTTTGATATTTTTGAAGCGGGCATCAATTCTAGTCTGGAAGTTTTTAAGAAGCGAGTGGGAGATCGTC
TTGATGGTGGTAACAAGAAACGGCCTAGGGTGGAAGATGAGGATGATGCAAACCATGATGAAGCATCGGCGGAGGCTGGTAATCAGCCCCGTCGGAAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTTGACGATTTGATGGAAAATTGGGAGAAATTTAATCTTACAGCTGAAGAGGAGGATATGGAAGGGGATGTTGATCGAAATGTTGTTGCTGTTACGAGCCGATC
TTTGGGATTTAGTCTCATTGGTAAGCTTATTGCGCCTCGTGTTATTTCTGGTGAGGTAATGAGGAAAAATTTCAAGGCTGCCTGGAATATCCCAACGGGTCTGACCGTCG
AGAAGCTGGGGCTTAATTTATTTCTATTTACGTTGCTGACAGAGGAGGAACAAACCAAAGTTCTTCGACAAGAACCATGGTTGTTTGACAAGTTCATTCTGATGTTATCC
AAACCGATTCCAATGGTTAAACCTCAAGCAACGGAATTCATGCTGGTGAATTTCTGGTTACACTTCTATGAACTTCCTATGGACCTTTACAATTCATCCATGGCGGAAAG
GCTCGCGAACGCTGTGGGTAAGTTTGTTGATTACGACAATGGGGGTAGGAGGAGACACGGATGGAAGGAGAGCCTCCGAATTCGAGTGCAGCTGGATATATCAAAACCCC
TTCGATGGGGCGTTAAGGTGAAGCTTGACGAACCGATGGGAAGTTGCTGGTCTCTGATTCGGTATGAAAAATTATCGGAATTATGCTCTTTCTGCAGTATAATTGGCCAC
ATTGCCCAGAATTGTAATTCTTTTATTATGGGCGGGAGTTCATCATCACAACGACATCAATATGGCATGTGGTTACAGTATATTGGGTGCACTAACAATTTCTTCCCTTC
TCCGAGCACAAGTCCGCTGGGACAGAATAAAATTATGTTTGAGCCTCAACCAGGAGACAATTCTCCTCAGGCAGCGATCGTGGGTGCGATTGGCCAGAGTTTAGCAGGGG
ATTCTTTGAAGCAAGATTCCGGCACCAAACCGATGGATATTTCACCGATGATGGATGAAGCGATTACTTTTCCTCTTCCGGCGGCAATACAGGGCAGCGATAATGGCATT
AATGGTGGATTAAATGCAGAGGTTTCAACGGTGAAGAAGAAGCTGTGTTTCGGAGATGTTACGAAAGTTTTGCCTAAATCACTGGATTCAACGGCCCAAACTCAAGATTT
TATTCCTTTAACAATTAATGTGGAAAATGGGCCTAAGGGGGATATGCTCGGTGCGGACCTGTCTACCAAGAATGGGCCAACAAGAGTTGGGCTACCTGAGGAATCAAAGA
ATCTGAATAGACCTGGAGTGTACTCCCTTAGTGGTTTCTCAGTCAAAGAAGAAAGCAAGAGGAGCAATGTCTACAAGTGGAATTTCAATTCCATGACCCCTCAAATGGGT
CCATTTACATCGGCCAGTAAGGCTGGATCTGGTATAAATGGACCAGCCCAATCAGACCTTGGCGTGAATGATCAAAGTTCGGTGCTGAATCTGACCCTGAAGCAAGTGGT
GATGGGTCTTCCTAATTCCAAGTCCAGGAAACGAAGGGCTCATTTTGATATTTTTGAAGCGGGCATCAATTCTAGTCTGGAAGTTTTTAAGAAGCGAGTGGGAGATCGTC
TTGATGGTGGTAACAAGAAACGGCCTAGGGTGGAAGATGAGGATGATGCAAACCATGATGAAGCATCGGCGGAGGCTGGTAATCAGCCCCGTCGGAAGCCATGA
Protein sequenceShow/hide protein sequence
MALDDLMENWEKFNLTAEEEDMEGDVDRNVVAVTSRSLGFSLIGKLIAPRVISGEVMRKNFKAAWNIPTGLTVEKLGLNLFLFTLLTEEEQTKVLRQEPWLFDKFILMLS
KPIPMVKPQATEFMLVNFWLHFYELPMDLYNSSMAERLANAVGKFVDYDNGGRRRHGWKESLRIRVQLDISKPLRWGVKVKLDEPMGSCWSLIRYEKLSELCSFCSIIGH
IAQNCNSFIMGGSSSSQRHQYGMWLQYIGCTNNFFPSPSTSPLGQNKIMFEPQPGDNSPQAAIVGAIGQSLAGDSLKQDSGTKPMDISPMMDEAITFPLPAAIQGSDNGI
NGGLNAEVSTVKKKLCFGDVTKVLPKSLDSTAQTQDFIPLTINVENGPKGDMLGADLSTKNGPTRVGLPEESKNLNRPGVYSLSGFSVKEESKRSNVYKWNFNSMTPQMG
PFTSASKAGSGINGPAQSDLGVNDQSSVLNLTLKQVVMGLPNSKSRKRRAHFDIFEAGINSSLEVFKKRVGDRLDGGNKKRPRVEDEDDANHDEASAEAGNQPRRKP