; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041299 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041299
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr13:15233765..15236134
RNA-Seq ExpressionLag0041299
SyntenyLag0041299
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]1.5e-2029.32Show/hide
Query:  LKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKAV
        + +L E    EDE A+V+ + ED +    + +   ++ K+ T KK+  + F   + +IW Q  Q  +   G N FM  F N   +  +   GPW F K++
Subjt:  LKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKAV

Query:  ILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD
        I++E+P       ++ F    FW+  H +P  C  +++   +   +G+V  V++  ES + WG  +R+KVQ+D+T PLKR +  ++ K ++
Subjt:  ILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD

TXG53848.1 hypothetical protein EZV62_019104 [Acer yangbiense]2.5e-2028.64Show/hide
Query:  VLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKA
        ++K  E L +  DE  +++ + E++    +  +   ++ KI + KK+  + FI  + ++W    +  I + G NIFM  F N   +  I + GPW+FDK+
Subjt:  VLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKA

Query:  VILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD---DGNLP
        +I++E+         + F  V  WI  H +P  C  R +A  +   +G+V  +D+  ES+  WG  +++KVQID++ PLKR +  ++ K ++     +  
Subjt:  VILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD---DGNLP

Query:  YGPWIR
        +G W+R
Subjt:  YGPWIR

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]6.1e-2225.96Show/hide
Query:  DGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIW-GQEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFD
        D + +K E+L + +D+   +  ++       +Q L+ +++ K  T K I  + F S +S IW  + + T+   G NIF  +F+N   ++ I E GPW FD
Subjt:  DGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIW-GQEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFD

Query:  KAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDD-----
        K ++++ E S      ++ FRYV FWI  H LP AC  R+    +G L+G+V+++D  E  E + G  +RI+V IDV  PLKR +   +  GDD+     
Subjt:  KAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDD-----

Query:  ----GNLP--------YGPWIRE-PYKTK-IWPKNPFSTPPVFQQGRGRGRNGMGGRGKWRHEDVEDGGDELLDRSHEGKSEINTSDKGQMETEAQISVA
              LP         G  +R+ P  TK I   + F   P  +        G G +        E G  + L+        +  S K  M  ++ + + 
Subjt:  ----GNLP--------YGPWIRE-PYKTK-IWPKNPFSTPPVFQQGRGRGRNGMGGRGKWRHEDVEDGGDELLDRSHEGKSEINTSDKGQMETEAQISVA

Query:  EISNTTAIFDPANGETVQKMVKEIDKYPEISARNSMEIKEATASGKVNVESKLNSVVKAKSVGKENVINVAEECNNDLKETSLVDMEICGDEQMVGSNIQ
        +            GE       ++D   E+ + N++E K  TA  +  V SK   +  A+S  KE +   + + +  ++ T  V   + G+         
Subjt:  EISNTTAIFDPANGETVQKMVKEIDKYPEISARNSMEIKEATASGKVNVESKLNSVVKAKSVGKENVINVAEECNNDLKETSLVDMEICGDEQMVGSNIQ

Query:  SSGGPCKDLTPNNKGK-------GILKEGNAAIGKNKTWKRIVRHKEGDEETEPIIQGNKIIVGGKHKIG
         S      +T   + K       G + EGN ++GK          K+GD + E      KI V    +IG
Subjt:  SSGGPCKDLTPNNKGK-------GILKEGNAAIGKNKTWKRIVRHKEGDEETEPIIQGNKIIVGGKHKIG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.5e-2030.05Show/hide
Query:  ALFDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQEQTT--IVNAGFNIFMCKFKNAHIKRWIKESGP
        A FD +L++ +  K+T +E  +   +     AT+  RL   ++ K+F  + I   +  + M   W  E     + + G+N+F+  F  A  +  I +SGP
Subjt:  ALFDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQEQTT--IVNAGFNIFMCKFKNAHIKRWIKESGP

Query:  WFFDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDDG
        W FD+ ++L+ +P A     E+DF  +  W+ F  LP  C  R  A  +G+ LG  E+ D D+ +   WG +LR++V +D++ PL+R I   +     DG
Subjt:  WFFDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDDG

Query:  NLPYGPWIREPYK
         +  G WI   Y+
Subjt:  NLPYGPWIREPYK

XP_042952130.1 uncharacterized protein At4g02000-like [Carya illinoinensis]4.3e-2030.27Show/hide
Query:  DGVLKKLEELKVTEDERASVYHLQE--DELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWG-QEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWF
        + + +K +E+++ ++E+A++    E   EL   +QR    +L K  +P+ I  ++  + ++K+W    +        NIF   F N   K  +    PW 
Subjt:  DGVLKKLEELKVTEDERASVYHLQE--DELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWG-QEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWF

Query:  FDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKR
        FD  +++++E   +T  +++DF   SFW+ FH +P +C   K    IGS +GKVE+VD+ E+    WGC LR+++ +D+T P+ R
Subjt:  FDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKR

TrEMBL top hitse value%identityAlignment
A0A5C7H9Y2 CCHC-type domain-containing protein2.9e-2225.96Show/hide
Query:  DGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIW-GQEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFD
        D + +K E+L + +D+   +  ++       +Q L+ +++ K  T K I  + F S +S IW  + + T+   G NIF  +F+N   ++ I E GPW FD
Subjt:  DGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIW-GQEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFD

Query:  KAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDD-----
        K ++++ E S      ++ FRYV FWI  H LP AC  R+    +G L+G+V+++D  E  E + G  +RI+V IDV  PLKR +   +  GDD+     
Subjt:  KAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDD-----

Query:  ----GNLP--------YGPWIRE-PYKTK-IWPKNPFSTPPVFQQGRGRGRNGMGGRGKWRHEDVEDGGDELLDRSHEGKSEINTSDKGQMETEAQISVA
              LP         G  +R+ P  TK I   + F   P  +        G G +        E G  + L+        +  S K  M  ++ + + 
Subjt:  ----GNLP--------YGPWIRE-PYKTK-IWPKNPFSTPPVFQQGRGRGRNGMGGRGKWRHEDVEDGGDELLDRSHEGKSEINTSDKGQMETEAQISVA

Query:  EISNTTAIFDPANGETVQKMVKEIDKYPEISARNSMEIKEATASGKVNVESKLNSVVKAKSVGKENVINVAEECNNDLKETSLVDMEICGDEQMVGSNIQ
        +            GE       ++D   E+ + N++E K  TA  +  V SK   +  A+S  KE +   + + +  ++ T  V   + G+         
Subjt:  EISNTTAIFDPANGETVQKMVKEIDKYPEISARNSMEIKEATASGKVNVESKLNSVVKAKSVGKENVINVAEECNNDLKETSLVDMEICGDEQMVGSNIQ

Query:  SSGGPCKDLTPNNKGK-------GILKEGNAAIGKNKTWKRIVRHKEGDEETEPIIQGNKIIVGGKHKIG
         S      +T   + K       G + EGN ++GK          K+GD + E      KI V    +IG
Subjt:  SSGGPCKDLTPNNKGK-------GILKEGNAAIGKNKTWKRIVRHKEGDEETEPIIQGNKIIVGGKHKIG

A0A5C7HA62 DUF4283 domain-containing protein1.2e-2028.64Show/hide
Query:  VLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKA
        ++K  E L +  DE  +++ + E++    +  +   ++ KI + KK+  + FI  + ++W    +  I + G NIFM  F N   +  I + GPW+FDK+
Subjt:  VLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKA

Query:  VILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD---DGNLP
        +I++E+         + F  V  WI  H +P  C  R +A  +   +G+V  +D+  ES+  WG  +++KVQID++ PLKR +  ++ K ++     +  
Subjt:  VILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD---DGNLP

Query:  YGPWIR
        +G W+R
Subjt:  YGPWIR

A0A5C7IJL3 CCHC-type domain-containing protein8.0e-2028.57Show/hide
Query:  VLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKA
        + +  E L + +++R  ++ + ED      Q + + ++ K+ + K++  + FIS +  +W    +  I + G N+FM  F+N   +  + + GPW FDK+
Subjt:  VLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQ-EQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKA

Query:  VILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDDGNL
        +I++E P       ++ F    FW+  H +P  C  R+SA  +   +G+V  +++  ES   WG  LR+KV+ID++ PLKR +   +   D  GN+
Subjt:  VILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDDGNL

A0A5C7ITT0 Uncharacterized protein2.7e-2030.77Show/hide
Query:  FDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQEQTTIVNA-GFNIFMCKFKNAHIKRWIKESGPWFF
        ++ + K  E L ++ D       ++ D   TS +++ ++++ KI T K I  + FIS + K+W   +   +NA G NIF+ +F+N+  KR +   GPW F
Subjt:  FDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQEQTTIVNA-GFNIFMCKFKNAHIKRWIKESGPWFF

Query:  DKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD
        + ++I +EEP      ++++F  VSF +  H LP  C  +++  A+G +L +VE++D+    + +    LRI V ID++ PLKR +   +   D+
Subjt:  DKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDD

A0A6J1DX30 uncharacterized protein LOC1110248741.2e-2030.05Show/hide
Query:  ALFDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQEQTT--IVNAGFNIFMCKFKNAHIKRWIKESGP
        A FD +L++ +  K+T +E  +   +     AT+  RL   ++ K+F  + I   +  + M   W  E     + + G+N+F+  F  A  +  I +SGP
Subjt:  ALFDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKIWGQEQTT--IVNAGFNIFMCKFKNAHIKRWIKESGP

Query:  WFFDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDDG
        W FD+ ++L+ +P A     E+DF  +  W+ F  LP  C  R  A  +G+ LG  E+ D D+ +   WG +LR++V +D++ PL+R I   +     DG
Subjt:  WFFDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIKVQIDVTVPLKRVIFFEIWKGDDDG

Query:  NLPYGPWIREPYK
         +  G WI   Y+
Subjt:  NLPYGPWIREPYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGATATGAAGTCATCGAGCACGAAAACGCCGTGGGACGTCATATCTGAAGTCTTCGAGCACGGAAAAGGTTTTCCAGAGGGAGAAATGGGTGGTTTTCCAGTTTG
TGAGGGTTCTGCTTTGCCGGATTCGCCCGCGTTGTTCGATGGAGTCCTGAAGAAATTAGAGGAGCTGAAAGTTACTGAGGACGAGCGGGCAAGTGTATATCACCTGCAAG
AGGATGAATTGGCTACATCAAAGCAGCGATTGACAAATGCTGTATTATGCAAAATATTTACGCCAAAGAAGATATTTCCAAAAATGTTCATATCAAAGATGTCGAAGATT
TGGGGCCAAGAACAAACAACCATTGTTAACGCGGGTTTTAATATATTTATGTGCAAATTTAAAAATGCTCATATCAAGAGATGGATTAAGGAGTCGGGGCCGTGGTTCTT
TGATAAAGCGGTAATTTTAATGGAAGAGCCTAGTGCAGAGACGTGCGCCGAGGAAATGGACTTCAGGTATGTGTCTTTTTGGATTCATTTTCATAAATTACCACATGCTT
GTTTTGCCAGGAAATCAGCCACAGCAATAGGAAGCCTTCTTGGGAAAGTGGAACAAGTGGACATGGATGAGGAATCCGAACAAATGTGGGGATGCTCTTTAAGAATTAAG
GTGCAGATCGACGTTACGGTTCCTTTGAAGCGTGTGATTTTTTTTGAAATCTGGAAAGGAGACGATGATGGTAACCTCCCATATGGACCATGGATACGAGAGCCTTACAA
GACAAAAATTTGGCCTAAAAACCCATTTTCTACCCCTCCGGTGTTTCAACAAGGACGAGGAAGAGGAAGAAATGGGATGGGTGGAAGAGGGAAGTGGCGGCATGAAGATG
TGGAAGATGGTGGGGATGAGCTCCTTGATCGAAGCCATGAAGGTAAATCGGAAATTAACACTTCCGACAAGGGACAAATGGAAACGGAAGCTCAGATCAGTGTGGCGGAG
ATTTCCAATACGACGGCGATTTTCGATCCGGCTAACGGTGAAACGGTCCAAAAGATGGTTAAGGAAATTGACAAATATCCGGAAATCTCGGCTCGGAATAGTATGGAAAT
TAAGGAGGCAACGGCTAGTGGGAAAGTCAATGTGGAATCAAAATTAAATTCGGTTGTGAAGGCTAAATCAGTAGGGAAGGAAAATGTAATTAATGTGGCTGAAGAGTGCA
ATAATGATTTGAAGGAGACCTCTTTAGTGGATATGGAAATTTGTGGGGATGAGCAGATGGTGGGATCCAATATTCAAAGCAGTGGTGGGCCATGTAAAGATCTCACTCCC
AATAATAAAGGAAAAGGAATTCTGAAGGAAGGAAATGCGGCAATTGGCAAAAATAAAACATGGAAGAGGATTGTTAGACACAAAGAGGGAGATGAAGAAACTGAGCCCAT
CATCCAAGGAAATAAAATCATTGTGGGTGGGAAACACAAGATTGGTTTTGAAGAGGAGGATGGCGACGTGAAACGAAGAAAGCAGTTAACTATAAGGGATATTTCAGAAG
CACGATCGGTGGAGGCTGCTGGACAGCCCCGCCGGGCACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGATATGAAGTCATCGAGCACGAAAACGCCGTGGGACGTCATATCTGAAGTCTTCGAGCACGGAAAAGGTTTTCCAGAGGGAGAAATGGGTGGTTTTCCAGTTTG
TGAGGGTTCTGCTTTGCCGGATTCGCCCGCGTTGTTCGATGGAGTCCTGAAGAAATTAGAGGAGCTGAAAGTTACTGAGGACGAGCGGGCAAGTGTATATCACCTGCAAG
AGGATGAATTGGCTACATCAAAGCAGCGATTGACAAATGCTGTATTATGCAAAATATTTACGCCAAAGAAGATATTTCCAAAAATGTTCATATCAAAGATGTCGAAGATT
TGGGGCCAAGAACAAACAACCATTGTTAACGCGGGTTTTAATATATTTATGTGCAAATTTAAAAATGCTCATATCAAGAGATGGATTAAGGAGTCGGGGCCGTGGTTCTT
TGATAAAGCGGTAATTTTAATGGAAGAGCCTAGTGCAGAGACGTGCGCCGAGGAAATGGACTTCAGGTATGTGTCTTTTTGGATTCATTTTCATAAATTACCACATGCTT
GTTTTGCCAGGAAATCAGCCACAGCAATAGGAAGCCTTCTTGGGAAAGTGGAACAAGTGGACATGGATGAGGAATCCGAACAAATGTGGGGATGCTCTTTAAGAATTAAG
GTGCAGATCGACGTTACGGTTCCTTTGAAGCGTGTGATTTTTTTTGAAATCTGGAAAGGAGACGATGATGGTAACCTCCCATATGGACCATGGATACGAGAGCCTTACAA
GACAAAAATTTGGCCTAAAAACCCATTTTCTACCCCTCCGGTGTTTCAACAAGGACGAGGAAGAGGAAGAAATGGGATGGGTGGAAGAGGGAAGTGGCGGCATGAAGATG
TGGAAGATGGTGGGGATGAGCTCCTTGATCGAAGCCATGAAGGTAAATCGGAAATTAACACTTCCGACAAGGGACAAATGGAAACGGAAGCTCAGATCAGTGTGGCGGAG
ATTTCCAATACGACGGCGATTTTCGATCCGGCTAACGGTGAAACGGTCCAAAAGATGGTTAAGGAAATTGACAAATATCCGGAAATCTCGGCTCGGAATAGTATGGAAAT
TAAGGAGGCAACGGCTAGTGGGAAAGTCAATGTGGAATCAAAATTAAATTCGGTTGTGAAGGCTAAATCAGTAGGGAAGGAAAATGTAATTAATGTGGCTGAAGAGTGCA
ATAATGATTTGAAGGAGACCTCTTTAGTGGATATGGAAATTTGTGGGGATGAGCAGATGGTGGGATCCAATATTCAAAGCAGTGGTGGGCCATGTAAAGATCTCACTCCC
AATAATAAAGGAAAAGGAATTCTGAAGGAAGGAAATGCGGCAATTGGCAAAAATAAAACATGGAAGAGGATTGTTAGACACAAAGAGGGAGATGAAGAAACTGAGCCCAT
CATCCAAGGAAATAAAATCATTGTGGGTGGGAAACACAAGATTGGTTTTGAAGAGGAGGATGGCGACGTGAAACGAAGAAAGCAGTTAACTATAAGGGATATTTCAGAAG
CACGATCGGTGGAGGCTGCTGGACAGCCCCGCCGGGCACAATGA
Protein sequenceShow/hide protein sequence
MSDMKSSSTKTPWDVISEVFEHGKGFPEGEMGGFPVCEGSALPDSPALFDGVLKKLEELKVTEDERASVYHLQEDELATSKQRLTNAVLCKIFTPKKIFPKMFISKMSKI
WGQEQTTIVNAGFNIFMCKFKNAHIKRWIKESGPWFFDKAVILMEEPSAETCAEEMDFRYVSFWIHFHKLPHACFARKSATAIGSLLGKVEQVDMDEESEQMWGCSLRIK
VQIDVTVPLKRVIFFEIWKGDDDGNLPYGPWIREPYKTKIWPKNPFSTPPVFQQGRGRGRNGMGGRGKWRHEDVEDGGDELLDRSHEGKSEINTSDKGQMETEAQISVAE
ISNTTAIFDPANGETVQKMVKEIDKYPEISARNSMEIKEATASGKVNVESKLNSVVKAKSVGKENVINVAEECNNDLKETSLVDMEICGDEQMVGSNIQSSGGPCKDLTP
NNKGKGILKEGNAAIGKNKTWKRIVRHKEGDEETEPIIQGNKIIVGGKHKIGFEEEDGDVKRRKQLTIRDISEARSVEAAGQPRRAQ