; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017578 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017578
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:5573394..5574998
RNA-Seq ExpressionLag0017578
SyntenyLag0017578
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.6e-3233.33Show/hide
Query:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES
        MA  +LLE W+ F LT EE+   V +D +A   T K L  +L  KLLS R IS  V++   K AW +     +V+ +G N+FLF+ N   +  R+L+   
Subjt:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES

Query:  WLFDKFLLVLSKLIPM----------ATQWV------------------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW
        W FD+ L+++   + +           + WV                          D ++    + W   +RVRV+ D+ KPL RGIK+ LD P+G CW
Subjt:  WLFDKFLLVLSKLIPM----------ATQWV------------------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW

Query:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTG
         PI+YE+LP+    CG + H   +CS   ++S S + Q  YG WL++ G
Subjt:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]2.6e-2232.93Show/hide
Query:  LDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLF
        +D++ + WE F  T +E  T V +DR    +T  ++   +  KL + + IS E +R   K+ W +      E LG+N+++    S  E  RVL    W F
Subjt:  LDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLF

Query:  DKFLLVLSKLIPMATQ------------WV-----------------------IVDFDNGGRRYGWK-ESIRVRVQLDISKPLRRGIKVRLDDPLGSCWS
        +K LLVL+   P AT             W+                        V+   G    GW    IRVRV++D+SKPLRRGIK++  D     W 
Subjt:  DKFLLVLSKLIPMATQ------------WV-----------------------IVDFDNGGRRYGWK-ESIRVRVQLDISKPLRRGIKVRLDDPLGSCWS

Query:  PIRYEKLPELCSFCGIIGHTAHNCS--SFYMNSGSSSQQHHYGMWLQYT
        P+RYEKLP+ C  CG IGH+   C   S  + + S  Q   YG WL+ T
Subjt:  PIRYEKLPELCSFCGIIGHTAHNCS--SFYMNSGSSSQQHHYGMWLQYT

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]3.8e-4237.8Show/hide
Query:  DDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFD
        ++LL +W+KF LT EE+   + VD +A  +  + L Y+L  KLL+ R+IS +V+ +    AW +   LTVE +G NLFLF    E +  RV+K   W FD
Subjt:  DDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFD

Query:  KFLLVLSK--------------------LIPMATQWV--------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIR
        K L+VL K                    L  +   W+               VD D   + + W  S+R+RV +DI+KPLRRGIK+ +D P+G CW PI+
Subjt:  KFLLVLSK--------------------LIPMATQWV--------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIR

Query:  YEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQ-QHHYGMWLQYTG
        YE+LP+ C FCG+IGH++H+C + Y+ +   S+    YG WL++ G
Subjt:  YEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQ-QHHYGMWLQYTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.1e-3131.29Show/hide
Query:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES
        MA  DLLE W+ F LT EEE T + VD +A A T   L   L  KL   R I+  VM+   + AW +      V+ LG NLFLFS     +  ++ K   
Subjt:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES

Query:  WLFDKFLLVLSK---LIPMATQ-------WV-IVDFDNG----------GRRYG-------------WKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW
        W FD+ L++++K   LIP +         WV   D   G          G   G             W  ++RVRV LDISKPLRRGIK+ LD P+G  W
Subjt:  WLFDKFLLVLSK---LIPMATQ-------WV-IVDFDNG----------GRRYG-------------WKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW

Query:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTGR---TTTFFRSPRTSPM---GQNKIMVDIQ--DDGNRPKEMAAVTAPSFL-
         PI+YE+LP+ C  CG+                SS ++H YG WL+Y G    T    + P+   +   G N           G++  + A  T P  + 
Subjt:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTGR---TTTFFRSPRTSPM---GQNKIMVDIQ--DDGNRPKEMAAVTAPSFL-

Query:  ---GATGNDETASQPHDSGSKPMEISPAIEETVTVPVNSIHP
             T   +  ++P   G  P+ I    +      +++++P
Subjt:  ---GATGNDETASQPHDSGSKPMEISPAIEETVTVPVNSIHP

XP_023908234.1 uncharacterized protein LOC112019922 [Quercus suber]5.8e-2230.89Show/hide
Query:  DDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFD
        D++++  E   LTVEEE   + +         +S   +L  K L+ +  +        +AAW I +G+ + ++G NLF F   SE +  RVLK   W FD
Subjt:  DDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFD

Query:  KFLLVLSK----------LIPMATQWVIV---DFD--------NGGRRYGWKESI-------------RVRVQLDISKPLRRGIKVRLDDPLGSCWSPIR
          LL+L+K           +  A+ W+ +    FD          G R G  E +             RVRV L +SKPLRRG  + +D   G  W   +
Subjt:  KFLLVLSK----------LIPMATQWVIV---DFD--------NGGRRYGWKESI-------------RVRVQLDISKPLRRGIKVRLDDPLGSCWSPIR

Query:  YEKLPELCSFCGIIGHTAHNCS-SFYMNSGSSSQQHHYGMWLQYTG
        YE+LP  C FCG++GH  H+C+  F    G    ++ YG W++ +G
Subjt:  YEKLPELCSFCGIIGHTAHNCS-SFYMNSGSSSQQHHYGMWLQYTG

TrEMBL top hitse value%identityAlignment
A0A2N9IG44 CCHC-type domain-containing protein3.2e-2635.6Show/hide
Query:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESW
        MA D LLE+W KF+LT E EA    V+ +    +R      L  KL++ ++ S EV++      W    G+T   +  NLF F   +E E +RVL    W
Subjt:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESW

Query:  LFDKFLLVLSKLIPMATQWVIVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIRYEKLPELCSFCGIIGHTAHNCS
        LFD +LL L +         + + D      GW  ++RVR+QLD ++P+ RG  +     LG  W   RYE+LP +C  CG+IGH   +C+
Subjt:  LFDKFLLVLSKLIPMATQWVIVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIRYEKLPELCSFCGIIGHTAHNCS

A0A6J1BSZ1 uncharacterized protein LOC1110054817.8e-3333.33Show/hide
Query:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES
        MA  +LLE W+ F LT EE+   V +D +A   T K L  +L  KLLS R IS  V++   K AW +     +V+ +G N+FLF+ N   +  R+L+   
Subjt:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES

Query:  WLFDKFLLVLSKLIPM----------ATQWV------------------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW
        W FD+ L+++   + +           + WV                          D ++    + W   +RVRV+ D+ KPL RGIK+ LD P+G CW
Subjt:  WLFDKFLLVLSKLIPM----------ATQWV------------------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW

Query:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTG
         PI+YE+LP+    CG + H   +CS   ++S S + Q  YG WL++ G
Subjt:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTG

A0A6J1D765 uncharacterized protein LOC1110179021.3e-2232.93Show/hide
Query:  LDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLF
        +D++ + WE F  T +E  T V +DR    +T  ++   +  KL + + IS E +R   K+ W +      E LG+N+++    S  E  RVL    W F
Subjt:  LDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLF

Query:  DKFLLVLSKLIPMATQ------------WV-----------------------IVDFDNGGRRYGWK-ESIRVRVQLDISKPLRRGIKVRLDDPLGSCWS
        +K LLVL+   P AT             W+                        V+   G    GW    IRVRV++D+SKPLRRGIK++  D     W 
Subjt:  DKFLLVLSKLIPMATQ------------WV-----------------------IVDFDNGGRRYGWK-ESIRVRVQLDISKPLRRGIKVRLDDPLGSCWS

Query:  PIRYEKLPELCSFCGIIGHTAHNCS--SFYMNSGSSSQQHHYGMWLQYT
        P+RYEKLP+ C  CG IGH+   C   S  + + S  Q   YG WL+ T
Subjt:  PIRYEKLPELCSFCGIIGHTAHNCS--SFYMNSGSSSQQHHYGMWLQYT

A0A6J1DU55 uncharacterized protein LOC1110231351.9e-4237.8Show/hide
Query:  DDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFD
        ++LL +W+KF LT EE+   + VD +A  +  + L Y+L  KLL+ R+IS +V+ +    AW +   LTVE +G NLFLF    E +  RV+K   W FD
Subjt:  DDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFD

Query:  KFLLVLSK--------------------LIPMATQWV--------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIR
        K L+VL K                    L  +   W+               VD D   + + W  S+R+RV +DI+KPLRRGIK+ +D P+G CW PI+
Subjt:  KFLLVLSK--------------------LIPMATQWV--------------IVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIR

Query:  YEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQ-QHHYGMWLQYTG
        YE+LP+ C FCG+IGH++H+C + Y+ +   S+    YG WL++ G
Subjt:  YEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQ-QHHYGMWLQYTG

A0A6J1DX30 uncharacterized protein LOC1110248741.5e-3131.29Show/hide
Query:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES
        MA  DLLE W+ F LT EEE T + VD +A A T   L   L  KL   R I+  VM+   + AW +      V+ LG NLFLFS     +  ++ K   
Subjt:  MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIP-TGLTVEKLGVNLFLFSLNSEEEHIRVLKQES

Query:  WLFDKFLLVLSK---LIPMATQ-------WV-IVDFDNG----------GRRYG-------------WKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW
        W FD+ L++++K   LIP +         WV   D   G          G   G             W  ++RVRV LDISKPLRRGIK+ LD P+G  W
Subjt:  WLFDKFLLVLSK---LIPMATQ-------WV-IVDFDNG----------GRRYG-------------WKESIRVRVQLDISKPLRRGIKVRLDDPLGSCW

Query:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTGR---TTTFFRSPRTSPM---GQNKIMVDIQ--DDGNRPKEMAAVTAPSFL-
         PI+YE+LP+ C  CG+                SS ++H YG WL+Y G    T    + P+   +   G N           G++  + A  T P  + 
Subjt:  SPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTGR---TTTFFRSPRTSPM---GQNKIMVDIQ--DDGNRPKEMAAVTAPSFL-

Query:  ---GATGNDETASQPHDSGSKPMEISPAIEETVTVPVNSIHP
             T   +  ++P   G  P+ I    +      +++++P
Subjt:  ---GATGNDETASQPHDSGSKPMEISPAIEETVTVPVNSIHP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding3.4e-0428.57Show/hide
Query:  PMATQWVIVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIRYEKLPELCSFCGIIGHTAHNC
        P+      ++FD G          RV ++++++KPL+  + +  D         + YE L ++CS CGI GH  H+C
Subjt:  PMATQWVIVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIRYEKLPELCSFCGIIGHTAHNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTTGACGATTTGTTGGAGAACTGGGAAAAGTTCAATCTGACTGTTGAGGAGGAAGCGACAGAGGTGGGCGTTGACCGCAATGCTGCTGCGGTTACTAGAAAGTC
CCTGGGATACAACCTTTTCGACAAACTGCTTTCTCCTCGGGTCATCTCAGGAGAAGTTATGAGAAAGAATTTTAAAGCGGCATGGAATATTCCTACAGGATTGACGGTCG
AAAAATTAGGGGTTAATTTATTTTTATTTTCCCTGAACTCGGAAGAAGAACATATTCGAGTCTTGAAACAAGAATCGTGGCTCTTCGACAAGTTCCTGCTGGTGCTTTCC
AAACTCATTCCTATGGCAACACAGTGGGTAATTGTGGATTTCGATAATGGAGGAAGGAGATATGGCTGGAAAGAGAGTATCCGGGTTCGTGTTCAATTGGATATCTCCAA
ACCTCTCCGGAGGGGCATCAAAGTTAGACTCGACGATCCGTTAGGCAGCTGTTGGTCTCCTATCCGTTATGAAAAATTGCCAGAACTATGTTCTTTTTGCGGCATCATAG
GGCATACGGCACACAATTGTAGCTCTTTCTACATGAACAGTGGCTCATCTTCTCAACAGCATCACTATGGTATGTGGCTTCAATACACAGGTCGTACAACTACTTTTTTT
CGATCTCCTAGAACGAGTCCGATGGGGCAAAATAAAATTATGGTTGATATTCAAGATGATGGCAATCGTCCCAAGGAGATGGCAGCAGTTACGGCACCATCGTTTCTTGG
TGCGACGGGCAACGACGAAACAGCTTCACAGCCGCACGATTCTGGCAGTAAACCGATGGAGATTTCGCCGGCGATCGAGGAAACGGTTACCGTTCCGGTTAATTCAATTC
ATCCGTTCGGTCAGGATAGCATTAATGGTGGTATTACTTTAGACTCATCGAGGGCGAAGAAGAAGCTCGATTTTGCGGATGTTACGTTTTCCCCATTGATTACGCCAGAA
CCCACCGCTCAGTATACAACCGTTTCTTCAACCATGGCCCAAGGGGAGTCCAAGCCCATTTCAGGTTTATCAGAGTCGGCATGTTACTCCTTGTCCGGTTTCCAATTCAA
GGAAGAAGGGAAACGCATGGAGGCTTTTAAGCCCAACTTGAATGCAGTGGCCCAACAGATGGGCCCATTTAAATTGACCCAAGAACTTGCAAGGAAAGAATCCCAACCAG
CCCAACCAGCCCAGTTTGCAGTCCCTGAATGCAACGAAAAGCAGGCCCCTAACGTGGTTTCACATCAGCCGGTATCACAAATTGTTTTGGGACTACCGAACTCTAAATGG
AGGAGGCGTGCACGATCCAATCTGTCCAGTTCTGGGCATGCTTCAAGCTCAGATCCATTTAAGAAGCGCTTTGGTGATGGCCTTTCTGGAGGTATCAATAAAAGACCTCG
GTCAGAAAATGAGGAAGAATCAACTTATGGAGCAGCATCGGCGGAGGCTGACAATCAGCCCCTCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTTGACGATTTGTTGGAGAACTGGGAAAAGTTCAATCTGACTGTTGAGGAGGAAGCGACAGAGGTGGGCGTTGACCGCAATGCTGCTGCGGTTACTAGAAAGTC
CCTGGGATACAACCTTTTCGACAAACTGCTTTCTCCTCGGGTCATCTCAGGAGAAGTTATGAGAAAGAATTTTAAAGCGGCATGGAATATTCCTACAGGATTGACGGTCG
AAAAATTAGGGGTTAATTTATTTTTATTTTCCCTGAACTCGGAAGAAGAACATATTCGAGTCTTGAAACAAGAATCGTGGCTCTTCGACAAGTTCCTGCTGGTGCTTTCC
AAACTCATTCCTATGGCAACACAGTGGGTAATTGTGGATTTCGATAATGGAGGAAGGAGATATGGCTGGAAAGAGAGTATCCGGGTTCGTGTTCAATTGGATATCTCCAA
ACCTCTCCGGAGGGGCATCAAAGTTAGACTCGACGATCCGTTAGGCAGCTGTTGGTCTCCTATCCGTTATGAAAAATTGCCAGAACTATGTTCTTTTTGCGGCATCATAG
GGCATACGGCACACAATTGTAGCTCTTTCTACATGAACAGTGGCTCATCTTCTCAACAGCATCACTATGGTATGTGGCTTCAATACACAGGTCGTACAACTACTTTTTTT
CGATCTCCTAGAACGAGTCCGATGGGGCAAAATAAAATTATGGTTGATATTCAAGATGATGGCAATCGTCCCAAGGAGATGGCAGCAGTTACGGCACCATCGTTTCTTGG
TGCGACGGGCAACGACGAAACAGCTTCACAGCCGCACGATTCTGGCAGTAAACCGATGGAGATTTCGCCGGCGATCGAGGAAACGGTTACCGTTCCGGTTAATTCAATTC
ATCCGTTCGGTCAGGATAGCATTAATGGTGGTATTACTTTAGACTCATCGAGGGCGAAGAAGAAGCTCGATTTTGCGGATGTTACGTTTTCCCCATTGATTACGCCAGAA
CCCACCGCTCAGTATACAACCGTTTCTTCAACCATGGCCCAAGGGGAGTCCAAGCCCATTTCAGGTTTATCAGAGTCGGCATGTTACTCCTTGTCCGGTTTCCAATTCAA
GGAAGAAGGGAAACGCATGGAGGCTTTTAAGCCCAACTTGAATGCAGTGGCCCAACAGATGGGCCCATTTAAATTGACCCAAGAACTTGCAAGGAAAGAATCCCAACCAG
CCCAACCAGCCCAGTTTGCAGTCCCTGAATGCAACGAAAAGCAGGCCCCTAACGTGGTTTCACATCAGCCGGTATCACAAATTGTTTTGGGACTACCGAACTCTAAATGG
AGGAGGCGTGCACGATCCAATCTGTCCAGTTCTGGGCATGCTTCAAGCTCAGATCCATTTAAGAAGCGCTTTGGTGATGGCCTTTCTGGAGGTATCAATAAAAGACCTCG
GTCAGAAAATGAGGAAGAATCAACTTATGGAGCAGCATCGGCGGAGGCTGACAATCAGCCCCTCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MALDDLLENWEKFNLTVEEEATEVGVDRNAAAVTRKSLGYNLFDKLLSPRVISGEVMRKNFKAAWNIPTGLTVEKLGVNLFLFSLNSEEEHIRVLKQESWLFDKFLLVLS
KLIPMATQWVIVDFDNGGRRYGWKESIRVRVQLDISKPLRRGIKVRLDDPLGSCWSPIRYEKLPELCSFCGIIGHTAHNCSSFYMNSGSSSQQHHYGMWLQYTGRTTTFF
RSPRTSPMGQNKIMVDIQDDGNRPKEMAAVTAPSFLGATGNDETASQPHDSGSKPMEISPAIEETVTVPVNSIHPFGQDSINGGITLDSSRAKKKLDFADVTFSPLITPE
PTAQYTTVSSTMAQGESKPISGLSESACYSLSGFQFKEEGKRMEAFKPNLNAVAQQMGPFKLTQELARKESQPAQPAQFAVPECNEKQAPNVVSHQPVSQIVLGLPNSKW
RRRARSNLSSSGHASSSDPFKKRFGDGLSGGINKRPRSENEEESTYGAASAEADNQPLREP