; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001881 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001881
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:36493447..36497530
RNA-Seq ExpressionLag0001881
SyntenyLag0001881
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR021924 - Protein of unknown function DUF3537
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583771.1 hypothetical protein SDJN03_19703, partial [Cucurbita argyrosperma subsp. sororia]2.9e-4973.76Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA
        M+N  KKSPS      P+DS+++ESE EA EL+R ESFLKWICI DQSNP+ ASLSC +FF+FA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SLSAVA
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA

Query:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
         LSF  LS+WLRLFG NRFLFLDKL +ASP+VR EYS+QL+
Subjt:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

KAG7032428.1 hypothetical protein SDJN02_06473 [Cucurbita argyrosperma subsp. argyrosperma]2.9e-4973.76Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA
        M+N  KKSPS      P+DS+++ESE EA EL+R ESFLKWICI DQSNP+ ASLSC +FF+FA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SLSAVA
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA

Query:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
         LSF  LS+WLRLFG NRFLFLDKL +ASP+VR EYS+QL+
Subjt:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata]1.3e-4972.19Show/hide
Query:  MENEEKKSPSQIDLYQP--LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P  ++SRK        +ESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP--LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK
Subjt:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

XP_022994504.1 uncharacterized protein LOC111490209 [Cucurbita maxima]5.9e-5070.97Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR
        ME+ EKKSPS      P      ++SRK        +ESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+R
Subjt:  MENEEKKSPSQIDLYQP------LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR

Query:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        PFHVVVQ+SLSAVATLSF CLS WLR FGL+RFLFLDKLCE+S K RDEYSKQLK
Subjt:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida]5.4e-5175.89Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA
        ME EEKKS  QID  +     ++E ESEAAEL+RF+S LKWICI D SNP+ ASLSC VFFVFA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SLSAVA
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA

Query:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        TLSF CLS+WLR FGLNRFLFLDKL EASP+VR EY +QL+
Subjt:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

TrEMBL top hitse value%identityAlignment
A0A1S3C949 uncharacterized protein LOC1034982311.4e-4976.26Show/hide
Query:  EEKKSPSQIDLYQPLDSR-KAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATL
        EEKKS  Q+D  +   +  ++ESE EAAEL+RFESFLKWICI+D SN + ASLSC VFFVF  AVPIASHFALSCSDCDEDHQRPFHVVVQ+SLSAVATL
Subjt:  EEKKSPSQIDLYQPLDSR-KAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATL

Query:  SFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        SF CLS+WLRLFGLNRFLFLDKL EASPK+R EY +QL+
Subjt:  SFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

A0A5A7UJ39 Extracellular ligand-gated ion channel protein1.4e-4976.26Show/hide
Query:  EEKKSPSQIDLYQPLDSR-KAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATL
        EEKKS  Q+D  +   +  ++ESE EAAEL+RFESFLKWICI+D SN + ASLSC VFFVF  AVPIASHFALSCSDCDEDHQRPFHVVVQ+SLSAVATL
Subjt:  EEKKSPSQIDLYQPLDSR-KAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATL

Query:  SFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        SF CLS+WLRLFGLNRFLFLDKL EASPK+R EY +QL+
Subjt:  SFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

A0A6J1EG16 uncharacterized protein LOC1114339901.4e-4973.76Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA
        M+N  KKSPS      P+DS+++ESE EA EL+R ESFLKWICI DQSNP+ ASLSC +FF+FA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SLSAVA
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVA

Query:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
         LSF  LS+WLRLFG NRFLFLDKL +ASP+VR EYS+QL+
Subjt:  TLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

A0A6J1GSP8 uncharacterized protein LOC1114567806.4e-5072.19Show/hide
Query:  MENEEKKSPSQIDLYQP--LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P  ++SRK        +ESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP--LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK
Subjt:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

A0A6J1K314 uncharacterized protein LOC1114902092.9e-5070.97Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR
        ME+ EKKSPS      P      ++SRK        +ESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+R
Subjt:  MENEEKKSPSQIDLYQP------LDSRK--------AESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR

Query:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK
        PFHVVVQ+SLSAVATLSF CLS WLR FGL+RFLFLDKLCE+S K RDEYSKQLK
Subjt:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)2.4e-2550.91Show/hide
Query:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP
        EL  F  +L+W+C +D S+P++A LS  +F VF   VP  SHF L+C+DCD  H RP+  VVQ+SLS+VAT+SF CL+ ++  +GL RFLF DKL + S 
Subjt:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQL
         VR  Y+ QL
Subjt:  KVRDEYSKQL

AT1G50630.2 Protein of unknown function (DUF3537)2.4e-2550.91Show/hide
Query:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP
        EL  F  +L+W+C +D S+P++A LS  +F VF   VP  SHF L+C+DCD  H RP+  VVQ+SLS+VAT+SF CL+ ++  +GL RFLF DKL + S 
Subjt:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQL
         VR  Y+ QL
Subjt:  KVRDEYSKQL

AT3G20300.1 Protein of unknown function (DUF3537)1.4e-2552.73Show/hide
Query:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP
        EL  F  +L+W+C +DQS+P++A LS  +F VF   VP  SHF L+CSDCD  H RP+  VVQ+SLS+ A LSF CLS ++  +GL RFLF DKL + S 
Subjt:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQL
         VR  Y+ QL
Subjt:  KVRDEYSKQL

AT4G03820.1 Protein of unknown function (DUF3537)3.4e-1944.83Show/hide
Query:  ESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKL
        ES A +L    SF +     DQSN     LS  +FF+ A  VP+ SHF L C+DCD  H+RP+  +VQ+SLS  A +SF  LS W + +G+ RFLF DKL
Subjt:  ESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKL

Query:  CEASPKVRDEYSKQLK
         + S KVR  Y  +++
Subjt:  CEASPKVRDEYSKQLK

AT4G22270.1 Protein of unknown function (DUF3537)1.0e-2349.57Show/hide
Query:  SFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEY
        +F+  +   DQSN  +A LS  VFF+    VP+ SHF L CSDCD  H+RP+ V+VQ+SLS  A +SF  LSIW R FG+ RFLFLDKL + S KVR EY
Subjt:  SFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEY

Query:  SKQLKGFAVGLRRVALF
          +++     L+R+ +F
Subjt:  SKQLKGFAVGLRRVALF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCTCGGTGAGGTCGTCGTCGACGTGGGAGTGGTGGTCGTTGAGGTCGAGGTTGTGCATGTGGGCGGCCTTGGCGGCGGCGGCTTGGACGTCGCGAGGGGCGTCCTG
TTCCGAGGAGAAGAAAATGTATGAAAATCTCCCGCTCCCCTGTTCCGAGGCTATGGAAAACGAGGAGAAGAAATCTCCCTCTCAAATTGATCTGTACCAACCACTCGATT
CGCGAAAAGCAGAATCTGAATCTGAAGCGGCCGAATTGAAGAGGTTCGAATCATTCCTGAAATGGATTTGCATAATCGATCAATCCAATCCATTCAGCGCTTCGCTCTCC
TGCTTCGTCTTCTTCGTCTTCGCATTCGCCGTCCCTATCGCATCGCACTTCGCTCTCTCTTGCTCCGATTGCGACGAAGATCACCAGAGGCCTTTTCATGTCGTCGTTCA
GGTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGCCTCTCTATTTGGCTCCGTCTCTTCGGATTAAACCGATTTCTGTTCCTCGATAAGCTCTGTGAAGCAAGTC
CGAAGGTTCGGGATGAGTATTCCAAGCAATTGAAGGGGTTTGCTGTCGGCTTGAGGCGGGTGGCTCTCTTCGGGTCCAGAGGTGGTGGCGCGAGTAGTGGCCGGCTTTGG
CGGCTGCTTTCAAGTGTGTTTCTTGCTTTCAAGTGTGTTTCTTTCTTTGGCAGGGAGATTTTGATGATGGATGACTTGGTGAAGAAGTGGAAGTCGTGGTCACTTACCGG
GGAGGAGGCGGAGGTGATACGTCCGGGTGCAGTTTCGTCTTCTCTGTCAGCGGCAGACTTGGGTTTATGTGTGGTGGGTAAGGTCCTATCATCCAAGCGGGTCAATCCAG
AGGCCTTTCGCAATGGGGAGAAGTCGAGAGTTCTAGGTACGGGTCCTTGGGCTTTTGATCGGTCCCTTATCGTGCTTAAGGAGCCAAAGGATGTTGCGTCGGTGCTAGAA
ACAGAGTTTGAAGATTGTCCTTTTTGGGTGCAGATTCATAATGTTCCTATGAAGTGGCAGACGAGAGGTTGGGCGCAGCATCTAGGAGGCAAGATTGGGGTGGTTGAGGA
GGTGGGCGGATCAAATGAGAATGATTGGATAGGCCCGATTCTGCGGCTCAGAGTTCGTCTTCGTTTGGACCGACCACTACGAAGGGGGTTTCATATGCAGGATGATGATG
GGAAGGACCATTGGTGTCCAATGTTGTATGAGCGTCTCCCGGACTTCTGTTTTGGGTGTGGTTGTTTAGGGCACTCGAGGCGGGAGTGTGAAGGTGTGGGGGCGAGTTCT
TCAGGTGTGGGGGATGACCAGTATGGCGAGTGGCTGAGAGCAGGTGCCTTTCTTGGGGAGAGTATCCGGCAGGATTCAGGTCAGGTAGGGGCAAAGGACAGTGGGAAGCA
GGCCGTAGTGGAACCCATGCCTGGTTCAGGAGAGGTAGTAGGTCCTGTTCTGGGTGAGGTGCATGAGAAGGAGACAGGGGTATCTGATAGTGTTGCTGGGGAGGAAGTGC
CAGTTCCTGCTTTGGATACACGGACGGAGATTTGCACGGTGGAGGGAGGGAAGGAAGCTCAGGTGGAGGGTAGTGTGAAGGGTAAGCAGAAGGTGGGGCATGGCCCCTCC
GAACCATGGTCCACCTTTGTGGAGGGGGAGGCTATGGTGGTGGATAAGGTGGAGATGGTCCAGGGTAGAGAGGTTGAGAAGGGGGGCAAGAGTTCTAGTAGCAGGGGTGG
AAGAGGCGAGCGAGAGAGACTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCTCGGTGAGGTCGTCGTCGACGTGGGAGTGGTGGTCGTTGAGGTCGAGGTTGTGCATGTGGGCGGCCTTGGCGGCGGCGGCTTGGACGTCGCGAGGGGCGTCCTG
TTCCGAGGAGAAGAAAATGTATGAAAATCTCCCGCTCCCCTGTTCCGAGGCTATGGAAAACGAGGAGAAGAAATCTCCCTCTCAAATTGATCTGTACCAACCACTCGATT
CGCGAAAAGCAGAATCTGAATCTGAAGCGGCCGAATTGAAGAGGTTCGAATCATTCCTGAAATGGATTTGCATAATCGATCAATCCAATCCATTCAGCGCTTCGCTCTCC
TGCTTCGTCTTCTTCGTCTTCGCATTCGCCGTCCCTATCGCATCGCACTTCGCTCTCTCTTGCTCCGATTGCGACGAAGATCACCAGAGGCCTTTTCATGTCGTCGTTCA
GGTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGCCTCTCTATTTGGCTCCGTCTCTTCGGATTAAACCGATTTCTGTTCCTCGATAAGCTCTGTGAAGCAAGTC
CGAAGGTTCGGGATGAGTATTCCAAGCAATTGAAGGGGTTTGCTGTCGGCTTGAGGCGGGTGGCTCTCTTCGGGTCCAGAGGTGGTGGCGCGAGTAGTGGCCGGCTTTGG
CGGCTGCTTTCAAGTGTGTTTCTTGCTTTCAAGTGTGTTTCTTTCTTTGGCAGGGAGATTTTGATGATGGATGACTTGGTGAAGAAGTGGAAGTCGTGGTCACTTACCGG
GGAGGAGGCGGAGGTGATACGTCCGGGTGCAGTTTCGTCTTCTCTGTCAGCGGCAGACTTGGGTTTATGTGTGGTGGGTAAGGTCCTATCATCCAAGCGGGTCAATCCAG
AGGCCTTTCGCAATGGGGAGAAGTCGAGAGTTCTAGGTACGGGTCCTTGGGCTTTTGATCGGTCCCTTATCGTGCTTAAGGAGCCAAAGGATGTTGCGTCGGTGCTAGAA
ACAGAGTTTGAAGATTGTCCTTTTTGGGTGCAGATTCATAATGTTCCTATGAAGTGGCAGACGAGAGGTTGGGCGCAGCATCTAGGAGGCAAGATTGGGGTGGTTGAGGA
GGTGGGCGGATCAAATGAGAATGATTGGATAGGCCCGATTCTGCGGCTCAGAGTTCGTCTTCGTTTGGACCGACCACTACGAAGGGGGTTTCATATGCAGGATGATGATG
GGAAGGACCATTGGTGTCCAATGTTGTATGAGCGTCTCCCGGACTTCTGTTTTGGGTGTGGTTGTTTAGGGCACTCGAGGCGGGAGTGTGAAGGTGTGGGGGCGAGTTCT
TCAGGTGTGGGGGATGACCAGTATGGCGAGTGGCTGAGAGCAGGTGCCTTTCTTGGGGAGAGTATCCGGCAGGATTCAGGTCAGGTAGGGGCAAAGGACAGTGGGAAGCA
GGCCGTAGTGGAACCCATGCCTGGTTCAGGAGAGGTAGTAGGTCCTGTTCTGGGTGAGGTGCATGAGAAGGAGACAGGGGTATCTGATAGTGTTGCTGGGGAGGAAGTGC
CAGTTCCTGCTTTGGATACACGGACGGAGATTTGCACGGTGGAGGGAGGGAAGGAAGCTCAGGTGGAGGGTAGTGTGAAGGGTAAGCAGAAGGTGGGGCATGGCCCCTCC
GAACCATGGTCCACCTTTGTGGAGGGGGAGGCTATGGTGGTGGATAAGGTGGAGATGGTCCAGGGTAGAGAGGTTGAGAAGGGGGGCAAGAGTTCTAGTAGCAGGGGTGG
AAGAGGCGAGCGAGAGAGACTTTAG
Protein sequenceShow/hide protein sequence
MISVRSSSTWEWWSLRSRLCMWAALAAAAWTSRGASCSEEKKMYENLPLPCSEAMENEEKKSPSQIDLYQPLDSRKAESESEAAELKRFESFLKWICIIDQSNPFSASLS
CFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKGFAVGLRRVALFGSRGGGASSGRLW
RLLSSVFLAFKCVSFFGREILMMDDLVKKWKSWSLTGEEAEVIRPGAVSSSLSAADLGLCVVGKVLSSKRVNPEAFRNGEKSRVLGTGPWAFDRSLIVLKEPKDVASVLE
TEFEDCPFWVQIHNVPMKWQTRGWAQHLGGKIGVVEEVGGSNENDWIGPILRLRVRLRLDRPLRRGFHMQDDDGKDHWCPMLYERLPDFCFGCGCLGHSRRECEGVGASS
SGVGDDQYGEWLRAGAFLGESIRQDSGQVGAKDSGKQAVVEPMPGSGEVVGPVLGEVHEKETGVSDSVAGEEVPVPALDTRTEICTVEGGKEAQVEGSVKGKQKVGHGPS
EPWSTFVEGEAMVVDKVEMVQGREVEKGGKSSSSRGGRGERERL