; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007803 (gene) of Snake gourd v1 genome

Gene IDTan0007803
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein CURVATURE THYLAKOID 1C, chloroplastic
Genome locationLG06:2269351..2272901
RNA-Seq ExpressionTan0007803
SyntenyTan0007803
Gene Ontology termsGO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025564 - Cyanobacterial aminoacyl-tRNA synthetase, CAAD domain
IPR033344 - Protein CURVATURE THYLAKOID 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593469.1 Malonate--CoA ligase, partial [Cucurbita argyrosperma subsp. sororia]3.2e-6693.15Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQ-KLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKL
        MASI ATLPPPLLAPGKSFT L T+Q KLTVFPIAKG SAN +VKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVV  WTATNLVTAVDKL
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQ-KLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKL

Query:  PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
Subjt:  PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

XP_022144822.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Momordica charantia]2.2e-6286.9Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASIVA+LPPPLLAPGKSFTLLNT QKLTVFPIAK  SA VVVKA GDSS+SS S+ IIKSV+N+WDQPEDRLGLFGLGFAAV A+WTATN VTA+DKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPG+LEFIG LVSWWFVYRYLLFKPNREELLQIINKS+ADVLGQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

XP_022964632.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Cucurbita moschata]3.2e-6693.15Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQ-KLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKL
        MASI ATLPPPLLAPGKSFT L T+Q KLTVFPIAKG SAN +VKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVV  WTATNLVTAVDKL
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQ-KLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKL

Query:  PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
Subjt:  PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

XP_023000405.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Cucurbita maxima]8.5e-6792.41Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASI ATLPPPLLAPGKSF  L T+QK+TVFPIAKG SAN +VKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVV  WTATNLVTAVDKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

XP_038897597.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Benincasa hispida]2.5e-6387.59Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASIVATLPPPLLAP KSFTLLNT QKLT FPIA   S NVVVKA+G SSESSTS+DIIKSVRNVWDQPEDRL LFGLGFAAV  VWTATNLVTA+DKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPGVLEFIG LVSWWFVYRYLLFKPNREELLQIINKS+ DV GQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

TrEMBL top hitse value%identityAlignment
A0A0A0KC66 CAAD domain-containing protein3.4e-6183.45Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASIVATLPPPLLAP KSFT+LN  QKL+VF  A G S NVVVKA+G SSESSTS+DI+KSVRNVWDQPEDRL LFGLGFAAV   WTATN+VTA+DKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPGVLEFIG LVSWWFVYRYLLFKPNREELLQIINKS+ DV GQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

A0A5D3BWY3 Protein CURVATURE THYLAKOID 1C1.7e-6083.45Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASIVATLPPPLLAP KSFT+LN  QKL+V   AKG   NVVVKA+G SSESSTS+DIIKSVRNVWDQPEDR  LFGLGFAAV   WTATNLVTA+DKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPGVLEFIG LVSWWFVYRYLLFKPNREELLQIINKS+ DV GQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

A0A6J1CUT1 protein CURVATURE THYLAKOID 1C, chloroplastic1.0e-6286.9Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASIVA+LPPPLLAPGKSFTLLNT QKLTVFPIAK  SA VVVKA GDSS+SS S+ IIKSV+N+WDQPEDRLGLFGLGFAAV A+WTATN VTA+DKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPG+LEFIG LVSWWFVYRYLLFKPNREELLQIINKS+ADVLGQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

A0A6J1HLG0 protein CURVATURE THYLAKOID 1C, chloroplastic1.6e-6693.15Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQ-KLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKL
        MASI ATLPPPLLAPGKSFT L T+Q KLTVFPIAKG SAN +VKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVV  WTATNLVTAVDKL
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQ-KLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKL

Query:  PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
Subjt:  PLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

A0A6J1KFS9 protein CURVATURE THYLAKOID 1C, chloroplastic4.1e-6792.41Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP
        MASI ATLPPPLLAPGKSF  L T+QK+TVFPIAKG SAN +VKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVV  WTATNLVTAVDKLP
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLP

Query:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
Subjt:  LLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

SwissProt top hitse value%identityAlignment
O04616 Protein CURVATURE THYLAKOID 1A, chloroplastic1.1e-1340.4Show/hide
Query:  SSESSTSID---IIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLG
        SSE ++SID   +I  ++  WD  E++  +   G  A+VAVW ++ +V A++ +PLLP V+E +G+  + WFVYRYLLFK +R+EL + I      + G
Subjt:  SSESSTSID---IIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLG

Q119Z5 Glutamate--tRNA ligase2.7e-0732.14Show/hide
Query:  VRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
        +  ++ Q + +L LFG     ++       ++ A+D +P+L  + E IG++   WFVYRYLL + NR+ELL  I     ++ G+
Subjt:  VRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

Q8LCA1 Protein CURVATURE THYLAKOID 1B, chloroplastic3.4e-1836.84Show/hide
Query:  NVVVKA---IGDSSESSTSI------DIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREE
        NVV +A   +G++  ++T        +I+K+ +  W++ +D+  +  L FA VVA+W +  +++A+D+LPL+PGVLE +GI  + WF Y+ L+FKP+RE 
Subjt:  NVVVKA---IGDSSESSTSI------DIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREE

Query:  LLQIINKSVADVLG
        L + +  +  D+LG
Subjt:  LLQIINKSVADVLG

Q8LDD3 Protein CURVATURE THYLAKOID 1D, chloroplastic1.2e-0732.98Show/hide
Query:  ESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLG
        E + +++ +  ++   D+    L L+G G  A+VA++  + +V++++ +PL P ++E +G+  + WF  RYLLFK NREEL   +++    VLG
Subjt:  ESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLG

Q9M812 Protein CURVATURE THYLAKOID 1C, chloroplastic3.3e-3751.27Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW
        MASI ATLP PLL   +  + L + QKL  F + +G              S +++VKA G+SS+SST +D++ +++NVWD+ EDRLGL GLGFA +VA+W
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW

Query:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
         + NL+TA+DKLP++    E +GIL S WF YRYLLFKP+R+EL +I+ KSVAD+LGQ
Subjt:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

Arabidopsis top hitse value%identityAlignment
AT1G52220.1 FUNCTIONS IN: molecular_function unknown2.3e-3851.27Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW
        MASI ATLP PLL   +  + L + QKL  F + +G              S +++VKA G+SS+SST +D++ +++NVWD+ EDRLGL GLGFA +VA+W
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW

Query:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
         + NL+TA+DKLP++    E +GIL S WF YRYLLFKP+R+EL +I+ KSVAD+LGQ
Subjt:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

AT1G52220.2 FUNCTIONS IN: molecular_function unknown1.7e-3650.63Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW
        MASI ATLP PLL   +  + L + QKL  F + +G              S +++VKA G+SS+SST +D++ +++N WD+ EDRLGL GLGFA +VA+W
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW

Query:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
         + NL+TA+DKLP++    E +GIL S WF YRYLLFKP+R+EL +I+ KSVAD+LGQ
Subjt:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

AT1G52220.3 FUNCTIONS IN: molecular_function unknown2.2e-2039.24Show/hide
Query:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW
        MASI ATLP PLL   +  + L + QKL  F + +G              S +++VKA G+SS+SST +D++ +++NV                      
Subjt:  MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKG-------------HSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVW

Query:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ
               A+DKLP++    E +GIL S WF YRYLLFKP+R+EL +I+ KSVAD+LGQ
Subjt:  TATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ

AT2G46820.1 photosystem I P subunit2.4e-1936.84Show/hide
Query:  NVVVKA---IGDSSESSTSI------DIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREE
        NVV +A   +G++  ++T        +I+K+ +  W++ +D+  +  L FA VVA+W +  +++A+D+LPL+PGVLE +GI  + WF Y+ L+FKP+RE 
Subjt:  NVVVKA---IGDSSESSTSI------DIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREE

Query:  LLQIINKSVADVLG
        L + +  +  D+LG
Subjt:  LLQIINKSVADVLG

AT2G46820.2 photosystem I P subunit2.4e-1936.84Show/hide
Query:  NVVVKA---IGDSSESSTSI------DIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREE
        NVV +A   +G++  ++T        +I+K+ +  W++ +D+  +  L FA VVA+W +  +++A+D+LPL+PGVLE +GI  + WF Y+ L+FKP+RE 
Subjt:  NVVVKA---IGDSSESSTSI------DIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIGILVSWWFVYRYLLFKPNREE

Query:  LLQIINKSVADVLG
        L + +  +  D+LG
Subjt:  LLQIINKSVADVLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATTGTTGCTACTCTTCCTCCGCCATTGTTGGCCCCTGGCAAAAGCTTCACCCTTCTGAATACTTTTCAGAAGCTCACTGTTTTTCCCATTGCAAAGGGGCA
TTCTGCCAATGTTGTTGTAAAGGCTATTGGAGACAGCTCTGAGTCTTCTACTTCCATCGATATTATTAAGTCTGTTCGAAATGTTTGGGATCAACCTGAAGATCGACTGG
GACTTTTTGGCCTGGGATTTGCAGCTGTAGTAGCTGTATGGACAGCAACAAATCTTGTTACGGCGGTTGACAAGCTACCACTGCTTCCAGGTGTATTAGAATTCATTGGA
ATATTGGTTTCTTGGTGGTTCGTGTATCGCTACCTCTTGTTCAAACCGAACCGGGAAGAGCTTTTGCAGATAATCAACAAGTCAGTAGCAGATGTATTGGGACAGTAA
mRNA sequenceShow/hide mRNA sequence
CAGCAAAGCATCGAGACCTTATCTTCTAAAGCAACAAAAAGTACAGGTTCGTCAATCGGAGGCCATGAATTCCCAATAATCACAATTTTTCGACTTAATTTTTGATTTGG
GAGTGTAAATTTGAATTCAAAGTTCATTGATTTCCCCTACTACTCAGTGTTCACTAAAATCATGGCTTCCATTGTTGCTACTCTTCCTCCGCCATTGTTGGCCCCTGGCA
AAAGCTTCACCCTTCTGAATACTTTTCAGAAGCTCACTGTTTTTCCCATTGCAAAGGGGCATTCTGCCAATGTTGTTGTAAAGGCTATTGGAGACAGCTCTGAGTCTTCT
ACTTCCATCGATATTATTAAGTCTGTTCGAAATGTTTGGGATCAACCTGAAGATCGACTGGGACTTTTTGGCCTGGGATTTGCAGCTGTAGTAGCTGTATGGACAGCAAC
AAATCTTGTTACGGCGGTTGACAAGCTACCACTGCTTCCAGGTGTATTAGAATTCATTGGAATATTGGTTTCTTGGTGGTTCGTGTATCGCTACCTCTTGTTCAAACCGA
ACCGGGAAGAGCTTTTGCAGATAATCAACAAGTCAGTAGCAGATGTATTGGGACAGTAAATTGTCCTTGTTGTAAAAGGCCCGATTTGGTCCAATCTTCATTGTATTCGT
TCCTTTTTACGGTCAAGTGTGAAGTTTAACCAAGATGGGTGTTCATTCATCAATGCATTCCAATCCCAAATCTTCAATCCTGTAGTCTATATCTATATCAGTTTATGTTT
CCAATCTTTCTTTTTCATAATTTATTTCCTTCAGCTTGTTCTCATTTTCCTGTTTCAAAATCAAATCTGTGCCCATGTGAAGATCGCGATTCAGTGAACGAGTCACAAAG
TCGAGTCCCTCTCCACCTTGTATCACAAACGAAGAGAATTTCTCGACTCCACAAGAAATAACACTTTTATTTTTAAACATATAGCTATTAGAAATGACAGATGGTTCATA
TGCTCGCCCTGTACAGTAAATGGAATTTCACTGGTTAGAGTTTGAAGTTTATCATTCATATCATATTGTAGTCAAGCATGAGGATTATTAAGTTTCTATAACATTGTTTG
GGTGATTGTGTTCAAAATCTAATGGGTTCGAAACACAACTCCATGATTTTGAGTAGTGTATTTCAAACATTGTTTTGCTAATTTTCAGGCTTCAAACACTGTTTTTTAGA
GTTCATAGATGTATTTTCCGAGGACTTACCCGATTTACCTCTAGCCAGGGAGGTGGATTTCAGTATCTCCTGGAGTCATACACTGCACCTCTATCAAAAGCCCCGTATCA
AATGGCCCCACTCGAATTAAAGGAACTCAAGATCCAGTTACAGGAGCTTCTAGACAGAGATTTCATTCGTCCCACTGTCTCACGTGGGGTGCTCCCGCACTCTTTGTGAA
AAAAAAAAAAAAAGGGCCGATGTGTTTATGTATTGACTACCGAGAGCTGAACAAGGTAACCATAAAAGAACAAGTACTCACTTCCCAGGGTCGACGATTTTGTTTGACCT
GTTACAGGGAGCAACAGTGTTCTCCAAGATCGATTTGCACTCGGGATATCACCAATTGAGGATACAGAAAGGAAGACATTCTGAAGACAACTTTCAG
Protein sequenceShow/hide protein sequence
MASIVATLPPPLLAPGKSFTLLNTFQKLTVFPIAKGHSANVVVKAIGDSSESSTSIDIIKSVRNVWDQPEDRLGLFGLGFAAVVAVWTATNLVTAVDKLPLLPGVLEFIG
ILVSWWFVYRYLLFKPNREELLQIINKSVADVLGQ