; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G011420 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G011420
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionEncodes a protein whose expression is responsive to nematode infection.
Genome locationchr04:15762418..15767937
RNA-Seq ExpressionLsi04G011420
SyntenyLsi04G011420
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140437.1 uncharacterized protein At1g66480 [Cucumis sativus]4.7e-8589Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGETMKL TP+QAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGA-TRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKE-KKDIIKPREVR
        LMLARRSASDLTIMKPKS+L EEGGGE+E    GS SGA TRVK+RLPKAEVERLLKE KDEAEAAERIMGLYKTREN   NDHKEKE KKDIIKPRE R
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGA-TRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKE-KKDIIKPREVR

XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo]1.7e-9092.35Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGET+KLNTP+Q GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVL EEGGGE+E    GSGSGATRVKVRLPKAEVERLLKE KDEAEAAERIMGLYKTRE+VC NDHKEKEKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPRE

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo]7.5e-9191.92Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGET+KLNTP+Q GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVR
        LMLARRSASDLTIMKPKSVL EEGGGE+E    GSGSGATRVKVRLPKAEVERLLKE KDEAEAAERIMGLYKTRE+VC NDHKEKEKKDIIKPRE R
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVR

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata]3.3e-7883.08Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  P+QA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPREV
        LML+RRSASDLTIMKPKSVLAEEGG + E      GS ATRVKVRLPKAEVER+LKESKDEAEAAERIMGLY  K RE+VC N+ K EKEK  IIKPRE 
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPREV

Query:  R
        R
Subjt:  R

XP_038891934.1 uncharacterized protein At1g66480 [Benincasa hispida]2.1e-9394.44Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMNISGETMKLNTP+QAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKV KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVR
        LMLARRSASDLTIMKPKSVLAEEGGGE++GSGSGSG+ ATRVKVRLPKAEVERLLKE KDEAEAAERIMGLYKTRENVC NDHK++EKKDIIKPRE R
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVR

TrEMBL top hitse value%identityAlignment
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X18.1e-9192.35Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGET+KLNTP+Q GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVL EEGGGE+E    GSGSGATRVKVRLPKAEVERLLKE KDEAEAAERIMGLYKTRE+VC NDHKEKEKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPRE

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X23.6e-9191.92Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGET+KLNTP+Q GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVR
        LMLARRSASDLTIMKPKSVL EEGGGE+E    GSGSGATRVKVRLPKAEVERLLKE KDEAEAAERIMGLYKTRE+VC NDHKEKEKKDIIKPRE R
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVR

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X21.6e-7883.08Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  P+QA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPREV
        LML+RRSASDLTIMKPKSVLAEEGG + E      GS ATRVKVRLPKAEVER+LKESKDEAEAAERIMGLY  K RE+VC N+ K EKEK  IIKPRE 
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPREV

Query:  R
        R
Subjt:  R

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X12.7e-7883.42Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  P+QA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPRE
        LML+RRSASDLTIMKPKSVLAEEGG + E      GS ATRVKVRLPKAEVER+LKESKDEAEAAERIMGLY  K RE+VC N+ K EKEK  IIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPRE

A0A6J1JB09 uncharacterized protein At1g66480 isoform X26.0e-7883.08Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGNTFG+KKT KVM +SG+T+KL  P+QA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPREV
        LML+RRSASDLTIMKPKSVLAEEG GE E      GS ATRVKVRLPKAEVER+LKES+DEAEAAERIMGLY  K  ENVC N  K EKEK  IIKPRE 
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY--KTRENVCGNDHK-EKEKKDIIKPREV

Query:  R
        R
Subjt:  R

SwissProt top hitse value%identityAlignment
Q6NLC8 Uncharacterized protein At1g664802.5e-3644.24Show/hide
Query:  MGNTFGVK-KTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSAI
        MGN+  VK K  KVM I GET ++ TP+ A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P E            RRV S I
Subjt:  MGNTFGVK-KTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEA-AERIMGLYKTRENVCGN-----DHK
        ++ AK+RL+ LML+RR+ SD+TI +       +GG   +G G   G G T V++RLP++++ +L++E+ ++A A AE+I+G+Y  R    G      D +
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEA-AERIMGLYKTRENVCGN-----DHK

Query:  EKEKKDIIKPREVRVPF
         +     IK RE +V F
Subjt:  EKEKKDIIKPREVRVPF

Arabidopsis top hitse value%identityAlignment
AT1G66480.1 plastid movement impaired 21.8e-3744.24Show/hide
Query:  MGNTFGVK-KTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSAI
        MGN+  VK K  KVM I GET ++ TP+ A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P E            RRV S I
Subjt:  MGNTFGVK-KTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEA-AERIMGLYKTRENVCGN-----DHK
        ++ AK+RL+ LML+RR+ SD+TI +       +GG   +G G   G G T V++RLP++++ +L++E+ ++A A AE+I+G+Y  R    G      D +
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEA-AERIMGLYKTRENVCGN-----DHK

Query:  EKEKKDIIKPREVRVPF
         +     IK RE +V F
Subjt:  EKEKKDIIKPREVRVPF

AT1G71015.1 unknown protein2.1e-4354.97Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +MNI+GE+ KL TP++AG VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+VELP+  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGL
        L L+RRS+SDL++MK K+ + +E   E E S          VK++LPK ++E+L KES+  ++ + +I  L
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.8.7e-4557.39Show/hide
Query:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVM I GET KL TP+ A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+VE     KE  PRRVRS I++SAK+RLES
Subjt:  MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRE
        LMLARRS+SDL+I+KP      E   E EG+         RVKVR+PKAE+E+L+KE   EAEA ++I  L+  ++
Subjt:  LMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRE

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)9.3e-3146.02Show/hide
Query:  MGNTFGVKKT-VKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQ--APRRVRSA-INMSAKD
        MGNT  V++  VKVM I G+  +L TP+ A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLV+LP V K      RRV S  I++ AK+
Subjt:  MGNTFGVKKT-VKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQ--APRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY
        RLE LML+RR+ SD+            G   ++  G G   G TRV++RLP++++ +L++ES D +E A +I+  Y
Subjt:  RLESLMLARRSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAACACCTTTGGAGTCAAGAAGACGGTGAAGGTCATGAACATCTCCGGCGAGACGATGAAACTAAACACCCCGATTCAAGCTGGAGATGTCGTCAAGGAT
TATCCCGGCTTTGTTCTACTCGAATCCGAGGCCGTGAAGCACTACGGAGTTCGAGCAAAGCCATTAGAGCTCCATCAGAAGCTCAGTACGAAGAGACTTTATTTC
CTCGTAGAACTGCCTAAAGTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCGGCGATCAACATGAGCGCGAAGGATAGGCTAGAGAGCTTGATGTTGGCACGA
CGGTCAGCATCGGACCTAACTATCATGAAACCAAAGAGTGTATTAGCGGAGGAGGGTGGTGGAGAGAATGAGGGATCGGGATCGGGATCAGGATCAGGAGCGACA
CGGGTGAAGGTACGGTTGCCGAAGGCCGAGGTGGAAAGGTTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGA
GAAAATGTTTGTGGAAATGATCACAAGGAGAAGGAGAAGAAGGATATCATCAAGCCACGTGAGGTGCGTGTGCCTTTTATTTTACTTCTTTTAAATATTGCAATT
TACAATATTCATAAAATTTCTAAGTCCATTTTGATGTAA
mRNA sequenceShow/hide mRNA sequence
CTTAAACGCCCGCCCACCACCATCTTCCTCTCTCTCTTTTTTTCAAAACTCTTTCCCTTTTGGCCGCCATGGGAAACACCTTTGGAGTCAAGAAGACGGTGAAGG
TCATGAACATCTCCGGCGAGACGATGAAACTAAACACCCCGATTCAAGCTGGAGATGTCGTCAAGGATTATCCCGGCTTTGTTCTACTCGAATCCGAGGCCGTGA
AGCACTACGGAGTTCGAGCAAAGCCATTAGAGCTCCATCAGAAGCTCAGTACGAAGAGACTTTATTTCCTCGTAGAACTGCCTAAAGTTCCAAAAGAACAGGCTC
CACGACGAGTACGGTCGGCGATCAACATGAGCGCGAAGGATAGGCTAGAGAGCTTGATGTTGGCACGACGGTCAGCATCGGACCTAACTATCATGAAACCAAAGA
GTGTATTAGCGGAGGAGGGTGGTGGAGAGAATGAGGGATCGGGATCGGGATCAGGATCAGGAGCGACACGGGTGAAGGTACGGTTGCCGAAGGCCGAGGTGGAAA
GGTTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGAGAAAATGTTTGTGGAAATGATCACAAGGAGAAGGAGA
AGAAGGATATCATCAAGCCACGTGAGGTGCGTGTGCCTTTTATTTTACTTCTTTTAAATATTGCAATTTACAATATTCATAAAATTTCTAAGTCCATTTTGATGT
AAGCGACGTGTGAGTTTTATGACGACAATGGAAGTTGGAACTCAGATTGCAGTAACATCTTAGCTAACACAAGAACGACAAGATTCATGAACAAATTCATTAAAG
TGATACAGTGACGATCTTTCGAGGGTACTACACGTTACGTCCTTCACTTCTACTTAAAGTATTTGAATCTTGAACAAGCTTTTGTAGAATATAGCTAGATTTACT
GCCCTTTATATATACCCATGTACTTATAATTAACTTCATGGTATATATATATATATATATATATATATATATATATATAT
Protein sequenceShow/hide protein sequence
MGNTFGVKKTVKVMNISGETMKLNTPIQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVPKEQAPRRVRSAINMSAKDRLESLMLAR
RSASDLTIMKPKSVLAEEGGGENEGSGSGSGSGATRVKVRLPKAEVERLLKESKDEAEAAERIMGLYKTRENVCGNDHKEKEKKDIIKPREVRVPFILLLLNIAI
YNIHKISKSILM