; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022971 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022971
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionKynurenine formamidase
Genome locationscaffold357:236986..238236
RNA-Seq ExpressionMS022971
SyntenyMS022971
Gene Ontology termsNA
InterPro domainsIPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596247.1 hypothetical protein SDJN03_09427, partial [Cucurbita argyrosperma subsp. sororia]5.7e-11778.75Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE
        MNSL LHSSLGP L  KPLPC LK+P+ +  L G   S RIFSRS    V L PSLH+S+ +TC+STPSNEGVVSVINFEDLVEKDFSFLDSDDF S EE
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE

Query:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE
        +DQKIRRIISAGE+AESSQVMV+I SEGFVD+L+DSAPCRSLLV+HD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELD IFG 
Subjt:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE

Query:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD
        L++RC+PG RLVISHP G+ AL++EQQQF DVVVS LPD+ TLQKVAADHS  LTEFVD++GFYLAVLKF+KD
Subjt:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD

XP_004136559.1 uncharacterized protein LOC101219545 [Cucumis sativus]1.3e-11678.97Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK
        MNSL LHSSL PS+T KP    L+RP+ IC L GSH + RIF+  +S +P+LH+S  + C+STPSNEGVVSV+NFEDLVEKDFSFLDSDDFSS EE+ QK
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK

Query:  IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR
        IRRIISAGE+ ESSQVMVSI SEGFVDQLF  AP RSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFG LS+R
Subjt:  IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR

Query:  CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
        CV GARLVISHPNG+ ALE+EQQQFPDVVVS LPD+MTLQK AADHS DLTEF+D++GFYLA+LKF+KD S
Subjt:  CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS

XP_022151424.1 uncharacterized protein LOC111019361 [Momordica charantia]1.0e-150100Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK
        MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK

Query:  IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR
        IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR
Subjt:  IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR

Query:  CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
        CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
Subjt:  CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS

XP_022971263.1 uncharacterized protein LOC111470036 [Cucurbita maxima]2.0e-11779.12Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE
        MNSL LHSSLGP L  KPLPC LKRP+ +  L G   S RIFSRS    V L PSLH+S+ +TC+STPSNEGVVSVINFEDLVEKDFSFLDSDDF S EE
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE

Query:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE
        +D+KIRRIISAGE+AESSQVMV+I SEGFVD+L+DSAPCRSLLV+HD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFG 
Subjt:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE

Query:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD
        L++RC+PG RLVISHP G+ AL++EQQQF DVVVS LPD+ TLQKVAADHS  LTEFVD++GFYLAVLKF+KD
Subjt:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]9.4e-12079.56Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSV---SLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEY
        MNSL LHSSLGPS T KP PCSLKR + ICTL GSH + RIF+  +    LYP++H++K +TC STP +EGVVSVINFEDLVEKDFSFLDSD+FSS+EE+
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSV---SLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEY

Query:  DQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGEL
        DQKIRRIISAGE+ ESSQVMVSI SEGFVDQLFD APCRSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIF  L
Subjt:  DQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGEL

Query:  SKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
        SKRCV GARLVISHPNG+  LE+EQQQFPDVVVS LP++M L+  AADHS +LTEF+D+  FYLAVLKFHKDRS
Subjt:  SKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS

TrEMBL top hitse value%identityAlignment
A0A1S3B7V6 uncharacterized protein LOC1034867471.9e-11377.29Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLK--RPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYD
        MNSL LHSSL PSLT  P    LK  RP+ +C   GSH + RIF+ S+S +P+LH+S     +STPSNEGVVSV+NFEDLVEKDFSFLDSDDFSS EE+D
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLK--RPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYD

Query:  QKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELS
        +KIRRIISAGE+ ESSQVMVSI SEGFVDQLF  AP RSLLV+HDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAIFG LS
Subjt:  QKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELS

Query:  KRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
        +RCV GARLVISHP+G+ ALE+E+QQFPDVVVS LPD+MTLQK AADHS DLTEF+D++GFYLA+LKF+KDRS
Subjt:  KRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS

A0A5D3DP12 Uncharacterized protein1.9e-11377.29Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLK--RPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYD
        MNSL LHSSL PSLT  P    LK  RP+ +C   GSH + RIF+ S+S +P+LH+S     +STPSNEGVVSV+NFEDLVEKDFSFLDSDDFSS EE+D
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLK--RPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYD

Query:  QKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELS
        +KIRRIISAGE+ ESSQVMVSI SEGFVDQLF  AP RSLLV+HDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAIFG LS
Subjt:  QKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELS

Query:  KRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
        +RCV GARLVISHP+G+ ALE+E+QQFPDVVVS LPD+MTLQK AADHS DLTEF+D++GFYLA+LKF+KDRS
Subjt:  KRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS

A0A6J1DD17 uncharacterized protein LOC1110193615.0e-151100Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK
        MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQK

Query:  IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR
        IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR
Subjt:  IRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKR

Query:  CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
        CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS
Subjt:  CVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS

A0A6J1FPV5 uncharacterized protein LOC1114458891.1e-11678.39Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE
        MNSL LHSSLGP L  KPLPC LK P+ +  L G   S RIFSRS    V L PSLH+S+ +TC+STPSNEGVVSVINFEDLVEKDFSFLDSDDF S EE
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE

Query:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE
        +D+KIRRIISAGE+AESSQVMV+I SEGFVD+L+DSAPCRSLLV+HD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFG 
Subjt:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE

Query:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD
        L++RC+PG RLVISHP G+  L++EQQQF DVVVS LPD+ TLQKVAADHS  LTEFVD++GFYLAVLKF+KD
Subjt:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD

A0A6J1I6B9 uncharacterized protein LOC1114700369.5e-11879.12Show/hide
Query:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE
        MNSL LHSSLGP L  KPLPC LKRP+ +  L G   S RIFSRS    V L PSLH+S+ +TC+STPSNEGVVSVINFEDLVEKDFSFLDSDDF S EE
Subjt:  MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRS----VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEE

Query:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE
        +D+KIRRIISAGE+AESSQVMV+I SEGFVD+L+DSAPCRSLLV+HD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFG 
Subjt:  YDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGE

Query:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD
        L++RC+PG RLVISHP G+ AL++EQQQF DVVVS LPD+ TLQKVAADHS  LTEFVD++GFYLAVLKF+KD
Subjt:  LSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein6.2e-6959.21Show/hide
Query:  FSRS-VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLV
        FS+S VS Y +  VS      S+   EG VSV++F    EKD+SFL+S +  S+ E+ QKI RII AGE++ESS+V+VSI SE FVD+L +S+P + LL+
Subjt:  FSRS-VSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQKIRRIISAGEVAESSQVMVSIPSEGFVDQLFDSAPCRSLLV

Query:  LHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQ
        +HDS+ TLACIKEKYDKVKCWQGE+IYVPEKW P D VFLY+LPA+PF+LD +F  LS+RC  GAR+VISHP G+  LE+++++F DVVVS LPD+ TL 
Subjt:  LHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKRCVPGARLVISHPNGKTALEEEQQQFPDVVVSGLPDKMTLQ

Query:  KVAADHSLDLTEFVDDNGFYLAVLKFHK
         VA  HS +LT+FVD+ G YLAVLK  K
Subjt:  KVAADHSLDLTEFVDDNGFYLAVLKFHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTCTTTCCTTGCATTCTTCTCTTGGCCCCTCACTTACTTCAAAGCCTCTGCCATGCTCTCTTAAGAGGCCTCTACCTATTTGTACATTATGTGGCTCTCATCG
CAGTGCTCGGATATTTTCCCGTTCGGTTTCATTGTATCCTTCATTACATGTCAGCAAACCGGTCACCTGTGCTTCAACTCCTTCAAATGAGGGAGTAGTATCTGTGATCA
ATTTTGAAGACTTAGTTGAGAAGGATTTTTCGTTTCTCGATTCAGACGATTTTAGTTCCAGTGAAGAGTATGATCAAAAGATTAGGCGCATTATTTCTGCTGGGGAGGTT
GCAGAAAGTTCTCAGGTTATGGTTTCCATTCCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTCAGCTCCTTGCCGAAGTTTACTTGTCCTCCATGATTCTATTCTAAC
GTTAGCTTGTATCAAAGAAAAATATGACAAAGTTAAGTGCTGGCAAGGAGAAGTTATATATGTACCAGAAAAGTGGGGGCCTTTCGATGTTGTGTTTCTCTATTATCTGC
CTGCAATGCCTTTCGAACTCGACGCGATTTTTGGAGAACTCTCAAAACGTTGTGTACCAGGTGCAAGACTAGTTATTAGCCATCCCAATGGAAAGACAGCATTAGAGGAA
GAACAGCAACAGTTCCCAGATGTCGTAGTTTCGGGTTTACCTGATAAGATGACTTTGCAGAAAGTTGCTGCAGATCACTCTCTTGACCTGACTGAATTTGTAGATGATAA
TGGCTTTTATCTTGCAGTTTTGAAGTTCCACAAGGATAGAAGC
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTCTTTCCTTGCATTCTTCTCTTGGCCCCTCACTTACTTCAAAGCCTCTGCCATGCTCTCTTAAGAGGCCTCTACCTATTTGTACATTATGTGGCTCTCATCG
CAGTGCTCGGATATTTTCCCGTTCGGTTTCATTGTATCCTTCATTACATGTCAGCAAACCGGTCACCTGTGCTTCAACTCCTTCAAATGAGGGAGTAGTATCTGTGATCA
ATTTTGAAGACTTAGTTGAGAAGGATTTTTCGTTTCTCGATTCAGACGATTTTAGTTCCAGTGAAGAGTATGATCAAAAGATTAGGCGCATTATTTCTGCTGGGGAGGTT
GCAGAAAGTTCTCAGGTTATGGTTTCCATTCCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTCAGCTCCTTGCCGAAGTTTACTTGTCCTCCATGATTCTATTCTAAC
GTTAGCTTGTATCAAAGAAAAATATGACAAAGTTAAGTGCTGGCAAGGAGAAGTTATATATGTACCAGAAAAGTGGGGGCCTTTCGATGTTGTGTTTCTCTATTATCTGC
CTGCAATGCCTTTCGAACTCGACGCGATTTTTGGAGAACTCTCAAAACGTTGTGTACCAGGTGCAAGACTAGTTATTAGCCATCCCAATGGAAAGACAGCATTAGAGGAA
GAACAGCAACAGTTCCCAGATGTCGTAGTTTCGGGTTTACCTGATAAGATGACTTTGCAGAAAGTTGCTGCAGATCACTCTCTTGACCTGACTGAATTTGTAGATGATAA
TGGCTTTTATCTTGCAGTTTTGAAGTTCCACAAGGATAGAAGC
Protein sequenceShow/hide protein sequence
MNSLSLHSSLGPSLTSKPLPCSLKRPLPICTLCGSHRSARIFSRSVSLYPSLHVSKPVTCASTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSSEEYDQKIRRIISAGEV
AESSQVMVSIPSEGFVDQLFDSAPCRSLLVLHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGELSKRCVPGARLVISHPNGKTALEE
EQQQFPDVVVSGLPDKMTLQKVAADHSLDLTEFVDDNGFYLAVLKFHKDRS