; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023419 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023419
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationscaffold13:9194942..9220135
RNA-Seq ExpressionSpg023419
SyntenySpg023419
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.3e-2434.39Show/hide
Query:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV L ++K+  S   +  ++          SIK L RYV   +  +D +++ +S +IFG  K +YL  NDIMQ+C+M+EI  +C+  YIA+ W 
Subjt:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
         +E       F ++    I+P   + E   R+L     ++   ++V IPY  G HW+L ++N+ +N VY+L SL   + +D + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]1.3e-2434.39Show/hide
Query:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV L ++K+  S   +  ++          SIK L RYV   +  +D +++ +S +IFG  K +YL  NDIMQ+C+M+EI  +C+  YIA+ W 
Subjt:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
         +E       F ++    I+P   + E   R+L     ++   ++V IPY  G HW+L ++N+ +N VY+L SL   + +D + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]1.3e-2434.39Show/hide
Query:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV L ++K+  S   +  ++          SIK L RYV   +  +D +++ +S +IFG  K +YL  NDIMQ+C+M+EI  +C+  YIA+ W 
Subjt:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
         +E       F ++    I+P   + E   R+L     ++   ++V IPY  G HW+L ++N+ +N VY+L SL   + +D + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]3.5e-2234.74Show/hide
Query:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV   K+K +   T  KS   SS  T    +IK L RY +  +  DD +Q+ +S++I G  KT+YL  +DI+Q+C M EI  +C+  YIA  W 
Subjt:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGN-HWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
        +  ++     F ++    I+      E  +++L     ++   ++V IPYN G+ HWIL ++N+ +N VY++ SL   +L++ + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGN-HWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]4.1e-2334.92Show/hide
Query:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV   K+K +   T  KS   SS  T    +IK L RY +  +  DD +Q+ +S++I G  KT+YL  +DI+Q+C M EI  +C+  YIA  W 
Subjt:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
        +  ++     F ++    I+      E  +++L     ++   ++V IPYN G HWIL ++N+ +N VY++ SL   +L++ + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X14.2e-2134.04Show/hide
Query:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV + K+K +   T  +S+  SS  T    +IK L RY ++ +  +D +Q+ +S+ IFG  KT+YL  +DI+Q+C M EI  +C+  YIA  W 
Subjt:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGN-HWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLN
        +  E+     F ++    I+    + E  +R+L     +    ++V IPYN G  HWIL ++++ +N VY++  L   +L + + V+N
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGN-HWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLN

A0A5D3CYL9 ULP_PROTEASE domain-containing protein4.2e-2134.04Show/hide
Query:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV + K+K +   T  +S+  SS  T    +IK L RY ++ +  +D +Q+ +S+ IFG  KT+YL  +DI+Q+C M EI  +C+  YIA  W 
Subjt:  VAWPRALVALCKDKDSKHKTMIKSSFPSSATTP--PSIKFLYRY-VEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGN-HWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLN
        +  E+     F ++    I+    + E  +R+L     +    ++V IPYN G  HWIL ++++ +N VY++  L   +L + + V+N
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGN-HWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLN

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X16.2e-2534.39Show/hide
Query:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV L ++K+  S   +  ++          SIK L RYV   +  +D +++ +S +IFG  K +YL  NDIMQ+C+M+EI  +C+  YIA+ W 
Subjt:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
         +E       F ++    I+P   + E   R+L     ++   ++V IPY  G HW+L ++N+ +N VY+L SL   + +D + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X46.2e-2534.39Show/hide
Query:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV L ++K+  S   +  ++          SIK L RYV   +  +D +++ +S +IFG  K +YL  NDIMQ+C+M+EI  +C+  YIA+ W 
Subjt:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
         +E       F ++    I+P   + E   R+L     ++   ++V IPY  G HW+L ++N+ +N VY+L SL   + +D + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X26.2e-2534.39Show/hide
Query:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP
        VAWPR LV L ++K+  S   +  ++          SIK L RYV   +  +D +++ +S +IFG  K +YL  NDIMQ+C+M+EI  +C+  YIA+ W 
Subjt:  VAWPRALVALCKDKD--SKHKTMIKSSFPSSATTPPSIKFLYRYVE-KLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWP

Query:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT
         +E       F ++    I+P   + E   R+L     ++   ++V IPY  G HW+L ++N+ +N VY+L SL   + +D + V+NT+
Subjt:  HFEETGRLDMFKVMVSNDIAPMFGTPEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCCGAAGAAGGGGAAGAGGAGGAAACACCCTTGGTCCTTAAGCCGAAAGCAATGAGAATTAGGGAAAGGGAAGAAGCAAAAAAAGAAAGTGGGAGTGAAACCGC
AAACATAGCCGAGGCAGAAGAGGCAATGCCAGGTCTTGAGGCACTAAGCAGAAAAGATGCGCTCGCATCTAGGGTGCAAATGCAGTTTGTGCCCGCTTATGAGGGCCAAT
CTATAGCTTGTACAGTGGAGTATGCAACCGCAAATGTGGCGCAAACAAAGAAAGAACCAGAGATTGCACCAGTTCCTGAGGTGCAATCTAATGTGCAAATCCCACTACCA
CCAAAAGAGGAAGAAAATGTGCTCGAGGACCTGTTCGAGCACCAGAATGAGGCAATGTTCAAGGACTTGTTCAAACATGAGAAAGAAAAGGAGCAGCAAGCTCCAACTGA
TGAAGATGAGGTCCAAGAGATTTCTAAGGACCAAATGCCCAAGAGGGGCAGAAAAGGAACTGACAGTGAAGCCGAAGGGAAGGCTCAGAAGAAGAAAAGAACAAACATTT
CCCTGAGGAGGTCGAGCACTAGGCGAGTGACTTCTGTGCTCCCCCAAGCCAAGACACCATCACCACCACCACCTATAAGAAGAGAACCATCTGTTCCTCGACCAAAGAAG
GCTCCAACAAAGCAGTTGAGAGTGGAAGTTCGAGTGGAGACCCATCCAGACACTCAGACTCGTCTGGCCATTATGAAGAAACAGGTTCTGATGTGTGAGAAGGCCTTCTC
TACTATGACGGACCCTCTCCCAACATTCATTGAAAACATTCTACGCAAGTATGGTGGGATCAACTGTGCAAGGAGCCGAAAGCAGCTTCAATCCCTCTGGTCCATTCTAG
AAGTCGAGTCGTTACAGAGTGTAGAGATTTTCCTGTTGCAACGCTACTTGATTCTGTCAAAGGTTTATTGCAAAGGTGGTTTTATGAGCGAGGTTGATCCTATTAGTAAT
GTACAATTCAAAGTAATTGACAGAGATAACTATTTTATTGTGAAGTTGGATTTGAAATCATGTAGTTGTCATGTTTGGGATCTTGATGAAATCTCATGTGCTCATGCACT
TGCTGTTCTTCGTGGGCGGAATTCGAATACTTATTCTTTTGTCTTAGATTATTACTTTTCAAGAATATTGCCTTGGAACGGATTGTGGGATGATCCTTATTTGATAGGAA
TTGGAAAGGAAAGCTTGAAGTTCAAGCTTGAACAATGGCTTTTGGGACACTCCACTATAGCCCGTAGTGCTGCAGTTGCTTGGCCTCGTGCTTTGGTTGCTCTATGTAAA
GATAAGGACTCGAAACATAAAACAATGATAAAATCCTCATTTCCTAGTTCGGCAACCACACCTCCATCTATCAAATTCCTGTATCGTTATGTTGAAAAGCTATACAAGGA
TGATCCGATGCAAGTGCCCATCAGCGATGAGATATTTGGAGCAAGCAAAACATTATATCTCATGCCCAATGATATAATGCAATTTTGTAGTATGGTCGAGATATCAAATA
CCTGTGTATTTGTCTATATTGCGTTCTTTTGGCCGCATTTTGAGGAGACTGGTAGACTAGACATGTTTAAGGTCATGGTCTCAAACGACATTGCACCGATGTTTGGGACG
CCAGAAAAATGTGCAAGAAGTTTAACTACCGTCTTTTCTTTACTACAATCGAGGAAAATGGTATTTATTCCATATAATCCTGGGAATCACTGGATATTGTGTGTTGTGAA
TGTAAGTGACAATACTGTTTATCTATTGGGCTCCTTACATCCTAGTCTTTTGGATGACCTCAAACATGTGTTAAACACGACAGCGTCTCGACGCTGTCGACAGAATTCCT
ATAAATGCAATCTTTTAAGCTACAGCTACGGAGAGTCCATTTGGTCTTTAACAAAGGGTCCTACCCTCTCACCGGCACAAAAGACGTTTCTGTTTATTAGTTGGACCATA
AACAGGTTGTTCATTAGAGTAGTACTGGTACTTAAGGATATAGAGGTCCCACTGGTAGCTCATAAAGGCCCATTCCCCCCGTCGGAGTTCACGTCGTGCCGTCGCCGCCG
TTCGAGCCCTCCGTCGTCGCCTTTCCGTCCAGCCTTCGCCGTAGGTCTCAGCACCGCCGTCTCTTCGCCTAATTCGTGGGTTTTGTGGTTCGGTTTGCCCTCTCTCCGCG
CCATCACTGTGAAGCTCGAGTCTTCGCGCTGCTCTATCTTCCACGTTTCGCTGTATTCGCGCCGTCTCTCTCTGTCTTTGCGTGTTGGAAATTCTAAGATAGCATTGAAA
TTCTATTCTGTAGCGCCGTCTAAGTGTTCGATTGAGTTCGAATCACTTAAATTCGATCACCCACCGCCCAAGGAGCGTTGTAACACGCTGTTCGAGGTTATGTTGCCTGT
GAGTAAAGTTGCATGGTCTTGTGTGCTCATTGAGAGATATGGGTTGATTGTGAATTCTTCTATTTCTCCTCTAGTGCCTCAAGCAGTGTTGACATTGGATGCATTGCAAG
CTATGACTGACAATGCAATCCAAAGCAACTTAAAGCACCTTGGTGTGAACCAAGTCCTTGCTCATAGACGAGAAGCTCAGTTCTTCCAAAGCTTCAAGGAAGCAAGGTCT
CCATCGTTCGGTGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCCGAAGAAGGGGAAGAGGAGGAAACACCCTTGGTCCTTAAGCCGAAAGCAATGAGAATTAGGGAAAGGGAAGAAGCAAAAAAAGAAAGTGGGAGTGAAACCGC
AAACATAGCCGAGGCAGAAGAGGCAATGCCAGGTCTTGAGGCACTAAGCAGAAAAGATGCGCTCGCATCTAGGGTGCAAATGCAGTTTGTGCCCGCTTATGAGGGCCAAT
CTATAGCTTGTACAGTGGAGTATGCAACCGCAAATGTGGCGCAAACAAAGAAAGAACCAGAGATTGCACCAGTTCCTGAGGTGCAATCTAATGTGCAAATCCCACTACCA
CCAAAAGAGGAAGAAAATGTGCTCGAGGACCTGTTCGAGCACCAGAATGAGGCAATGTTCAAGGACTTGTTCAAACATGAGAAAGAAAAGGAGCAGCAAGCTCCAACTGA
TGAAGATGAGGTCCAAGAGATTTCTAAGGACCAAATGCCCAAGAGGGGCAGAAAAGGAACTGACAGTGAAGCCGAAGGGAAGGCTCAGAAGAAGAAAAGAACAAACATTT
CCCTGAGGAGGTCGAGCACTAGGCGAGTGACTTCTGTGCTCCCCCAAGCCAAGACACCATCACCACCACCACCTATAAGAAGAGAACCATCTGTTCCTCGACCAAAGAAG
GCTCCAACAAAGCAGTTGAGAGTGGAAGTTCGAGTGGAGACCCATCCAGACACTCAGACTCGTCTGGCCATTATGAAGAAACAGGTTCTGATGTGTGAGAAGGCCTTCTC
TACTATGACGGACCCTCTCCCAACATTCATTGAAAACATTCTACGCAAGTATGGTGGGATCAACTGTGCAAGGAGCCGAAAGCAGCTTCAATCCCTCTGGTCCATTCTAG
AAGTCGAGTCGTTACAGAGTGTAGAGATTTTCCTGTTGCAACGCTACTTGATTCTGTCAAAGGTTTATTGCAAAGGTGGTTTTATGAGCGAGGTTGATCCTATTAGTAAT
GTACAATTCAAAGTAATTGACAGAGATAACTATTTTATTGTGAAGTTGGATTTGAAATCATGTAGTTGTCATGTTTGGGATCTTGATGAAATCTCATGTGCTCATGCACT
TGCTGTTCTTCGTGGGCGGAATTCGAATACTTATTCTTTTGTCTTAGATTATTACTTTTCAAGAATATTGCCTTGGAACGGATTGTGGGATGATCCTTATTTGATAGGAA
TTGGAAAGGAAAGCTTGAAGTTCAAGCTTGAACAATGGCTTTTGGGACACTCCACTATAGCCCGTAGTGCTGCAGTTGCTTGGCCTCGTGCTTTGGTTGCTCTATGTAAA
GATAAGGACTCGAAACATAAAACAATGATAAAATCCTCATTTCCTAGTTCGGCAACCACACCTCCATCTATCAAATTCCTGTATCGTTATGTTGAAAAGCTATACAAGGA
TGATCCGATGCAAGTGCCCATCAGCGATGAGATATTTGGAGCAAGCAAAACATTATATCTCATGCCCAATGATATAATGCAATTTTGTAGTATGGTCGAGATATCAAATA
CCTGTGTATTTGTCTATATTGCGTTCTTTTGGCCGCATTTTGAGGAGACTGGTAGACTAGACATGTTTAAGGTCATGGTCTCAAACGACATTGCACCGATGTTTGGGACG
CCAGAAAAATGTGCAAGAAGTTTAACTACCGTCTTTTCTTTACTACAATCGAGGAAAATGGTATTTATTCCATATAATCCTGGGAATCACTGGATATTGTGTGTTGTGAA
TGTAAGTGACAATACTGTTTATCTATTGGGCTCCTTACATCCTAGTCTTTTGGATGACCTCAAACATGTGTTAAACACGACAGCGTCTCGACGCTGTCGACAGAATTCCT
ATAAATGCAATCTTTTAAGCTACAGCTACGGAGAGTCCATTTGGTCTTTAACAAAGGGTCCTACCCTCTCACCGGCACAAAAGACGTTTCTGTTTATTAGTTGGACCATA
AACAGGTTGTTCATTAGAGTAGTACTGGTACTTAAGGATATAGAGGTCCCACTGGTAGCTCATAAAGGCCCATTCCCCCCGTCGGAGTTCACGTCGTGCCGTCGCCGCCG
TTCGAGCCCTCCGTCGTCGCCTTTCCGTCCAGCCTTCGCCGTAGGTCTCAGCACCGCCGTCTCTTCGCCTAATTCGTGGGTTTTGTGGTTCGGTTTGCCCTCTCTCCGCG
CCATCACTGTGAAGCTCGAGTCTTCGCGCTGCTCTATCTTCCACGTTTCGCTGTATTCGCGCCGTCTCTCTCTGTCTTTGCGTGTTGGAAATTCTAAGATAGCATTGAAA
TTCTATTCTGTAGCGCCGTCTAAGTGTTCGATTGAGTTCGAATCACTTAAATTCGATCACCCACCGCCCAAGGAGCGTTGTAACACGCTGTTCGAGGTTATGTTGCCTGT
GAGTAAAGTTGCATGGTCTTGTGTGCTCATTGAGAGATATGGGTTGATTGTGAATTCTTCTATTTCTCCTCTAGTGCCTCAAGCAGTGTTGACATTGGATGCATTGCAAG
CTATGACTGACAATGCAATCCAAAGCAACTTAAAGCACCTTGGTGTGAACCAAGTCCTTGCTCATAGACGAGAAGCTCAGTTCTTCCAAAGCTTCAAGGAAGCAAGGTCT
CCATCGTTCGGTGGTTAA
Protein sequenceShow/hide protein sequence
MEAEEGEEEETPLVLKPKAMRIREREEAKKESGSETANIAEAEEAMPGLEALSRKDALASRVQMQFVPAYEGQSIACTVEYATANVAQTKKEPEIAPVPEVQSNVQIPLP
PKEEENVLEDLFEHQNEAMFKDLFKHEKEKEQQAPTDEDEVQEISKDQMPKRGRKGTDSEAEGKAQKKKRTNISLRRSSTRRVTSVLPQAKTPSPPPPIRREPSVPRPKK
APTKQLRVEVRVETHPDTQTRLAIMKKQVLMCEKAFSTMTDPLPTFIENILRKYGGINCARSRKQLQSLWSILEVESLQSVEIFLLQRYLILSKVYCKGGFMSEVDPISN
VQFKVIDRDNYFIVKLDLKSCSCHVWDLDEISCAHALAVLRGRNSNTYSFVLDYYFSRILPWNGLWDDPYLIGIGKESLKFKLEQWLLGHSTIARSAAVAWPRALVALCK
DKDSKHKTMIKSSFPSSATTPPSIKFLYRYVEKLYKDDPMQVPISDEIFGASKTLYLMPNDIMQFCSMVEISNTCVFVYIAFFWPHFEETGRLDMFKVMVSNDIAPMFGT
PEKCARSLTTVFSLLQSRKMVFIPYNPGNHWILCVVNVSDNTVYLLGSLHPSLLDDLKHVLNTTASRRCRQNSYKCNLLSYSYGESIWSLTKGPTLSPAQKTFLFISWTI
NRLFIRVVLVLKDIEVPLVAHKGPFPPSEFTSCRRRRSSPPSSPFRPAFAVGLSTAVSSPNSWVLWFGLPSLRAITVKLESSRCSIFHVSLYSRRLSLSLRVGNSKIALK
FYSVAPSKCSIEFESLKFDHPPPKERCNTLFEVMLPVSKVAWSCVLIERYGLIVNSSISPLVPQAVLTLDALQAMTDNAIQSNLKHLGVNQVLAHRREAQFFQSFKEARS
PSFGG