; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003981 (gene) of Snake gourd v1 genome

Gene IDTan0003981
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG07:68425618..68429259
RNA-Seq ExpressionTan0003981
SyntenyTan0003981
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia]2.6e-17873.7Show/hide
Query:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSS WS+I    P+   S MLKDYL DD SSCSSNGFRSFPRR CCATTVRFLLEIDLKVKDS+LTK FLPRT SRKIALSTISTLQ+ASDAVVRA
Subjt:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG
        FK+FPLPS  KPF  RS SRK+ILR FWKK D V  N RR KSFQEFLDEKEPP SR+DSA CTA++V GRNSISSCSN SISWTESEFTSEMIPSSSSG
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG

Query:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI
        +SESCSENDAVK DKDSP NLIGKRDGVTFGKDS EETTTA +AA          RE+ VK W NEEEKEQ SPVSVLDFPFEDEDQ   SSFNCNLHLI
Subjt:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI

Query:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR
        +GKKQKH+ + KRFENG E EP+DLKKRF D+   R+ F  IS+KEHQREQKA ELLKLVKST          TENLLLDFFHEKLEEN++ A     R 
Subjt:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR

Query:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        G+DF++ QVLK TEDWI GDAGE MA GWE PEGR  YIKDME+AG WRS AG++EELAAE   EVW+SL +EL+ID+S
Subjt:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

KAG7016946.1 hypothetical protein SDJN02_22057, partial [Cucurbita argyrosperma subsp. argyrosperma]4.9e-17773.33Show/hide
Query:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSS WS+I    P+   S MLKDYL DD SSCSSNGFRSFPRR CCATTVRFLLEIDLKVKDS+LTK FLPRT SRKIALSTISTLQ+ASDAVVRA
Subjt:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSN-NRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSS
        FK+FPLPS RKPF  RS SRK+ILR FWKK D V  N  RR KSFQEFLDEKEPP SR+DSA CTA++V GRNSISSCSN SISWTESEFTSEMIPSSSS
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSN-NRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSS

Query:  GSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHL
        G+SESCSENDAVK DKDSP NLIGKRDGVTFGKDS EETTTA +AA          RE+ VK W NEEEKEQ SPVSVLDFPFEDEDQ   SSFNCNLHL
Subjt:  GSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHL

Query:  IEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRR
        ++GKKQKH+ + KRFENG E EP+DLKKRF D+    + F  IS+KEHQREQKA ELLKLVKST          TENLLLDFFHEKLEEN++ A     R
Subjt:  IEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRR

Query:  RGSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
         G+DF++ QVLK TEDWI GDAGE MA GWE PEGR  YIKDME+AG WRS AG++EELAAE   EVW+SL +EL+ID+S
Subjt:  RGSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

XP_022922340.1 uncharacterized protein LOC111430353 [Cucurbita moschata]1.7e-17773.49Show/hide
Query:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSS WS+I    P+   S MLKDYL DD SSCSSNGFRSFPRR CCATTVRFLLEIDLKVKDS+LTK FLPRT SRKIALSTISTLQ+ASDAVVRA
Subjt:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG
        FK+FPLPS  KPF  RS SRK+ILR FWKK D V  N RR KSFQEFLDEKEPP SR+DSA CTA++V GRNSISSCSN SISWTESEFTSEMIPSSSSG
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG

Query:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI
        +SESCSENDAVK DKDSP NLIGKRDGVTFGKDS EETTTA +AA          RE+ VK W NEEEKEQ SPVSVLDFPFEDEDQ   SSFNCNLHL+
Subjt:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI

Query:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR
        +GKKQKH+ K KRFENG E EP+DLKKRF D+   R+ F  IS+KE+QREQKA ELLKLVKST          TENLLLDFFHEKLEEN++ A     R 
Subjt:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR

Query:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        G+DF++ QVLK TEDWI GDAGE MA GWE PEGR  YIKDME AG WRS AG++EELAAE   EVW+SL +EL+ID+S
Subjt:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo]3.1e-17974.11Show/hide
Query:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSS WS+I    P+   S MLKDYL DD SSCSSNGFRSFPRR CCATTVRFLLEIDLKVKDSSLTK FLPRT SRKIALSTISTLQ+ASDAVVRA
Subjt:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG
        FK+FPLPS RKPF  RS SRK+ILR FWKK D V  N RR KSFQEFLDEKEPP SR+DSA CTA++V GRNSISSCSN SISWTESEFTSEMIPSSSSG
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG

Query:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI
        +SESCSENDAVK DKDSP NLIGKRDGVTFGKDS EETTTA +AA          RE+ VKQW NEEEKEQ SPVSVLDFPFEDEDQ   SSFNCNLHL+
Subjt:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI

Query:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR
        +GKKQKH+Q+ KRFENG E EP+DL KRF D+   R+ FS IS+KEHQREQKA ELLKLVKST           ENLLLDFFHEKLEEN++TA     R 
Subjt:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR

Query:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        G+DF++ QVLK TEDWI GD GE MA GWE PEGR  YIKDME+AG WRS AG++EELAAE   EVW+SL +EL+ID+S
Subjt:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

XP_038906459.1 uncharacterized protein LOC120092443 [Benincasa hispida]8.2e-18073.01Show/hide
Query:  ASADSSNWSLIMPKP---ARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAF
        AS DSSNW+LI   P    +S MLKDYL DD+SSCSSNGFRSFPRR CC TTVRFLLEIDLKVKD+SLTK FLPRTTSRKIALSTISTLQ+ASDAV+RAF
Subjt:  ASADSSNWSLIMPKP---ARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAF

Query:  KQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPS----RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSS
        KQFPLPSSRK FLPRSISRKLI + FWKKS+ V  N RRWKSF+EFLDEKEPPS    R+DSA CTAI+VAGRNS SSCSN SISWTESEFTSEMIPSSS
Subjt:  KQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPS----RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSS

Query:  SGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAA------NREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIE
        SG+SESCSENDAVKDDKDSP NLIGKRDGV+FGKDS E+TTTA AAA       R++ VKQW N+EEKEQ SPVSVLDFPFEDEDQ ISSSFNCN++L+E
Subjt:  SGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAA------NREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIE

Query:  GKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRG
        GKKQKH +K+KR E G ELEPVDLKKRFTD+    + F+LI+KKEHQ E+KA E LKL+KST          TENLLLDFFH+KLEE+E+TA        
Subjt:  GKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRG

Query:  SDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        SDF++  +LK T++WI GDAG+MM MGWE PE R FYIKDME+AG W SFAG++EEL AE   EVWISL N+L+ID+S
Subjt:  SDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

TrEMBL top hitse value%identityAlignment
A0A0A0KP06 Uncharacterized protein1.1e-17472.19Show/hide
Query:  MASADSSNWSLIMPKP---ARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSSNW+LI   P    +SL+LKDYL DD SSCSSNGFRSFPRR CC+TTVRFLLEIDLKVKDSS+TK FLPRTTSRKIALSTISTLQ+ASDAV+RA
Subjt:  MASADSSNWSLIMPKP---ARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAV-PSNNRRWKSFQEFLDEKEPPS-------RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEM
        FKQFPLPSSRK F PRSISRKLI + F KKSD V P+ N+RWKSF+EFLDEKEPPS        +DSA CTAI+VAGRNSISSCSN SISWTESEFTSE+
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAV-PSNNRRWKSFQEFLDEKEPPS-------RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEM

Query:  IPSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTA----AAAAN----REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFN
        IPSS SG+SESCSENDAVKDDKDSP NLIGKRDGVTFGKDS EETTTA    AAA +    RE++VKQW NEEEKEQFSPVSVLDFPFEDEDQ ISSSFN
Subjt:  IPSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTA----AAAAN----REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFN

Query:  CNLHLIEGKKQK-HSQKTKRFENGAELEPVDLKKRFTDLGHC--RRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENES
        CN+HL+EGKKQK   QKTKR E G ELEPVDLKKRFT++     +  F+LI+KKEHQ E+KALE LKL+KST          TENLLLDFFH+KL+E+E+
Subjt:  CNLHLIEGKKQK-HSQKTKRFENGAELEPVDLKKRFTDLGHC--RRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENES

Query:  TAHESGRRRGSDFEE-QVLKLTEDWIYGDAGEMMAMG-WELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        T+        SDF++ Q+LK  +DWI G+AGE+  MG WELPE RNFYIKDME+   WRSF GD+EEL AE   EVWISLLN+L+ID+S
Subjt:  TAHESGRRRGSDFEE-QVLKLTEDWIYGDAGEMMAMG-WELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

A0A5A7TN51 Uncharacterized protein7.5e-17171.11Show/hide
Query:  MASADSSNWSLIMPKP---ARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSSNW+LI   P    +SL+LKDYL DD SSCSSNGFRSFPRR CC+TTVRFLLEIDLKVKDSS TK FLPRT+SRKIALSTISTLQ+ASDAV+RA
Subjt:  MASADSSNWSLIMPKP---ARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPS-------RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMI
        FKQFPLPSSRK F PRSISRKLI + F KKSD V  NNRRWKSF+EFLDEKEPPS        +DSA CTAI+VAGRNSISSCSN SISWTESEFTSE+I
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPS-------RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMI

Query:  PSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTT-----AAAAANREESVKQW-SNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNL
        PSS SG+SESCSEN AVKDDKDSP NLIGKRDGVTFGKDS EET T     AAA   RE++VK+W +NEEEKEQFSPVSVLDFPFEDEDQ ISSSFNCN+
Subjt:  PSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTT-----AAAAANREESVKQW-SNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNL

Query:  HLIEGKKQK-HSQKTKRFENGAELEPVDLKKRFTD---LGHCRRDFSLISK-KEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENEST
        HL+EGKKQK   QKTKR E G ELEPVDLKKRFT+   +   +  F+LI+K KEHQ E+KALE LKL+KST          TENLLLDFFH+KL+E+E+T
Subjt:  HLIEGKKQK-HSQKTKRFENGAELEPVDLKKRFTD---LGHCRRDFSLISK-KEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENEST

Query:  AHESGRRRGSDFEE-QVLKLTEDWIYGDAGEMMAMG-WELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        +        SDF++ Q+L+  +DW+ G+AGE+  MG WELPE RNFYIKDME+A  WRSF GD+EEL AE   EVWISLL++L+ID+S
Subjt:  AHESGRRRGSDFEE-QVLKLTEDWIYGDAGEMMAMG-WELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

A0A6J1DTT2 uncharacterized protein LOC1110234203.6e-17373.96Show/hide
Query:  MASADSSNWSLIMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFKQ
        MA A SS+WSL+  K +RSLMLKDYLRD++ S SSNGFRSFPRR CC TTVRFLLEIDLKVKDSS    FL RT SRK+ALSTISTLQKAS AVVRAFKQ
Subjt:  MASADSSNWSLIMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFKQ

Query:  FPLPS----SRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPS-----RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIP
        F  PS     RKPFLP    RKL+LR FWKK+D V S  RRWKSFQEFLDEKEPPS     R+DSAACTAI+VAGRNSISSCSN S SWTESEF SEMIP
Subjt:  FPLPS----SRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPS-----RADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIP

Query:  SSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAA--AANREESVK-QWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIE
         SSSG+SES SENDAVKD KDSP N IGKR+GVTFGKDS EETT  AA   A REE VK QW +EEEKEQFSPVSVLDFPF+DEDQ ISSSFNCNLHLIE
Subjt:  SSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAA--AANREESVK-QWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIE

Query:  GKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRG
        GKKQK+  KT+RFENGAELEP+DLKKRF DLG  RRDF LISK EHQRE++ALE+L LVKS I KS+CFI +TENLLLDFFHEKLEE ESTA ESG RRG
Subjt:  GKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRG

Query:  SDFE----EQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDI
         DFE     +VLKLTEDWI G    +M  GWE+ EGR+ Y+ +ME AG WRSFAG++EELA+E+  EVWISLL+EL+ID+
Subjt:  SDFE----EQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDI

A0A6J1E8H2 uncharacterized protein LOC1114303538.2e-17873.49Show/hide
Query:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSS WS+I    P+   S MLKDYL DD SSCSSNGFRSFPRR CCATTVRFLLEIDLKVKDS+LTK FLPRT SRKIALSTISTLQ+ASDAVVRA
Subjt:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG
        FK+FPLPS  KPF  RS SRK+ILR FWKK D V  N RR KSFQEFLDEKEPP SR+DSA CTA++V GRNSISSCSN SISWTESEFTSEMIPSSSSG
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG

Query:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI
        +SESCSENDAVK DKDSP NLIGKRDGVTFGKDS EETTTA +AA          RE+ VK W NEEEKEQ SPVSVLDFPFEDEDQ   SSFNCNLHL+
Subjt:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI

Query:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR
        +GKKQKH+ K KRFENG E EP+DLKKRF D+   R+ F  IS+KE+QREQKA ELLKLVKST          TENLLLDFFHEKLEEN++ A     R 
Subjt:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR

Query:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        G+DF++ QVLK TEDWI GDAGE MA GWE PEGR  YIKDME AG WRS AG++EELAAE   EVW+SL +EL+ID+S
Subjt:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

A0A6J1HZ34 uncharacterized protein LOC1114689623.1e-17773.7Show/hide
Query:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA
        MAS DSS WS+I    P+   S MLKDYL DD SSCSSNGFRSFPRR CCATTVRFLLEIDLKVKDSSLTK FLPRT SRKIALSTISTLQ+ASDAVVRA
Subjt:  MASADSSNWSLIM---PKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRA

Query:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG
        FK+FPLPS RKPF  RS SRK+ILR FWKK D V  N RR KSFQEFLDEKEPP SR+DSA CTA++V GRNSISSCSN SISWTESEFTSE IPSSSSG
Subjt:  FKQFPLPSSRKPFLPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPP-SRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSG

Query:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI
        +SESCSENDAVK DKDSP NLIGKRDGVTFGKDS EETTTA +AA          RE+ VKQW NEEEKEQ SPVSVLDFPFEDEDQ   SSFNCNLHL+
Subjt:  SSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAAN---------REESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLI

Query:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR
        +GKK  H+QK+KRFENG E EP+DLKKRF D+    + FSLIS+KEHQREQKA ELLKLVKST          TENLLLDFFHEKLEEN++ A     R 
Subjt:  EGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRR

Query:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS
        G+D ++ QVLK TEDWI GDAGE M  GWE PEGR  YIKDME+AG WRS  G++EELAAE   EVWISL +EL+ID+S
Subjt:  GSDFEE-QVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLLNELVIDIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein2.2e-1825.43Show/hide
Query:  RSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFKQF---PLPSSRKPFLPRS
        RS MLKD L +D +SCSSNGF+S PRR                        +  P    RK           A  AV+ A K      + S+    LPRS
Subjt:  RSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFKQF---PLPSSRKPFLPRS

Query:  ISRKLILRPFWKKSDAVP----SNNRRWKSFQEFLDEKE--PPSRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSGSSESCSENDAVK
        +SR+L  +   +   ++      +  RW S ++  ++     P +  +   T  + +   S +SCS    SW++ +FTSE +PSS   + E C E  +VK
Subjt:  ISRKLILRPFWKKSDAVP----SNNRRWKSFQEFLDEKE--PPSRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSGSSESCSENDAVK

Query:  DDKDSPANLIGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIEGKKQKHSQKTKRFENGAELE
        ++                G+DS      A      EE++     + EKE  SPVSV +   E+ D+   SSF+  L  +E  KQK  Q  +RFE+ A + 
Subjt:  DDKDSPANLIGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIEGKKQKHSQKTKRFENGAELE

Query:  PVDLKK------------------RFTDLGHCRR-DFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRGS
        P +L +                  ++ D  +C   D     +   + E+KA +L   VK    +        E+L++D+F ++L +  ++ HE+      
Subjt:  PVDLKK------------------RFTDLGHCRR-DFSLISKKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRGS

Query:  DFEEQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLL
         FE Q++   + W+ G     +  G    + R    +++E          +E E+    ++E   SLL
Subjt:  DFEEQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWISLL

AT4G11780.1 unknown protein2.7e-3230.63Show/hide
Query:  RSLMLKDYLRDDMSSCSSNGFRSFPRR--PCCATTVRFLLEIDLK----VKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAF-KQFPLPSS---R
        + L+L+DYL DD+SSCSSNGF+SFPRR  P  ++TVR LL+ ++K    +     TK   PR T R    +  + +  A      AF K  P PSS   +
Subjt:  RSLMLKDYLRDDMSSCSSNGFRSFPRR--PCCATTVRFLLEIDLK----VKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAF-KQFPLPSS---R

Query:  KPFLPRSISRKLILRPFWKKSDAVPSNNRR------------WKS--FQEFLDEK----EPPSRADSAACTAISVAGRNSISSCSNHSISWTESEF----
        +    RS S++L+   FW+K   V   +RR            W+S  ++E LD++       S  D     + S A    +    +   S   SEF    
Subjt:  KPFLPRSISRKLILRPFWKKSDAVPSNNRR------------WKS--FQEFLDEK----EPPSRADSAACTAISVAGRNSISSCSNHSISWTESEF----

Query:  TSEMIPSSSSG--------SSESCSENDAVKDDKDSPANL-IGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVI
        +SE++ SSSS         S E  SE DAV+D K+S  +L     DG +  +++        +  NR+E V      EEKEQ SPVS+L+ PF+D+D+  
Subjt:  TSEMIPSSSSG--------SSESCSENDAVKDDKDSPANL-IGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVI

Query:  SSSFNCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFS--LISKKEHQREQKALELLKLVKSTIAKSQCFIFK--TENLLLDFFHEK
          +   + +      +K ++K++R      LEP+DL KR       + ++S   +  +E + E +A  L  LVK  I ++   +     +NLLLD+  E 
Subjt:  SSSFNCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFS--LISKKEHQREQKALELLKLVKSTIAKSQCFIFK--TENLLLDFFHEK

Query:  LEENESTAHESGRRRGSDFEEQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDE-EELAAEVVDEVWISLLNELVIDI
                       G   E  ++K  EDW+ G   EM  M WE+   R  Y+K+M+    W    GDE E +  E+ +  + S ++E + D+
Subjt:  LEENESTAHESGRRRGSDFEEQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDE-EELAAEVVDEVWISLLNELVIDI

AT4G23020.1 unknown protein2.0e-2729.61Show/hide
Query:  MASADSSNWSL-IMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFK
        M S  SS+  L +  K  + L+L+D+L DD+SSCSSNGF+SFPR          LL  +++                    L+    + KAS A++ A K
Subjt:  MASADSSNWSL-IMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFK

Query:  QFPLPSS-RKPFLPRSISRKLILRPFWKK----------------SDAVPSNNRRWKSFQEFLDEKEPPSRADSAACTAISVAGRNSISSCSNHSISWTE
          P PSS +     R   + L  R FWKK                +D      +R +SF EFL E +      S     IS     S  +  +       
Subjt:  QFPLPSS-RKPFLPRSISRKLILRPFWKK----------------SDAVPSNNRRWKSFQEFLDEKEPPSRADSAACTAISVAGRNSISSCSNHSISWTE

Query:  SEFTSE--MIPSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSF
        S F+SE   +  SSSG        D V       ++L           D+TEE                    EEKEQ SP+S+LD PF+D+      + 
Subjt:  SEFTSE--MIPSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSF

Query:  NCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFS--LISKKEHQREQKALELLKLVKSTIAKSQCFIFKT---ENLLLDFFHEKLEE
        +   H  E  ++K  +K +R E+   LEPVDL+KR       R+D+   +I  +E Q E +A  L  LVKS I + Q  +  +   +N+LLDFF E    
Subjt:  NCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFS--LISKKEHQREQKALELLKLVKSTIAKSQCFIFKT---ENLLLDFFHEKLEE

Query:  NESTAHESGRRRGSDFEEQVLKLTEDWIY--GDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWI-SLLNELVIDIS
        N  T            E++++++ E+W+    D    M M W++ E R  Y+K+M+    W    GDE+E   E +   ++ SL++EL+ DIS
Subjt:  NESTAHESGRRRGSDFEEQVLKLTEDWIY--GDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWI-SLLNELVIDIS

AT4G23020.2 unknown protein1.3e-2629.41Show/hide
Query:  MASADSSNWSL-IMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFK
        M S  SS+  L +  K  + L+L+D+L DD+SSCSSNGF+SFPR          LL  +++                    L+    + KAS A++ A K
Subjt:  MASADSSNWSL-IMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFK

Query:  QFPLPSS-RKPFLPRSISRKLILRPFWKK----------------SDAVPSNNRRWKSFQEFLDEKEPPSRADSAACTAISVAGRNSISSCSNHSISWTE
          P PSS +     R   + L  R FWKK                +D      +R +SF EFL E +      S     IS     S  +  +       
Subjt:  QFPLPSS-RKPFLPRSISRKLILRPFWKK----------------SDAVPSNNRRWKSFQEFLDEKEPPSRADSAACTAISVAGRNSISSCSNHSISWTE

Query:  SEFTSE--MIPSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSF
        S F+SE   +  SSSG        D V       ++L    +    G +S       +        +K     EEKEQ SP+S+LD PF+D+      + 
Subjt:  SEFTSE--MIPSSSSGSSESCSENDAVKDDKDSPANLIGKRDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSF

Query:  NCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFS--LISKKEHQREQKALELLKLVKSTIAKSQCFIFKT---ENLLLDFFHEKLEE
        +   H  E  ++K  +K +R E+   LEPVDL+KR       R+D+   +I  +E Q E +A  L  LVKS I + Q  +  +   +N+LLDFF E    
Subjt:  NCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFS--LISKKEHQREQKALELLKLVKSTIAKSQCFIFKT---ENLLLDFFHEKLEE

Query:  NESTAHESGRRRGSDFEEQVLKLTEDWIY--GDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWI-SLLNELVIDIS
        N  T            E++++++ E+W+    D    M M W++ E R  Y+K+M+    W    GDE+E   E +   ++ SL++EL+ DIS
Subjt:  NESTAHESGRRRGSDFEEQVLKLTEDWIY--GDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGDEEELAAEVVDEVWI-SLLNELVIDIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCGCTGATTCTTCTAATTGGAGCCTCATCATGCCCAAACCTGCTAGGTCATTGATGCTCAAGGATTATCTTCGAGACGATATGAGTTCTTGCTCCTCCAATGG
CTTCCGATCGTTTCCACGACGTCCATGCTGCGCCACAACCGTCCGATTTCTCCTCGAAATCGATCTCAAAGTCAAAGATTCTTCCCTAACTAAATCATTCCTTCCTCGAA
CCACCTCCAGAAAAATCGCTCTCTCCACCATCTCCACTTTGCAAAAGGCCTCCGACGCCGTTGTCAGAGCCTTCAAGCAATTTCCCTTGCCTTCTTCTCGGAAGCCGTTT
TTGCCGAGGAGTATTTCGCGGAAACTGATTCTCAGACCGTTTTGGAAGAAATCGGATGCCGTTCCTTCCAACAACAGACGGTGGAAATCGTTTCAGGAATTTCTCGATGA
GAAAGAACCGCCGTCGCGCGCCGATTCCGCCGCTTGCACCGCCATTTCCGTCGCTGGAAGAAACTCGATCAGTAGCTGTAGTAATCATAGCATCAGTTGGACGGAGAGCG
AATTTACATCGGAGATGATTCCGTCGTCTTCCAGCGGTAGTTCCGAGAGTTGCAGCGAAAACGACGCCGTCAAGGACGATAAGGATTCACCTGCTAATCTCATCGGCAAA
AGAGATGGCGTAACGTTCGGAAAAGATTCCACCGAAGAAACAACCACCGCCGCCGCCGCCGCCAACCGGGAGGAATCCGTTAAGCAATGGTCAAATGAAGAAGAAAAGGA
ACAGTTCAGTCCCGTTTCAGTGTTGGATTTTCCTTTCGAAGACGAAGATCAAGTCATCTCCTCATCTTTCAATTGTAACCTTCACCTCATCGAAGGTAAGAAGCAAAAAC
ATTCTCAGAAGACGAAGCGATTCGAGAACGGAGCAGAATTGGAACCTGTAGACTTAAAGAAACGATTCACAGATTTAGGCCATTGCCGTCGCGATTTCAGCTTAATATCC
AAAAAAGAACACCAAAGGGAACAGAAGGCATTGGAGCTTTTAAAGCTCGTCAAATCAACAATAGCCAAATCCCAATGCTTCATTTTCAAAACAGAGAATCTGCTGCTCGA
TTTCTTCCACGAGAAGCTCGAAGAAAACGAATCAACTGCACACGAAAGTGGCCGTCGAAGAGGATCTGATTTCGAAGAACAGGTTTTGAAATTAACAGAAGATTGGATCT
ATGGGGACGCCGGAGAAATGATGGCCATGGGATGGGAGTTGCCGGAGGGACGGAATTTCTACATTAAGGATATGGAGATGGCCGGAATGTGGAGAAGTTTCGCCGGAGAT
GAGGAAGAATTGGCGGCGGAGGTTGTAGATGAGGTTTGGATTTCTTTGCTTAATGAGCTAGTAATTGACATCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCGCTGATTCTTCTAATTGGAGCCTCATCATGCCCAAACCTGCTAGGTCATTGATGCTCAAGGATTATCTTCGAGACGATATGAGTTCTTGCTCCTCCAATGG
CTTCCGATCGTTTCCACGACGTCCATGCTGCGCCACAACCGTCCGATTTCTCCTCGAAATCGATCTCAAAGTCAAAGATTCTTCCCTAACTAAATCATTCCTTCCTCGAA
CCACCTCCAGAAAAATCGCTCTCTCCACCATCTCCACTTTGCAAAAGGCCTCCGACGCCGTTGTCAGAGCCTTCAAGCAATTTCCCTTGCCTTCTTCTCGGAAGCCGTTT
TTGCCGAGGAGTATTTCGCGGAAACTGATTCTCAGACCGTTTTGGAAGAAATCGGATGCCGTTCCTTCCAACAACAGACGGTGGAAATCGTTTCAGGAATTTCTCGATGA
GAAAGAACCGCCGTCGCGCGCCGATTCCGCCGCTTGCACCGCCATTTCCGTCGCTGGAAGAAACTCGATCAGTAGCTGTAGTAATCATAGCATCAGTTGGACGGAGAGCG
AATTTACATCGGAGATGATTCCGTCGTCTTCCAGCGGTAGTTCCGAGAGTTGCAGCGAAAACGACGCCGTCAAGGACGATAAGGATTCACCTGCTAATCTCATCGGCAAA
AGAGATGGCGTAACGTTCGGAAAAGATTCCACCGAAGAAACAACCACCGCCGCCGCCGCCGCCAACCGGGAGGAATCCGTTAAGCAATGGTCAAATGAAGAAGAAAAGGA
ACAGTTCAGTCCCGTTTCAGTGTTGGATTTTCCTTTCGAAGACGAAGATCAAGTCATCTCCTCATCTTTCAATTGTAACCTTCACCTCATCGAAGGTAAGAAGCAAAAAC
ATTCTCAGAAGACGAAGCGATTCGAGAACGGAGCAGAATTGGAACCTGTAGACTTAAAGAAACGATTCACAGATTTAGGCCATTGCCGTCGCGATTTCAGCTTAATATCC
AAAAAAGAACACCAAAGGGAACAGAAGGCATTGGAGCTTTTAAAGCTCGTCAAATCAACAATAGCCAAATCCCAATGCTTCATTTTCAAAACAGAGAATCTGCTGCTCGA
TTTCTTCCACGAGAAGCTCGAAGAAAACGAATCAACTGCACACGAAAGTGGCCGTCGAAGAGGATCTGATTTCGAAGAACAGGTTTTGAAATTAACAGAAGATTGGATCT
ATGGGGACGCCGGAGAAATGATGGCCATGGGATGGGAGTTGCCGGAGGGACGGAATTTCTACATTAAGGATATGGAGATGGCCGGAATGTGGAGAAGTTTCGCCGGAGAT
GAGGAAGAATTGGCGGCGGAGGTTGTAGATGAGGTTTGGATTTCTTTGCTTAATGAGCTAGTAATTGACATCTCTTAGTTTCTTATTCTTCTTGTTATTATTGAGGGGGA
AAAAATGTAGCTAATTGGCAGCCAAATCTACTTCTCTAATTAAACAAAAATGGAAAAGAAACTAAAATTTTAGTTTAAATTATGCTCGTGTAGCGATTGTAATAGGAGTA
AGTAGGTTTGGCTGCTGATTAGTTTTAAAAAAATTTCAATTGAAT
Protein sequenceShow/hide protein sequence
MASADSSNWSLIMPKPARSLMLKDYLRDDMSSCSSNGFRSFPRRPCCATTVRFLLEIDLKVKDSSLTKSFLPRTTSRKIALSTISTLQKASDAVVRAFKQFPLPSSRKPF
LPRSISRKLILRPFWKKSDAVPSNNRRWKSFQEFLDEKEPPSRADSAACTAISVAGRNSISSCSNHSISWTESEFTSEMIPSSSSGSSESCSENDAVKDDKDSPANLIGK
RDGVTFGKDSTEETTTAAAAANREESVKQWSNEEEKEQFSPVSVLDFPFEDEDQVISSSFNCNLHLIEGKKQKHSQKTKRFENGAELEPVDLKKRFTDLGHCRRDFSLIS
KKEHQREQKALELLKLVKSTIAKSQCFIFKTENLLLDFFHEKLEENESTAHESGRRRGSDFEEQVLKLTEDWIYGDAGEMMAMGWELPEGRNFYIKDMEMAGMWRSFAGD
EEELAAEVVDEVWISLLNELVIDIS