; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029628 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029628
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold2:21165295..21169854
RNA-Seq ExpressionSpg029628
SyntenySpg029628
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW14425.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.2e-8342.41Show/hide
Query:  KVDCSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIH
        +V C PM  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+  +    EV+ G+FS+S+ 
Subjt:  KVDCSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIH

Query:  FSLADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSM
        FSL      W++ VYG N    RK FW EL ++  +  P W +GGDFN+ R +SEK   S+  T +MR F+ FI    L D PL N  FTWS+ + +P  
Subjt:  FSLADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSM

Query:  SLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVF
          + R+L S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F +    WW      GW GH F+++L+ +K +LK+WN   F
Subjt:  SLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVF

Query:  GQQSDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK
        G+  +K+ ++  +L N D  E+ G ++   +S+R   K +L  ++  EEI WRQK+K+KW  EGD N+ F+H+ +AN RR +
Subjt:  GQQSDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK

RVW14425.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.9e-1330.46Show/hide
Query:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDEEDSWSWQLGNIDSFTTGSL
        VG G +  FWED+W G   L  +YP L+ + + K   I+ +  PS    WNL+ RRNL+++EI                  ED              G +
Subjt:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDEEDSWSWQLGNIDSFTTGSL

Query:  TKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLF
            +S SP    + +  +W   +P +VK F+W ++H  +NT D++Q R P  +LSP  C++C K    Q   F
Subjt:  TKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLF

RVW14425.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.7e-8342.48Show/hide
Query:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL
        C PM  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+  +    EV+ G+FS+S+ FSL
Subjt:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL

Query:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI
              W++ VYG N    RK FW EL ++  +  P W +GGDFN+ R +SEK   S+  T +MR F+ FI    L D PL N  FTWS+ + +P    +
Subjt:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI

Query:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ
         R+L S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F +    WW      GW GH F+++L+ +K +LK+WN   FG+ 
Subjt:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ

Query:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK
         +K+ ++  +L N D  E+ G ++   +S+R   K +L  ++  EEI WRQK+K+KW  EGD N+ F+H+ +AN RR +
Subjt:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK

RVW45068.1 putative ribonuclease H protein [Vitis vinifera]7.8e-2233.51Show/hide
Query:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDE-EDSWSWQLGNIDSFTTGS
        VG G +  FWED+W G   L T+YP L+ + + K   I+ +  PS    WNL+ RRNL+++EI +   L   L +  +     D+  W L +   F+  S
Subjt:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDE-EDSWSWQLGNIDSFTTGS

Query:  LTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLFSSCDFASEFWSHL
            L+ SS +        +W   +P +VK F+W ++H  +NT D++Q R P  +LSP  C++C K  E+  HLF  C      W  L
Subjt:  LTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLFSSCDFASEFWSHL

RVW77758.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.5e-8442.64Show/hide
Query:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG
        M  ISWN+RGLGS  KR ++KDFL   NP +V+ QETK    DR+ + S+WS RN  WA L A G  GGI IIW+       EV+ G+FS+S+ F L   
Subjt:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG

Query:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY
           W++ VYG N    RK FW ELS+L  +  P+W +GGDFN+ R  SEK   S R T +MR F+ FI  + L D PL N  FTWS+ + +P    + R+
Subjt:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY

Query:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK
        L S+   + F  +    L R  SDH+PI L     +WGPTPFRF N WL H  F ++  SWW+     GW GH F++KL+ +K +LK WN N FG   ++
Subjt:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK

Query:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKKPEALWRKIIATKYGVARD
        + +++ E+ NID  E+ G +S   ++ R   K +L  ++  EEI W+QK+K+KW  EGD N+  FH+ +AN RR K    + KI+  + G+  D
Subjt:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKKPEALWRKIIATKYGVARD

RVW94236.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.2e-8442.74Show/hide
Query:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL
        C PM  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+  + S  EV+ G+FS+S+ FSL
Subjt:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL

Query:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI
              W++ VYG N    RK FW EL ++  +  P W +GGDFN+ R +SEK   S+  T +MR F+ FI    L D PL N  FTWS+ + +P    +
Subjt:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI

Query:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ
         R+L S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F +    WW      GW GH F+++L+ +K +LK+WN   FG+ 
Subjt:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ

Query:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK
         +K+ ++  +L N D  E+ G ++   +S+R   K +L  ++  EEI WRQK+K+KW  EGD N+ F+H+ +AN RR +
Subjt:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]3.1e-9546.67Show/hide
Query:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG
        M F++WNVRGL SWKK ALIK F+S  NP++VILQETKL+ +D  ++KS+WS+  I W++LDA G + GI I+WN+      E+I+G FSL+I+F L+DG
Subjt:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG

Query:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY
        F FWV+G+YG + +    LFW+EL +L  +C  +WI+ GDFN+TRW+ EKS+     T++M  FN FIE ++L D+PL NG+ TWS    N S SLI  +
Subjt:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY

Query:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK
        LL++    K G    +++ R  SDHFPI L  G+  WG TPFRF N WL+H+TF   +E+WW    L GWPGH  + KLK LK  +K W    F     +
Subjt:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK

Query:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK
        + +L   + ++D  E +  ++      R + K DL+++VA EE  WRQ+ K KW  EGD NT FFHR +AN RR+
Subjt:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK

TrEMBL top hitse value%identityAlignment
A0A438BTW6 LINE-1 retrotransposable element ORF2 protein5.9e-8442.41Show/hide
Query:  KVDCSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIH
        +V C PM  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+  +    EV+ G+FS+S+ 
Subjt:  KVDCSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIH

Query:  FSLADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSM
        FSL      W++ VYG N    RK FW EL ++  +  P W +GGDFN+ R +SEK   S+  T +MR F+ FI    L D PL N  FTWS+ + +P  
Subjt:  FSLADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSM

Query:  SLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVF
          + R+L S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F +    WW      GW GH F+++L+ +K +LK+WN   F
Subjt:  SLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVF

Query:  GQQSDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK
        G+  +K+ ++  +L N D  E+ G ++   +S+R   K +L  ++  EEI WRQK+K+KW  EGD N+ F+H+ +AN RR +
Subjt:  GQQSDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK

A0A438BTW6 LINE-1 retrotransposable element ORF2 protein1.9e-1330.46Show/hide
Query:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDEEDSWSWQLGNIDSFTTGSL
        VG G +  FWED+W G   L  +YP L+ + + K   I+ +  PS    WNL+ RRNL+++EI                  ED              G +
Subjt:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDEEDSWSWQLGNIDSFTTGSL

Query:  TKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLF
            +S SP    + +  +W   +P +VK F+W ++H  +NT D++Q R P  +LSP  C++C K    Q   F
Subjt:  TKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLF

A0A438BTW6 LINE-1 retrotransposable element ORF2 protein1.3e-8342.48Show/hide
Query:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL
        C PM  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+  +    EV+ G+FS+S+ FSL
Subjt:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL

Query:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI
              W++ VYG N    RK FW EL ++  +  P W +GGDFN+ R +SEK   S+  T +MR F+ FI    L D PL N  FTWS+ + +P    +
Subjt:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI

Query:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ
         R+L S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F +    WW      GW GH F+++L+ +K +LK+WN   FG+ 
Subjt:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ

Query:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK
         +K+ ++  +L N D  E+ G ++   +S+R   K +L  ++  EEI WRQK+K+KW  EGD N+ F+H+ +AN RR +
Subjt:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK

A0A438EBM8 Putative ribonuclease H protein3.8e-2233.51Show/hide
Query:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDE-EDSWSWQLGNIDSFTTGS
        VG G +  FWED+W G   L T+YP L+ + + K   I+ +  PS    WNL+ RRNL+++EI +   L   L +  +     D+  W L +   F+  S
Subjt:  VGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNG-SWNLHLRRNLNENEILEWAILSHHLTNFSIKDE-EDSWSWQLGNIDSFTTGS

Query:  LTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLFSSCDFASEFWSHL
            L+ SS +        +W   +P +VK F+W ++H  +NT D++Q R P  +LSP  C++C K  E+  HLF  C      W  L
Subjt:  LTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLFSSCDFASEFWSHL

A0A438GZW0 LINE-1 retrotransposable element ORF2 protein1.2e-8442.64Show/hide
Query:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG
        M  ISWN+RGLGS  KR ++KDFL   NP +V+ QETK    DR+ + S+WS RN  WA L A G  GGI IIW+       EV+ G+FS+S+ F L   
Subjt:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG

Query:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY
           W++ VYG N    RK FW ELS+L  +  P+W +GGDFN+ R  SEK   S R T +MR F+ FI  + L D PL N  FTWS+ + +P    + R+
Subjt:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY

Query:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK
        L S+   + F  +    L R  SDH+PI L     +WGPTPFRF N WL H  F ++  SWW+     GW GH F++KL+ +K +LK WN N FG   ++
Subjt:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK

Query:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKKPEALWRKIIATKYGVARD
        + +++ E+ NID  E+ G +S   ++ R   K +L  ++  EEI W+QK+K+KW  EGD N+  FH+ +AN RR K    + KI+  + G+  D
Subjt:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKKPEALWRKIIATKYGVARD

A0A438IBZ1 LINE-1 retrotransposable element ORF2 protein3.5e-8442.74Show/hide
Query:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL
        C PM  ISWNVRGLGS  KR ++KDFL S NP +V++QETK    DR+ + S+W+ RN  W +L A G SGGI IIW+  + S  EV+ G+FS+S+ FSL
Subjt:  CSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSL

Query:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI
              W++ VYG N    RK FW EL ++  +  P W +GGDFN+ R +SEK   S+  T +MR F+ FI    L D PL N  FTWS+ + +P    +
Subjt:  ADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLI

Query:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ
         R+L S+   + F       L R  SDH+PI +      WGPTPFRF N WL H  F +    WW      GW GH F+++L+ +K +LK+WN   FG+ 
Subjt:  GRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQ

Query:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK
         +K+ ++  +L N D  E+ G ++   +S+R   K +L  ++  EEI WRQK+K+KW  EGD N+ F+H+ +AN RR +
Subjt:  SDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRKK

A0A6J1E2G6 uncharacterized protein LOC1110254051.5e-9546.67Show/hide
Query:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG
        M F++WNVRGL SWKK ALIK F+S  NP++VILQETKL+ +D  ++KS+WS+  I W++LDA G + GI I+WN+      E+I+G FSL+I+F L+DG
Subjt:  MIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRKLIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADG

Query:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY
        F FWV+G+YG + +    LFW+EL +L  +C  +WI+ GDFN+TRW+ EKS+     T++M  FN FIE ++L D+PL NG+ TWS    N S SLI  +
Subjt:  FSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSASTRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRY

Query:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK
        LL++    K G    +++ R  SDHFPI L  G+  WG TPFRF N WL+H+TF   +E+WW    L GWPGH  + KLK LK  +K W    F     +
Subjt:  LLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDK

Query:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK
        + +L   + ++D  E +  ++      R + K DL+++VA EE  WRQ+ K KW  EGD NT FFHR +AN RR+
Subjt:  RNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.4e-1427.24Show/hide
Query:  IIGGDFNLTRWTSEKSSA--STRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFR-PNPSMSLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTL
        I+ GDF+    TS+  S   ++   R +  F   +  + L DIP     +TWS+ +  NP +  + R + + +    F SA        +SDH P  + L
Subjt:  IIGGDFNLTRWTSEKSSA--STRQTRAMRRFNRFIETTALQDIPLANGKFTWSSFR-PNPSMSLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTL

Query:  -GKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDKRNNLNQELLNIDKKEENGLISELDISRRTE-
            +     FR+ +   TH TFL ++   W+    +G       + LK  KK  K  N   FG    K     + L +++  +   L +  D   R E 
Subjt:  -GKERWGPTPFRFINAWLTHRTFLQTVESWWQTNSLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDKRNNLNQELLNIDKKEENGLISELDISRRTE-

Query:  -IKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK
          +       A  E  +RQKS++KW  +GD NT FFH+++  N+ K
Subjt:  -IKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-0424.18Show/hide
Query:  ASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLFSSCDFASEFWSHLQHAFGWLFARPGDIYTL
        A + P     L +R+W  P+  ++K F+W     ++ T + +  R  G  + PS C  CH+ +E+  H   +C FA+  W     +         D    
Subjt:  ASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASETQAHLFSSCDFASEFWSHLQHAFGWLFARPGDIYTL

Query:  LS--LSVCGSPFTKDKKLIWQIFLYAFLWNLWLERNARVFTDRHQSISSFIES
        +S  L+        D   +  ++L   +W +W  RN  VF    +S S  + S
Subjt:  LS--LSVCGSPFTKDKKLIWQIFLYAFLWNLWLERNARVFTDRHQSISSFIES

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-1025.29Show/hide
Query:  VGKGNKTLFWEDIWLGSSSLMTKYPSL--YNLSLKKEAFIADLWNPSNGSWNLHLRRNLNENEILEWAILSHHLTNFSIKDE---EDSWSWQL---GNID
        +G G    FW D W     L+T   +     L ++++A + +     NG W L   R+ N    L        LT   +  E   +DS+ W+      + 
Subjt:  VGKGNKTLFWEDIWLGSSSLMTKYPSL--YNLSLKKEAFIADLWNPSNGSWNLHLRRNLNENEILEWAILSHHLTNFSIKDE---EDSWSWQL---GNID

Query:  SFTTGSLTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLS-PSCCVMCHKASETQAHLFSSCDFASEFWSHLQHAFG
        SF++    +++   SPT        +W            W      + T D    R  G  ++ PS  V+C    ET AHLF  C F+   W      F 
Subjt:  SFTTGSLTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLS-PSCCVMCHKASETQAHLFSSCDFASEFWSHLQHAFG

Query:  WLFARPGDIYTLLSLS--VCGSPFTKDKKLIWQIFLYAFLWNLWLERNARVFTDRHQSISS
            RP   + L + S  +   P       I ++ L + ++++W ERNAR+FT    S SS
Subjt:  WLFARPGDIYTLLSLS--VCGSPFTKDKKLIWQIFLYAFLWNLWLERNARVFTDRHQSISS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGCCCACACGCCGCCACTGTAAAAATTGATACAATTGGGGAAACATCTCAGAGCCCGTACAGACATGACTCCCAAAAAGCCCCCATTGATGTATTGTTCACATC
GACTGATCTGACGTCTACTTTATTGACAGAGGGGGGCCCACAGATACCAGCAGCCACAGATGAGCCTCAATCACCAAAATCCTATCCAGAAAGCCACCCGATTATAACCC
CCAATAAAGCCCTATCTAATGACCGTAATCAGAAAGGCCCACCCAACCCATCATCCCCATCTGGCCCCACTCACAAAGGCCCAACCCACCATCCCCTTAAAAACAAAAAG
CCCATCATTATCAATGATAAAAAAACCTACCTCCTCACCGACAATACCCTTATTGAGATTGAAGTGGAAGAAGATGACGAAGAGGGACAGAACACAGAGGACACAAGTAT
AGACCCAGCCGCCTATCTTCCCATTATTTTCCCTTGGTTGACAGAGCATGGCATGACGCCTTATGGAAAGGTCGACTGCAGCCCTATGATTTTTATCTCGTGGAATGTTA
GAGGATTGGGATCATGGAAGAAGAGAGCTCTTATCAAAGATTTTCTCTCCTCTCATAATCCATCTTTGGTGATTCTTCAAGAGACCAAGCTGGCCAAGATTGACAGGAAG
TTGATTAAATCTATATGGAGTTCCAGAAACATTGTCTGGGCTTCTCTTGATGCTGAGGGAACATCAGGTGGCATAGCCATTATCTGGAATGAATCATCCTTCTCGGTGCT
TGAGGTTATTAAAGGTACTTTCTCGTTATCAATTCATTTTTCTCTTGCTGATGGCTTTTCCTTCTGGGTTACAGGGGTGTATGGCCTAAACATATCTCGTGATAGGAAAT
TATTTTGGCGAGAACTATCCAATTTGCAAATGATGTGTCTTCCCAACTGGATTATTGGAGGCGATTTTAACTTAACTAGATGGACTTCGGAAAAATCTTCCGCTTCTACT
AGGCAGACCCGGGCCATGAGGAGATTTAATCGCTTCATTGAAACAACAGCATTACAGGACATCCCCCTCGCTAATGGTAAATTCACATGGTCTAGCTTCAGGCCGAATCC
CTCCATGTCCCTTATCGGCAGATACTTGCTCTCAGATAACATTCCTGTTAAATTCGGATCAGCCTCTGTTCGTAAGCTTGAGAGACCCATTTCTGACCACTTCCCTATAT
GTCTCACTTTGGGAAAGGAACGATGGGGACCAACTCCATTCAGATTCATCAATGCTTGGCTTACCCATAGAACCTTTCTACAAACTGTAGAATCTTGGTGGCAGACAAAC
TCCTTGTTAGGGTGGCCTGGACACGATTTCATTCAAAAGCTAAAAGGCCTAAAAAAAGAACTGAAACAGTGGAACCACAATGTTTTTGGTCAGCAATCTGATAAGAGGAA
CAACCTTAATCAGGAGCTTTTGAACATAGACAAGAAAGAGGAAAATGGCCTTATATCTGAATTAGACATCTCTAGAAGGACAGAGATAAAGGCTGACTTGATCAACATAG
TAGCAACCGAGGAGATTATCTGGCGCCAAAAAAGCAAATTAAAATGGTTCTTGGAAGGAGATGTCAACACAACTTTCTTCCACCGACTAATGGCTAACAACAGAAGGAAA
AAACCGGAGGCTTTATGGAGGAAAATCATCGCTACAAAATATGGGGTGGCTCGAGATCCCCTAAAGTTGGGTAACCACTCCATTGGCTCCTCCAAAGGACCATGGAAAGC
TATCCACAGCTTGCAACATTTTATCTATGATAACATTGATTCCAGAGTTGGGAAGGGTAATAAAACTCTTTTTTGGGAGGATATTTGGCTTGGCTCATCATCTCTTATGA
CTAAATACCCATCTCTGTATAACTTATCTCTAAAAAAAGAAGCATTCATTGCTGATCTATGGAATCCTAGTAATGGGTCTTGGAACCTTCATTTGAGGCGAAATCTCAAT
GAGAATGAAATCCTTGAATGGGCCATTTTATCTCATCATCTTACCAACTTCTCCATCAAAGACGAGGAAGACTCATGGAGTTGGCAGCTCGGAAACATTGACAGCTTCAC
CACGGGATCTCTCACGAAAAAATTGGCTTCCTCCTCTCCCACAACCGGAACAACATTATATAGTAGACTGTGGAAAGGTCCTATGCCGAAAGAGGTTAAATTCTTCATAT
GGGAACTCAGTCATGCTAGCATCAACACCGCAGACGTCATTCAGAAAAGATGTCCTGGAAATAGTCTTTCTCCTAGCTGCTGTGTCATGTGCCACAAAGCTAGCGAGACC
CAAGCTCATCTATTCAGTAGCTGCGATTTTGCATCAGAATTTTGGTCTCATCTCCAGCATGCTTTTGGGTGGCTTTTTGCTCGTCCGGGTGACATCTACACCCTTCTCTC
TTTATCTGTTTGTGGATCTCCATTTACAAAAGATAAAAAGCTGATATGGCAGATTTTTTTGTATGCTTTCCTATGGAATCTTTGGCTGGAAAGAAACGCTAGAGTCTTTA
CTGATAGGCATCAAAGCATCAGTTCTTTTATAGAATCCACTACTTATTTGGCCTTATATTGGAGTAGACACACCCCCCTATTCTGTAATTACTCCCTATCTTCCCTATTG
ACCCATTGGAGATGTCTTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGCCCACACGCCGCCACTGTAAAAATTGATACAATTGGGGAAACATCTCAGAGCCCGTACAGACATGACTCCCAAAAAGCCCCCATTGATGTATTGTTCACATC
GACTGATCTGACGTCTACTTTATTGACAGAGGGGGGCCCACAGATACCAGCAGCCACAGATGAGCCTCAATCACCAAAATCCTATCCAGAAAGCCACCCGATTATAACCC
CCAATAAAGCCCTATCTAATGACCGTAATCAGAAAGGCCCACCCAACCCATCATCCCCATCTGGCCCCACTCACAAAGGCCCAACCCACCATCCCCTTAAAAACAAAAAG
CCCATCATTATCAATGATAAAAAAACCTACCTCCTCACCGACAATACCCTTATTGAGATTGAAGTGGAAGAAGATGACGAAGAGGGACAGAACACAGAGGACACAAGTAT
AGACCCAGCCGCCTATCTTCCCATTATTTTCCCTTGGTTGACAGAGCATGGCATGACGCCTTATGGAAAGGTCGACTGCAGCCCTATGATTTTTATCTCGTGGAATGTTA
GAGGATTGGGATCATGGAAGAAGAGAGCTCTTATCAAAGATTTTCTCTCCTCTCATAATCCATCTTTGGTGATTCTTCAAGAGACCAAGCTGGCCAAGATTGACAGGAAG
TTGATTAAATCTATATGGAGTTCCAGAAACATTGTCTGGGCTTCTCTTGATGCTGAGGGAACATCAGGTGGCATAGCCATTATCTGGAATGAATCATCCTTCTCGGTGCT
TGAGGTTATTAAAGGTACTTTCTCGTTATCAATTCATTTTTCTCTTGCTGATGGCTTTTCCTTCTGGGTTACAGGGGTGTATGGCCTAAACATATCTCGTGATAGGAAAT
TATTTTGGCGAGAACTATCCAATTTGCAAATGATGTGTCTTCCCAACTGGATTATTGGAGGCGATTTTAACTTAACTAGATGGACTTCGGAAAAATCTTCCGCTTCTACT
AGGCAGACCCGGGCCATGAGGAGATTTAATCGCTTCATTGAAACAACAGCATTACAGGACATCCCCCTCGCTAATGGTAAATTCACATGGTCTAGCTTCAGGCCGAATCC
CTCCATGTCCCTTATCGGCAGATACTTGCTCTCAGATAACATTCCTGTTAAATTCGGATCAGCCTCTGTTCGTAAGCTTGAGAGACCCATTTCTGACCACTTCCCTATAT
GTCTCACTTTGGGAAAGGAACGATGGGGACCAACTCCATTCAGATTCATCAATGCTTGGCTTACCCATAGAACCTTTCTACAAACTGTAGAATCTTGGTGGCAGACAAAC
TCCTTGTTAGGGTGGCCTGGACACGATTTCATTCAAAAGCTAAAAGGCCTAAAAAAAGAACTGAAACAGTGGAACCACAATGTTTTTGGTCAGCAATCTGATAAGAGGAA
CAACCTTAATCAGGAGCTTTTGAACATAGACAAGAAAGAGGAAAATGGCCTTATATCTGAATTAGACATCTCTAGAAGGACAGAGATAAAGGCTGACTTGATCAACATAG
TAGCAACCGAGGAGATTATCTGGCGCCAAAAAAGCAAATTAAAATGGTTCTTGGAAGGAGATGTCAACACAACTTTCTTCCACCGACTAATGGCTAACAACAGAAGGAAA
AAACCGGAGGCTTTATGGAGGAAAATCATCGCTACAAAATATGGGGTGGCTCGAGATCCCCTAAAGTTGGGTAACCACTCCATTGGCTCCTCCAAAGGACCATGGAAAGC
TATCCACAGCTTGCAACATTTTATCTATGATAACATTGATTCCAGAGTTGGGAAGGGTAATAAAACTCTTTTTTGGGAGGATATTTGGCTTGGCTCATCATCTCTTATGA
CTAAATACCCATCTCTGTATAACTTATCTCTAAAAAAAGAAGCATTCATTGCTGATCTATGGAATCCTAGTAATGGGTCTTGGAACCTTCATTTGAGGCGAAATCTCAAT
GAGAATGAAATCCTTGAATGGGCCATTTTATCTCATCATCTTACCAACTTCTCCATCAAAGACGAGGAAGACTCATGGAGTTGGCAGCTCGGAAACATTGACAGCTTCAC
CACGGGATCTCTCACGAAAAAATTGGCTTCCTCCTCTCCCACAACCGGAACAACATTATATAGTAGACTGTGGAAAGGTCCTATGCCGAAAGAGGTTAAATTCTTCATAT
GGGAACTCAGTCATGCTAGCATCAACACCGCAGACGTCATTCAGAAAAGATGTCCTGGAAATAGTCTTTCTCCTAGCTGCTGTGTCATGTGCCACAAAGCTAGCGAGACC
CAAGCTCATCTATTCAGTAGCTGCGATTTTGCATCAGAATTTTGGTCTCATCTCCAGCATGCTTTTGGGTGGCTTTTTGCTCGTCCGGGTGACATCTACACCCTTCTCTC
TTTATCTGTTTGTGGATCTCCATTTACAAAAGATAAAAAGCTGATATGGCAGATTTTTTTGTATGCTTTCCTATGGAATCTTTGGCTGGAAAGAAACGCTAGAGTCTTTA
CTGATAGGCATCAAAGCATCAGTTCTTTTATAGAATCCACTACTTATTTGGCCTTATATTGGAGTAGACACACCCCCCTATTCTGTAATTACTCCCTATCTTCCCTATTG
ACCCATTGGAGATGTCTTTTGTAA
Protein sequenceShow/hide protein sequence
MNSPHAATVKIDTIGETSQSPYRHDSQKAPIDVLFTSTDLTSTLLTEGGPQIPAATDEPQSPKSYPESHPIITPNKALSNDRNQKGPPNPSSPSGPTHKGPTHHPLKNKK
PIIINDKKTYLLTDNTLIEIEVEEDDEEGQNTEDTSIDPAAYLPIIFPWLTEHGMTPYGKVDCSPMIFISWNVRGLGSWKKRALIKDFLSSHNPSLVILQETKLAKIDRK
LIKSIWSSRNIVWASLDAEGTSGGIAIIWNESSFSVLEVIKGTFSLSIHFSLADGFSFWVTGVYGLNISRDRKLFWRELSNLQMMCLPNWIIGGDFNLTRWTSEKSSAST
RQTRAMRRFNRFIETTALQDIPLANGKFTWSSFRPNPSMSLIGRYLLSDNIPVKFGSASVRKLERPISDHFPICLTLGKERWGPTPFRFINAWLTHRTFLQTVESWWQTN
SLLGWPGHDFIQKLKGLKKELKQWNHNVFGQQSDKRNNLNQELLNIDKKEENGLISELDISRRTEIKADLINIVATEEIIWRQKSKLKWFLEGDVNTTFFHRLMANNRRK
KPEALWRKIIATKYGVARDPLKLGNHSIGSSKGPWKAIHSLQHFIYDNIDSRVGKGNKTLFWEDIWLGSSSLMTKYPSLYNLSLKKEAFIADLWNPSNGSWNLHLRRNLN
ENEILEWAILSHHLTNFSIKDEEDSWSWQLGNIDSFTTGSLTKKLASSSPTTGTTLYSRLWKGPMPKEVKFFIWELSHASINTADVIQKRCPGNSLSPSCCVMCHKASET
QAHLFSSCDFASEFWSHLQHAFGWLFARPGDIYTLLSLSVCGSPFTKDKKLIWQIFLYAFLWNLWLERNARVFTDRHQSISSFIESTTYLALYWSRHTPLFCNYSLSSLL
THWRCLL